Usage

Description

Questions and aswers related to FreeLing usage

How to add new possible PoS tags?

Submitted by Xsoto on Thu, 11/16/2017 - 15:11
Forums

Hi,

I'm trying to adapt Freeling 4.0 so it can analyze domain specific dictionary, identified by specific PoS tags. To do so, I have modified the file "tagset.dat" including the new codes; for instance, to recognize the PoS tag "NC00SC0" I have changed the following line on "tagset.dat":

N 2 noun type/C:common;P:proper gen/F:feminine;M:masculine;C:common num/S:singular;P:plural;N:invariable neclass/S:person;G:location;O:organization;V:other nesubclass/0:0;P:0 degree/V:evaluative

For this one:

How to analyze an already tokenized file

Submitted by andres on Tue, 11/07/2017 - 20:40
Forums

I have already tried:

analyze --inplv token -f ca.cfg < orig.txt > target.txt
analyze --inplv splitted -f ca.cfg < orig.txt > target.txt

And also changing at the ca.cfg file the original "InputLevel=text" to "InputLevel=token"
But it alwasys says:
Error - 'text' input format only accepts input analysis level 'text'.

Freeling tagset, some values are not documented

Submitted by David on Fri, 10/13/2017 - 13:37
Forums

Hello,
We have found for some french documents we got the tag PPSCNN0, which is not documented
since the value 'S' not exist 'Person' in the category 'Pronoun' (P) according to
https://talp-upc.gitbooks.io/freeling-user-manual/content/tagsets/tagse…

The same happens for the tag DF0CN0, 'F' is not in the possible type values for a
determiner.

Language detection

Submitted by andres on Fri, 07/21/2017 - 10:17
Forums

Hi there, at the old forum you answered a question on this topic with the code below, so my question is, where do I get the analyzer.php file, or.... can you share the code of your demo version....
I tried this code already unsuccessfully though..

Thanks!

<code>
include("analyzer.php");

// Adjust this path to your local FreeLing installation
$FL_DIR = "/usr/local";

train-nerc

Submitted by tmyapple on Tue, 04/25/2017 - 06:52
Forums

Hi,
I'm trying to train a NER model using the train-nerc directory and the demo data that exist in train-nerc/corpus
I'm following the scripts, and encountered several problems:
1. corpus/bin/extract-gaz.sh (is being called from prepare-corpus) - is stuck inside the second loop, looks like the achieved ratio doesn't achieve the goal ratio - i suspect that it is due to the small volume of the demo set. Should it work fine and something else is wrong on my side?

pt NER extraction

Submitted by break on Wed, 04/12/2017 - 16:57
Forums

Hello Freeling friends, i have a little problem running NER / NEC.
I'm trying to get detections about persons, organizations, places, dates etc, with a given plain text. As far as i know the text must be sentence splited (?not sure) and tokenized (one word per line), however i'm trying to run analyze with a plain text that have nouns, places, organizations etc, and see whats the output, but i got an error on np.dat file, maybe i'm calling it wrongly.
The config file has NER and NEC enabled: