Questions and aswers related to FreeLing usage

How to add new possible PoS tags?

Submitted by Xsoto on Thu, 11/16/2017 - 15:11


I'm trying to adapt Freeling 4.0 so it can analyze domain specific dictionary, identified by specific PoS tags. To do so, I have modified the file "tagset.dat" including the new codes; for instance, to recognize the PoS tag "NC00SC0" I have changed the following line on "tagset.dat":

N 2 noun type/C:common;P:proper gen/F:feminine;M:masculine;C:common num/S:singular;P:plural;N:invariable neclass/S:person;G:location;O:organization;V:other nesubclass/0:0;P:0 degree/V:evaluative

For this one:

How to analyze an already tokenized file

Submitted by andres on Tue, 11/07/2017 - 20:40

I have already tried:

analyze --inplv token -f ca.cfg < orig.txt > target.txt
analyze --inplv splitted -f ca.cfg < orig.txt > target.txt

And also changing at the ca.cfg file the original "InputLevel=text" to "InputLevel=token"
But it alwasys says:
Error - 'text' input format only accepts input analysis level 'text'.

Freeling tagset, some values are not documented

Submitted by David on Fri, 10/13/2017 - 13:37

We have found for some french documents we got the tag PPSCNN0, which is not documented
since the value 'S' not exist 'Person' in the category 'Pronoun' (P) according to…

The same happens for the tag DF0CN0, 'F' is not in the possible type values for a

Language detection

Submitted by andres on Fri, 07/21/2017 - 10:17

Hi there, at the old forum you answered a question on this topic with the code below, so my question is, where do I get the analyzer.php file, or.... can you share the code of your demo version....
I tried this code already unsuccessfully though..



// Adjust this path to your local FreeLing installation
$FL_DIR = "/usr/local";


Submitted by tmyapple on Tue, 04/25/2017 - 06:52

I'm trying to train a NER model using the train-nerc directory and the demo data that exist in train-nerc/corpus
I'm following the scripts, and encountered several problems:
1. corpus/bin/ (is being called from prepare-corpus) - is stuck inside the second loop, looks like the achieved ratio doesn't achieve the goal ratio - i suspect that it is due to the small volume of the demo set. Should it work fine and something else is wrong on my side?

pt NER extraction

Submitted by break on Wed, 04/12/2017 - 16:57

Hello Freeling friends, i have a little problem running NER / NEC.
I'm trying to get detections about persons, organizations, places, dates etc, with a given plain text. As far as i know the text must be sentence splited (?not sure) and tokenized (one word per line), however i'm trying to run analyze with a plain text that have nouns, places, organizations etc, and see whats the output, but i got an error on np.dat file, maybe i'm calling it wrongly.
The config file has NER and NEC enabled: