Linguistic Data


Questions and answers related to FreeLing linguistic data

Autodetect language

Submitted by xdavid on Fri, 11/11/2016 - 09:37

Hello, I'm procesing text in server with "analyzer_client myoutput" only with spanish text for the moment, but I should do it for all languages and analyze if someone speaks in English, French, Spanish or someother. I see in the documentation "lang_ident" and "lang "", but i don't know use that. Can you provide me any help for do it?

Thank you.

incorrect language on Semantic Graph Frame lemma

Submitted by carlesg on Fri, 07/29/2016 - 14:37


When I use the SemanticGraph to ask for the lemmas of the Frames on the Graph, I get the lemma in English, although I configure Freeling in Spanish, and I ask a question in Spanish.
If I try the Spanish sentence 'Dime el valor del coche.', the tagger says:

-------- TAGGER results -----------
Di decir VMM02S0
me me PP1CS00
el el DA0MS0
valor valor NCMS000
de de SP
el el DA0MS0
coche coche NCMS000
. . Fp

Domain specific NLP

Submitted by flopezbello on Tue, 07/05/2016 - 20:51

Is it possible to train FreeLing in specific domains? More precisely, we are thinking of processing medical (health and genomics) text, which includes specific terminology as genes, fenotypes (or health conditions / traits) and proteins.

Also, it would be very interesting to add (besides specific dictionaries) onthologies that pertain to these domains.

We can assume that English is the base language.


Information about DepTreeler labels for spanish

Submitted by jlarteaga on Fri, 06/17/2016 - 06:00

Hi! I was looking for some descriptions of the tags used in the DepTreeler module for spanish since the documentation only offers explanation for the Txala parser. Searching through other posts someone adviced trying to deduce their meaning out from context, but considering my lack of serious knowledge in linguistics at least a little guidance would be really appreciated...
Thanks in advance!