Linguistic Data


Questions and answers related to FreeLing linguistic data

Version of sense dictionary for Spanish?

Submitted by ulrike.henny on Tue, 12/14/2021 - 15:41

I have been using FreeLing for some time now to annotate literary texts in Spanish. Thank you very much for this great and very useful tool!
Today I have a question about the version of the sense dictionary for Spanish which is used in FreeLing. In the documentation I found the information that the sense dictionary from the MRC 3.0 project is used. However, I noticed that some words that I would have expected to be annotated with sense information are not.

Verbs suddenly missing in Freeling dictionary (PT)?

Submitted by pbholmen on Wed, 12/09/2020 - 19:39


I'm using the Portuguese Freeling dictionary data file for a tool for personal use. I just upgraded Freeling using Homebrew on macOS, the current version is stated as 4.2_1

A very common verb, "resolver" is suddenly missing. I looked in the GitHub repo, and see that it is also missing in FreeLing/data/pt/dictionary/entries/verbs.

Error in the Portuguese dictionary?

Submitted by pbholmen on Wed, 12/09/2020 - 13:04


I think there might be an error in the dictionary found under:

Line 232708 reads:

dar dar VMN0000 dar VMN01S0 dar VMN03S0 dar VMSF1S0 dar VMSF3S0

I believe this means that "dar" is supposed to be the future subjunctive of the verb "dar" for third and first person singular. This would be the case if the verb was regular, however it is irregular. Here is an authoritative source for the correct conjugation:

Lemas file

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:21

Dear all,

Do you know how to get, from a dicc.dat/dicc.txt file, a file with all the lemas and all the tokens forms, in oder to load it in a concordance program such as AntConc?

casa -> casa, cases
menjar -> menjo, menjava, menjaré, mengin...

Could yo recommend me some kind of instruction, such as awk...?

Personal articles in Catalan: "En Pere" & "na Maria"

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:14

Dear all,

Any idea or suggestion for modifying some catalan FreeLing language DAT file(s) in order to get the proper tokenization and labeling for the personal articles before a proper a name in Catalan?

For instance: "en Pere" should be labeled "en DA0MS | Pere NP00SP0", not as a preposicional phrase ("en SP | Pere NP00SP0"). And "na Maria" should be tokenized in two words and tagged such as "na DA0FS | Maria_NP, not as "na_Maria_NP00SP0" (with a "_" merging the article and the proper name).

Demo is not working

Submitted by Sara_Saad on Tue, 07/24/2018 - 15:12

Hi All,

Please I need your kind assistance as the online demo is not working. I have about 5 days trying and I always have the below error message:

Fatal error: Uncaught Exception: String could not be parsed as XML in /home/operador/public_html/freeling-8.5.0/demo/demo.php:265 Stack trace: #0 /home/operador/public_html/freeling-8.5.0/demo/demo.php(265): SimpleXMLElement->__construct('') #1 {main} thrown in /home/operador/public_html/freeling-8.5.0/demo/demo.php on line 265

would you please advise?

Thanks you!

Example 07: Extracting Triples with Semantic Information

Submitted by kashyap on Mon, 07/09/2018 - 14:51

As a part of my current task, I am trying to feed(/extracted) knowledge(/triples) from Freeling to the Ontology. I did go through Freeling tutorials and I did try to run the example 7(…) but when I run the programme it does gives below error:

Different behavior with Catalan numbers

Submitted by jmfc90 on Fri, 02/10/2017 - 13:03

I am having some troubles working with numbers in Catalan. In your demo page I use the text "tres mil" with Catalan language and using the default configuration and I get the following output:


But If I use the text "nou mil" with exactly the same configuration I get the output:

nou mil
nou 1000