Linguistic Data

Description

Questions and answers related to FreeLing linguistic data

Version of sense dictionary for Spanish?

Submitted by ulrike.henny on Tue, 12/14/2021 - 15:41

Hello,
I have been using FreeLing for some time now to annotate literary texts in Spanish. Thank you very much for this great and very useful tool!
Today I have a question about the version of the sense dictionary for Spanish which is used in FreeLing. In the documentation I found the information that the sense dictionary from the MRC 3.0 project is used. However, I noticed that some words that I would have expected to be annotated with sense information are not.

Verbs suddenly missing in Freeling dictionary (PT)?

Submitted by pbholmen on Wed, 12/09/2020 - 19:39

Hi

I'm using the Portuguese Freeling dictionary data file for a tool for personal use. I just upgraded Freeling using Homebrew on macOS, the current version is stated as 4.2_1

A very common verb, "resolver" is suddenly missing. I looked in the GitHub repo, and see that it is also missing in FreeLing/data/pt/dictionary/entries/verbs.

Error in the Portuguese dictionary?

Submitted by pbholmen on Wed, 12/09/2020 - 13:04

Hi

I think there might be an error in the dictionary found under:
share/freeling/pt/dicc.src

Line 232708 reads:

dar dar VMN0000 dar VMN01S0 dar VMN03S0 dar VMSF1S0 dar VMSF3S0

I believe this means that "dar" is supposed to be the future subjunctive of the verb "dar" for third and first person singular. This would be the case if the verb was regular, however it is irregular. Here is an authoritative source for the correct conjugation:

Lemas file

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:21

Dear all,

Do you know how to get, from a dicc.dat/dicc.txt file, a file with all the lemas and all the tokens forms, in oder to load it in a concordance program such as AntConc?

casa -> casa, cases
menjar -> menjo, menjava, menjaré, mengin...

Could yo recommend me some kind of instruction, such as awk...?

Personal articles in Catalan: "En Pere" & "na Maria"

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:14

Dear all,

Any idea or suggestion for modifying some catalan FreeLing language DAT file(s) in order to get the proper tokenization and labeling for the personal articles before a proper a name in Catalan?

For instance: "en Pere" should be labeled "en DA0MS | Pere NP00SP0", not as a preposicional phrase ("en SP | Pere NP00SP0"). And "na Maria" should be tokenized in two words and tagged such as "na DA0FS | Maria_NP, not as "na_Maria_NP00SP0" (with a "_" merging the article and the proper name).

Demo is not working

Submitted by Sara_Saad on Tue, 07/24/2018 - 15:12

Hi All,

Please I need your kind assistance as the online demo is not working. I have about 5 days trying and I always have the below error message:

Fatal error: Uncaught Exception: String could not be parsed as XML in /home/operador/public_html/freeling-8.5.0/demo/demo.php:265 Stack trace: #0 /home/operador/public_html/freeling-8.5.0/demo/demo.php(265): SimpleXMLElement->__construct('') #1 {main} thrown in /home/operador/public_html/freeling-8.5.0/demo/demo.php on line 265

would you please advise?

Thanks you!
Sara

Example 07: Extracting Triples with Semantic Information

Submitted by kashyap on Mon, 07/09/2018 - 14:51

As a part of my current task, I am trying to feed(/extracted) knowledge(/triples) from Freeling to the Ontology. I did go through Freeling tutorials and I did try to run the example 7(https://talp-upc.gitbooks.io/freeling-tutorial/content/code/example07.p…) but when I run the programme it does gives below error:

Different behavior with Catalan numbers

Submitted by jmfc90 on Fri, 02/10/2017 - 13:03

Hello.
I am having some troubles working with numbers in Catalan. In your demo page I use the text "tres mil" with Catalan language and using the default configuration and I get the following output:

tres_mil
3000
Z

But If I use the text "nou mil" with exactly the same configuration I get the output:

nou mil
nou 1000
AQ0MS00 Z

incorrect language on Semantic Graph Frame lemma

Submitted by carlesg on Fri, 07/29/2016 - 14:37

Hello,

When I use the SemanticGraph to ask for the lemmas of the Frames on the Graph, I get the lemma in English, although I configure Freeling in Spanish, and I ask a question in Spanish.
If I try the Spanish sentence 'Dime el valor del coche.', the tagger says:

-------- TAGGER results -----------
Di decir VMM02S0
me me PP1CS00
el el DA0MS0
valor valor NCMS000
de de SP
el el DA0MS0
coche coche NCMS000
. . Fp