Linguistic Data

Description

Questions and answers related to FreeLing linguistic data

Verbs suddenly missing in Freeling dictionary (PT)?

Submitted by pbholmen on Wed, 12/09/2020 - 19:39

Hi

I'm using the Portuguese Freeling dictionary data file for a tool for personal use. I just upgraded Freeling using Homebrew on macOS, the current version is stated as 4.2_1

A very common verb, "resolver" is suddenly missing. I looked in the GitHub repo, and see that it is also missing in FreeLing/data/pt/dictionary/entries/verbs.

Error in the Portuguese dictionary?

Submitted by pbholmen on Wed, 12/09/2020 - 13:04

Hi

I think there might be an error in the dictionary found under:
share/freeling/pt/dicc.src

Line 232708 reads:

dar dar VMN0000 dar VMN01S0 dar VMN03S0 dar VMSF1S0 dar VMSF3S0

I believe this means that "dar" is supposed to be the future subjunctive of the verb "dar" for third and first person singular. This would be the case if the verb was regular, however it is irregular. Here is an authoritative source for the correct conjugation:

Lemas file

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:21

Dear all,

Do you know how to get, from a dicc.dat/dicc.txt file, a file with all the lemas and all the tokens forms, in oder to load it in a concordance program such as AntConc?

casa -> casa, cases
menjar -> menjo, menjava, menjaré, mengin...

Could yo recommend me some kind of instruction, such as awk...?

Personal articles in Catalan: "En Pere" & "na Maria"

Submitted by Sebastià Salvà… on Mon, 11/16/2020 - 20:14

Dear all,

Any idea or suggestion for modifying some catalan FreeLing language DAT file(s) in order to get the proper tokenization and labeling for the personal articles before a proper a name in Catalan?

For instance: "en Pere" should be labeled "en DA0MS | Pere NP00SP0", not as a preposicional phrase ("en SP | Pere NP00SP0"). And "na Maria" should be tokenized in two words and tagged such as "na DA0FS | Maria_NP, not as "na_Maria_NP00SP0" (with a "_" merging the article and the proper name).

Demo is not working

Submitted by Sara_Saad on Tue, 07/24/2018 - 15:12

Hi All,

Please I need your kind assistance as the online demo is not working. I have about 5 days trying and I always have the below error message:

Fatal error: Uncaught Exception: String could not be parsed as XML in /home/operador/public_html/freeling-8.5.0/demo/demo.php:265 Stack trace: #0 /home/operador/public_html/freeling-8.5.0/demo/demo.php(265): SimpleXMLElement->__construct('') #1 {main} thrown in /home/operador/public_html/freeling-8.5.0/demo/demo.php on line 265

would you please advise?

Thanks you!
Sara

Example 07: Extracting Triples with Semantic Information

Submitted by kashyap on Mon, 07/09/2018 - 14:51

As a part of my current task, I am trying to feed(/extracted) knowledge(/triples) from Freeling to the Ontology. I did go through Freeling tutorials and I did try to run the example 7(https://talp-upc.gitbooks.io/freeling-tutorial/content/code/example07.p…) but when I run the programme it does gives below error:

Different behavior with Catalan numbers

Submitted by jmfc90 on Fri, 02/10/2017 - 13:03

Hello.
I am having some troubles working with numbers in Catalan. In your demo page I use the text "tres mil" with Catalan language and using the default configuration and I get the following output:

tres_mil
3000
Z

But If I use the text "nou mil" with exactly the same configuration I get the output:

nou mil
nou 1000
AQ0MS00 Z

Verb at the beginning of a sentence in spanish

Submitted by samuelhrg on Tue, 01/10/2017 - 18:23

I found a small problem with the PoS tagging. In the sentence "Esperé a que volvieras" Freeling tags "Esperé" as a noun. If instead I use "Yo esperé a que volvieras" then "Yo" is tagged as the noun and "esperé" as the verb.
I can guess where the problem comes from. In spanish you don't need to specify the noun at the beginning of the sentence because the verb gives you this information. "Esperé" means "I waited", there's no need to write "Yo esperé".