Questions and aswers related to FreeLing usage

Diferencias entre resultado de Demo Online y programaticamente version 4.1

Submitted by lag.manuel on Mon, 09/24/2018 - 16:21

Buenos días, al querer obtener el árbol de dependencias nos surge el siguiente problema,
tenemos diferencias entre la demo Online y prográmaticamente en la frase "La tormenta azotó Montevideo y el huracán destruyó Miami". El resultado correcto es el que da la demo online. Nosotros estamos usando "dep_treeler" porque cuando usabamos "grammar-chunk" y "dep_txala" el árbol de dependecias que obteníamos como resultado era el "full parsing".
El conll nuestro es:

1 La el DA0FS0 - - - - - 2 spec - -

No rule to get short versions of chunk tags (English)

Submitted by PaulBreugnot on Wed, 08/08/2018 - 17:19

I am using Freeling to process English sentences, more particularly using hmm_tagger.
At the beginning I add a lot of errors "TAGSET: No rule to get short version of tag [tag]" where tag was of the form "tag1+tag2" (e.g. : 'DT+MD', 'VB+PRP'... It mainly corresponds to all the English abbreviation I'll, We'd, you're...).
Even if they don't make the program crash, those errors was just overflowing the terminal, slowing the program, and it seems that those PoS tags was not assigned to corresponding words.

Use Freeling with sentences without full stop at the end?

Submitted by nantesfer on Wed, 07/04/2018 - 17:44

Hello. I have just started using Freeling, and I am using it to obtain the lemma form (get_lemma() ) and saving it on a string of some Spanish reviews I get from Google Maps API. Freeling works well with sentences that have full stop at the end (for example, “Buen lugar, comodo y agradable.), but it does not when the review doesn’t have full stop (for example, Buen lugar. Trato amigable). In that case, Freeling won’t return the lemma form of each one of the words in the sentence, so the string remains empty.

Output formatting in Freeling 4.0's analyzer.bat

Submitted by jgregores on Tue, 05/15/2018 - 23:33

I'm using the analyzer.bat to analyze literary texts and I would like to be able to distinguish dialogue from narration. To this end, I would need to know when paragraphs start or end. The thing is, the standard output from analyzer.bat puts an empty line between each sentence, but doesn't mark paragraphs at all from what I can understand.

Does the analyzer.bat actually mark paragraphs? If not, is there a way to change this without coding my own main program using the libraries?

Thank you in advance.

How to find out if suffix rule was applied

Submitted by fcbr on Fri, 04/20/2018 - 14:52

I'm using Freeling 4.0 and I need to check if a particular word was found via the suffixes rules.

Using Python, I'm using the following code excerpt:

w = freeling.word(form)

It looks like the only way to find out if a word was "found" via suffix rules is when get_lemma() returns something and fund_in_dict() returns None.

Is that correct?

Integration of Freeling and UIMA

Submitted by JoanCodina on Fri, 04/13/2018 - 17:03

Inside the OPenMINTED project we have integrated the FreeLing parser within the UIMA framework
In github there is the code of the wrapper and a sample program to read/write XMI or txt files, analyze them with FreeLIng and produce as output XMI files with the corresponding annotations
Also there is a docker that already includes FreeLing and the wrapper. The docker processes all the files in an input folder and leaves the processed files in the output folder.
You can find them here, the github repository: