UTF encoding
It looks like text is stored in std::wstring objects in the FreeLing library. What is the Unicode encoding of that text, e.g., UTF-8 or UTF-16?
Thanks.
Questions and answers related to FreeLing usage
Hi, I'm using the FreeLing 4.1 binaries on Windows 10 with the latest version of the SDK installed.
My .txt file contains only the words "This is a demostration.".
By running ... analyze.bat -f en.cfg --outlv coref <demostration.txt,
apart from the message about NEC and UKB, I get two assertion failures (Debug assertion failed!, Windows says nothing about them being runtime exceptions):
Good morning,
I'm using FreeLing to parse certain languages with the Java API (as a test to compare it with other POS taggers). I don't have final results for my tests yet, but FreeLing is a great library for sure :)
Hi, I'm a newbie to FreeLing and to C++.
I want to understand how the NLP is being done.
I'm looking for some simple examples that call FreeLing to generate the shallow and full parsing output shown on the demo web page.
Good morning. When trying to obtain the dependency tree, we ran into the following problem:
we get different results from the online demo and from our own program for the sentence "La tormenta azotó Montevideo y el huracán destruyó Miami". The correct result is the one given by the online demo. We are using "dep_treeler" because when we used "grammar-chunk" and "dep_txala", the dependency tree we obtained was the "full parsing" one.
Our CoNLL output is:
1 La el DA0FS0 - - - - - 2 spec - -
Hello,
I am using FreeLing to process English sentences, specifically with hmm_tagger.
At the beginning I had a lot of errors of the form "TAGSET: No rule to get short version of tag [tag]", where the tag was of the form "tag1+tag2" (e.g., 'DT+MD', 'VB+PRP'...). These mainly correspond to English contractions such as I'll, We'd, you're...
Even though they don't make the program crash, those errors were flooding the terminal and slowing the program down, and it seems that those PoS tags were not assigned to the corresponding words.
Hello. I have just started using FreeLing, and I am using it to obtain the lemma (get_lemma()) of the words in some Spanish reviews I get from the Google Maps API, saving the result in a string. FreeLing works well with sentences that end in a full stop (for example, “Buen lugar, comodo y agradable.”), but not when the review does not end in a full stop (for example, “Buen lugar. Trato amigable”). In that case, FreeLing does not return the lemma of each word in the sentence, so the string remains empty.
I'm using the analyzer.bat to analyze literary texts and I would like to be able to distinguish dialogue from narration. To this end, I would need to know when paragraphs start or end. The thing is, the standard output from analyzer.bat puts an empty line between each sentence, but doesn't mark paragraphs at all from what I can understand.
Does the analyzer.bat actually mark paragraphs? If not, is there a way to change this without coding my own main program using the libraries?
Thank you in advance.
I am trying to use FreeLing in a C++ project in Visual Studio 2017. After a lot of pain linking the library to my code, when I try to use it I get errors (code C2589) in io-sentence.h and io-dep.h saying '(': illegal token on right side of '::'.