Features

FreeLing is designed to be used as an external library from any application requiring this kind of services.
Nevertheless, a simple main program is also provided as a basic interface to the library, which enables the user to analyze text files from the command line.

Main services offered by FreeLing library:

  • Text tokenization
  • Sentence splitting
  • Morphological analysis
  • Suffix treatment, retokenization of clitic pronouns
  • Compound-word recognition
  • Flexible multiword recognition
  • Contraction splitting
  • Probabilistic prediction of unkown word categories
  • Phonetic encoding
  • SED-based search for similar words in dictionary
  • Named entity detection
  • Recognition of dates, numbers, ratios, currency, and physical magnitudes (speed, weight, temperature, density, etc.)
  • PoS tagging
  • Chart-based shallow parsing
  • Named entity classification
  • WordNet-based sense annotation and disambiguation
  • Rule-based dependency parsing
  • Statistical dependency parsing
  • Statistical semantic role labelling
  • Coreference resolution
  • Semantic graph extraction

Currently supported languages are English, Spanish, Portuguese, French, Italian, German, Russian, Norwegian, Catalan, Galician, Croatian, Slovene, Asturian, and Welsh.
See the user manual for more information about which services are available for each language.