Hello. I'm using freeling in python to analyze a set of newspaper articles in Spanish. I've updated the location gazetteers with a bunch of locations and I've removed all locations from other gazetteers (i.e. organization) because it would return some location as ORG and all I need are locations. I'm analyzing my text with tokenizer, splitter, ner-ab-rich.dat NER and finally nec-ab-rich.dat NEC (all with the default configuration, I haven't changed them). I'm having a problem with the two following sentences:
1- Título: VIRUS ZIKA - BRASIL: (SP) PRIMER CASO AUTOCTONO.
2- Título: ZIKA - BRASIL: (03) MICROCEFALIA, MUERTES, ACTUALIZACIÓN.
When feeling analyzes the first sentence, it returns the tag NP00G00 for brasil (being correctly a location) but for the second sentence it returns the tag NCMS000 for brasil.
So my question is why this could happen as both runs with the same configuration and are similar, but returns different results. And another question is if there's a way that I give higher priority to location gazetteers so I don't have to remove locations from ORG gazetteers or if it's right that I remove them.
Let me know if there's anything more that I can tell you about my configuration.