CitiusTagger / CitiusNec is open-source software, written in Perl, that performs both PoS tagging and Named Entity Classification for Portuguese, English, Galician, and Spanish. Since 2017, it has been the PoS tagger module of the LinguaKit project. It was developed at CITIUS by the ProLNat@GE group.
It makes use of the same tagset as FreeLing.
You can test it in our DEMO.
You can find a description of this tool in the following articles:
Requirements: the 'Storable' Perl module (needed for the Galician language). You can install the module with CPAN:
# cpan>install Storable
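Before installing, it may be worth checking whether Storable is already present, since many Perl distributions ship it by default. The following is a minimal sketch, assuming only that `perl` is on the PATH; the variable name `storable_status` is illustrative and not part of CitiusTool.

```shell
# Try to load the Storable module; "-e 1" runs a no-op program.
# A non-zero exit status means the module (or perl itself) is missing.
if perl -MStorable -e 1 2>/dev/null; then
    storable_status="available"
else
    storable_status="missing"
fi
echo "Storable: $storable_status"
```

If the check reports "missing", install the module with CPAN as described above.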
# tar xzvf CitiusTool.tar.gz
# cd CitiusTool
# sh install-citiustool.sh
# sh nec.sh
Syntax: nec.sh language file
language = pt, es, en, gl
file = path of the input file
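As an illustration of the syntax above, the sketch below validates the two arguments before the tagger is called. The function `check_nec_args` and the file name `input.txt` are hypothetical and not part of CitiusTool; only the language codes and the `nec.sh language file` call come from the syntax description.

```shell
# Hypothetical helper (not shipped with CitiusTool): checks that the
# language code is one of pt, es, en, gl and that the input file is readable.
check_nec_args() {
    lang="$1"
    file="$2"
    case "$lang" in
        pt|es|en|gl) ;;
        *) echo "unsupported language: $lang" >&2; return 1 ;;
    esac
    if [ ! -r "$file" ]; then
        echo "cannot read input file: $file" >&2
        return 2
    fi
    return 0
}

# Example call, assuming the current directory is CitiusTool
# and input.txt is a plain-text file in Spanish:
# check_nec_args es input.txt && sh nec.sh es input.txt
```

The helper returns 1 for an unsupported language and 2 for an unreadable file, so a calling script can tell the two failures apart.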
The Spanish POS-tagger has been trained with the AnCora corpus. The current version of the lexicon contains the same forms as FreeLing.
The European Portuguese FreeLing POS-tagger has been trained with the following linguistic resources:
The Galician POS-tagger has been trained with the Xiada corpus. The current version of the lexicon contains the same forms as FreeLing.