Abstract | ||
---|---|---|
We present Entropy Guided Transformation Learning models for three Portuguese Language Processing tasks: Part-of-Speech Tagging,
Noun Phrase Chunking and Named Entity Recognition. For Part-of-Speech Tagging, we separately use the Mac-Morpho Corpus and
the Tycho Brahe Corpus. For Noun Phrase Chunking, we use the SNR-CLIC Corpus. For Named Entity Recognition, we separately
use three corpora: HAREM, MiniHAREM and LearnNEC06.
For each one of the tasks, the ETL modeling phase is quick and simple. ETL only requires the training set and no handcrafted
templates. ETL also simplifies the incorporation of new input features, such as capitalization information, which are sucessfully
used in the ETL based systems. Using the ETL approach, we obtain state-of-the-art competitive performance in all six corpora-based
tasks. These results indicate that ETL is a suitable approach for the construction of Portuguese corpus-based systems. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1590/S0104-65002008000400003 | Journal of the Brazilian Computer Society |
Keywords | Field | DocType |
Entropy Guided Transformation Learning,transformation-based learning,decision trees natural,language processing | Noun phrase,Training set,Data structure,Computer science,Portuguese,Artificial intelligence,Natural language processing,Chunking (psychology),Learning models,Transformation based learning,Named-entity recognition | Journal |
Volume | Issue | ISSN |
14 | 4 | 1678-4804 |
Citations | PageRank | References |
4 | 0.48 | 19 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ruy Luiz Milidiú | 1 | 192 | 20.18 |
Cícero Nogueira dos Santos | 2 | 771 | 37.83 |
Julio C. Duarte | 3 | 26 | 2.46 |