Title | ||
---|---|---|
UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web Data. |
Abstract | ||
---|---|---|
We present our three system submissions for the POS tagging subtask of the EmpiriST Shared Task. Our baseline system, UdS-retrain, extends a standard training dataset with in-domain training data; UdS-distributional and UdS-surface add two different ways of handling OOV words on top of the baseline system, using either distributional information or a combination of surface similarity and language model information. We reach the best performance using the distributional model. |
Year | Venue | Field |
---|---|---|
2016 | WAC@ACL | Training set, Information retrieval, Computer science, Artificial intelligence, Natural language processing, Baseline system, Language model, German |
DocType | Citations | PageRank |
---|---|---|
Conference | 0 | 0.34 |
References | Authors |
---|---|
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jakob Prange | 1 | 0 | 0.34 |
Andrea Horbach | 2 | 22 | 7.23 |
Stefan Thater | 3 | 756 | 38.54 |