Title | ||
---|---|---|
A Classification-based Algorithm for Consistency Check of Part-of-Speech Tagging for Chinese Corpora |
Abstract | ||
---|---|---|
Ensuring consistency of Part-of-Speech (POS) tagging plays an important role in constructing high-quality Chinese corpora. After analyzing the POS tagging of multi-category words in large-scale corpora, we propose a novel consistency check method for POS tagging in this paper. Our method builds a vector model of the context of multi-category words, and uses the k-NN algorithm to classify context vectors constructed from POS tagging sequences and judge their consistency. The experimental results indicate that the proposed method is feasible and effective. |
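
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The window size, the position-marked tag features, the cosine similarity measure, the toy tag set, and all function names (`context_vector`, `knn_predict`, `is_consistent`) are assumptions made for illustration only. The sketch represents the POS tags surrounding a multi-category word as a sparse vector, finds the k most similar labelled contexts, and treats an occurrence as consistent when its annotated tag matches the majority tag of those neighbours.

```python
import math
from collections import Counter


def context_vector(tags, index, window=2):
    """Sparse vector of (offset, tag) features around position `index`.

    The tag at `index` itself is excluded: it is the label being checked.
    """
    vec = Counter()
    for offset in range(-window, window + 1):
        pos = index + offset
        if offset != 0 and 0 <= pos < len(tags):
            vec[(offset, tags[pos])] += 1
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(a[key] * b[key] for key in a if key in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def knn_predict(train, vec, k=3):
    """Majority tag among the k labelled contexts most similar to `vec`."""
    nearest = sorted(train, key=lambda item: cosine(item[0], vec), reverse=True)[:k]
    votes = Counter(tag for _, tag in nearest)
    return votes.most_common(1)[0][0]


def is_consistent(train, tags, index, k=3, window=2):
    """True if the annotated tag at `index` matches the k-NN prediction."""
    predicted = knn_predict(train, context_vector(tags, index, window), k)
    return predicted == tags[index]


# Toy labelled contexts for one hypothetical multi-category word (verb vs. noun).
# Each entry is (POS tag sequence, position of the word in that sequence).
labelled = [
    (["d", "v", "n"], 1),
    (["d", "v", "u"], 1),
    (["a", "v", "n"], 1),
    (["u", "n", "v"], 1),
    (["u", "n", "w"], 1),
    (["m", "n", "v"], 1),
]
train = [(context_vector(seq, i), seq[i]) for seq, i in labelled]

# A noun tag in a noun-like context agrees with its neighbours.
print(is_consistent(train, ["u", "n", "v"], 1))
# A noun tag in a verb-like context is flagged as a possible inconsistency.
print(is_consistent(train, ["d", "n", "n"], 1))
```

A flagged occurrence is only a candidate for manual review. With so few labelled contexts the vote is fragile, so a realistic check would need a much larger set of contexts per word.
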
Year | Venue | DocType |
---|---|---|
2005 | IJCNLP (companion) | Conference |
Citations | PageRank | References |
0 | 0.34 | 1 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hu Zhang | 1 | 8 | 6.31 |
Jia-heng Zheng | 2 | 9 | 4.17 |
Ying Zhao | 3 | 902 | 49.19 |