Abstract | ||
---|---|---|
We explored a simple, fast and effective learning algorithm, the uneven margins Perceptron, for Chinese word segmen- tation. We adopted the character-based classification framework and trans- formed the task into several binary clas- sification problems. We participated the close and open tests for all the four corpora. For the open test we only used the utf-8 code knowledge for discrimi- nation among Latin characters, Arabic numbers and all other characters. Our system performed well on the as, cityu and msr corpora but was clearly worse than the best result on the pku corpus. |
Year | Venue | DocType |
---|---|---|
2005 | SIGHAN@IJCNLP 2005 | Conference |
Citations | PageRank | References |
2 | 0.57 | 5 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yaoyong Li | 1 | 393 | 26.55 |
Chuanjiang Miao | 2 | 3 | 0.94 |
Kalina Bontcheva | 3 | 2538 | 211.33 |
Hamish Cunningham | 4 | 2426 | 255.41 |