Title
Perceptron Learning for Chinese Word Segmentation
Abstract
We explored a simple, fast and effective learning algorithm, the uneven margins Perceptron, for Chinese word segmen- tation. We adopted the character-based classification framework and trans- formed the task into several binary clas- sification problems. We participated the close and open tests for all the four corpora. For the open test we only used the utf-8 code knowledge for discrimi- nation among Latin characters, Arabic numbers and all other characters. Our system performed well on the as, cityu and msr corpora but was clearly worse than the best result on the pku corpus.
Year
Venue
DocType
2005
SIGHAN@IJCNLP 2005
Conference
Citations 
PageRank 
References 
2
0.57
5
Authors
4
Name
Order
Citations
PageRank
Yaoyong Li139326.55
Chuanjiang Miao230.94
Kalina Bontcheva32538211.33
Hamish Cunningham42426255.41