Perceptron Learning for Chinese Word Segmentation - Citegraph

Paper Info

Title
Perceptron Learning for Chinese Word Segmentation

Abstract
We explored a simple, fast and effective learning algorithm, the uneven margins Perceptron, for Chinese word segmentation. We adopted the character-based classification framework and transformed the task into several binary classification problems. We participated in the closed and open tests for all four corpora. For the open test we only used the UTF-8 code knowledge for discrimination among Latin characters, Arabic numbers and all other characters. Our system performed well on the as, cityu and msr corpora but was clearly worse than the best result on the pku corpus.
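
The abstract's recipe (per-character binary classification trained with an uneven margins Perceptron) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' setup: the single "does this character start a word" problem, the three-character feature template, the margin values `tau_pos` and `tau_neg`, and all function names are hypothetical. The update rule follows the commonly described uneven margins form, where an example triggers an update whenever its signed score does not exceed the margin for its class.

```python
from collections import defaultdict

def char_features(chars, i):
    """Hypothetical feature template: the character and its two neighbours."""
    prev_c = chars[i - 1] if i > 0 else "<s>"
    next_c = chars[i + 1] if i + 1 < len(chars) else "</s>"
    return {f"c0={chars[i]}", f"c-1={prev_c}", f"c+1={next_c}"}

def word_start_examples(words):
    """Turn one segmented sentence into (features, label) pairs.
    Label +1: the character begins a word; -1: it continues one."""
    chars = [c for w in words for c in w]
    labels = [1 if k == 0 else -1 for w in words for k in range(len(w))]
    return [(char_features(chars, i), labels[i]) for i in range(len(chars))]

def train_paum(examples, tau_pos=1.0, tau_neg=0.1, eta=1.0, max_epochs=100):
    """Perceptron with separate margins for positive and negative examples."""
    w = defaultdict(float)
    b = 0.0
    # Features are binary, so the squared norm of an example is its feature count.
    r2 = max(len(feats) for feats, _ in examples)
    for _ in range(max_epochs):
        updated = False
        for feats, y in examples:
            score = sum(w[f] for f in feats) + b
            margin = tau_pos if y > 0 else tau_neg
            if y * score <= margin:  # margin violated: update weights and bias
                for f in feats:
                    w[f] += eta * y
                b += eta * y * r2
                updated = True
        if not updated:  # every example clears its margin
            break
    return w, b

def predict(w, b, feats):
    return 1 if sum(w.get(f, 0.0) for f in feats) + b > 0 else -1

def segment(w, b, text):
    """Rebuild words from per-character word-start predictions."""
    chars = list(text)
    words = []
    for i, c in enumerate(chars):
        if i == 0 or predict(w, b, char_features(chars, i)) > 0:
            words.append(c)   # start a new word
        else:
            words[-1] += c    # extend the current word
    return words

# Toy training data (illustrative only, not from any SIGHAN corpus).
sentences = [["我", "喜欢", "北京"], ["北京", "欢迎", "你"]]
examples = [ex for s in sentences for ex in word_start_examples(s)]
w, b = train_paum(examples)
print(segment(w, b, "我喜欢北京"))
```

Setting `tau_pos` larger than `tau_neg` asks for a wider margin on the positive class, which is the usual motivation for uneven margins when positive examples are the rarer class. A fuller system would train one such classifier per tag and add richer context features.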

Year	Venue	DocType	Citations	PageRank	References	Authors
2005	SIGHAN@IJCNLP 2005	Conference	2	0.57	5	4

Authors (4 rows)

Name	Order	Citations	PageRank
Yaoyong Li	1	393	26.55
Chuanjiang Miao	2	3	0.94
Kalina Bontcheva	3	2538	211.33
Hamish Cunningham	4	2426	255.41
