Title
Support vector machine based orthographic disambiguation.
Abstract
Orthographic variation can be a serious problem for many nat- ural language-processing applica- tions. Japanese in particular con- tains orthographic variation, be- cause the large quantity of translit- eration from other languages causes many possible spelling variations. To manage this problem, this pa- per proposes a support vector ma- chine (SVM)-based classifier that can determine whether two terms are equivalent. We automatically collected both positive examples (sets of equivalent term pairs) and negative examples (sets of inequiv- alent term pairs). Experimental re- sults yielded high levels of accuracy (87.8%), demonstrating the feasibil- ity of the proposed approach.
Year
Venue
DocType
2007
TMI
Conference
Citations 
PageRank 
References 
1
0.35
6
Authors
4
Name
Order
Citations
PageRank
Eiji Aramaki137145.89
Takeshi Imai2203.73
Kengo Miyo3245.65
Kazuhiko Ohe411515.91