Title | ||
---|---|---|
A bilingual word alignment algorithm of Vietnamese-Chinese based on feature constraint. |
Abstract | ||
---|---|---|
It is difficult to achieve auto-alignment between Vietnamese and Chinese, because their syntax and structure are quite different. In this case we present a novel method for the Vietnamese-Chinese word alignment which merges a variety of feature constraint models. In this article, an improved model based on the Vietnamese-Chinese progressive structure and offset features of word sequence is described. From this model which is trained by a log-linear model framework, and with parameters trained by the minimum error rate algorithm, the result of the Vietnamese-Chinese auto-alignment is obtained. The basic model of the experiments is IBM Model 3, and as experimental results suggest, this bilingual word alignment method for Vietnamese and Chinese performs well and precision, recall rates are increased by 28.57 and 25.02 %, AER is reduced by 14.25 %. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/s13042-014-0293-6 | Int. J. Machine Learning & Cybernetics |
Keywords | Field | DocType |
Vietnamese, Chinese, Word Alignment, Log-Linear Model | IBM,Computer science,Word error rate,Algorithm,Speech recognition,Artificial intelligence,Natural language processing,Vietnamese,Log-linear model,Recall,Syntax,Offset (computer science) | Journal |
Volume | Issue | ISSN |
6 | 4 | 1868-808X |
Citations | PageRank | References |
0 | 0.34 | 17 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuanyuan Mo | 1 | 0 | 0.34 |
Jianyi Guo | 2 | 20 | 10.99 |
Zhengtao Yu | 3 | 460 | 69.08 |
Lin Luo | 4 | 0 | 0.34 |
Shengxiang Gao | 5 | 5 | 5.17 |