Title | ||
---|---|---|
A Phrase Table Filtering Model Based On Binary Classification For Uyghur-Chinese Machine Translation |
Abstract | ||
---|---|---|
In statistical machine translation, large amount of unreasonable phrase pairs in a phrase table can affect the decoding efficiency and the overall translation performance, especially in Uyghur-Chinese machine translation. In this paper, we present a novel phrase table filtering model based on binary classification, which consider differences between Uyghur and Chinese, and draw lessons from binary classification in machine learning. In our model, four features are considered: 1) Difference in length between source and target phrase; 2) Proportion of translated words in phrase pairs; 3) Proportion of symbol words; 4) Average number of co-occurrence words in training corpus. We use this model to generate a filtered phrase table. Experimental results show that this new filtering model can improve the performance and efficiency of our current Uygur-Chinese machine translation system. |
Year | DOI | Venue |
---|---|---|
2014 | 10.4304/jcp.9.12.2780-2786 | JOURNAL OF COMPUTERS |
Keywords | Field | DocType |
Uyghur-Chinese machine translation, Phrase table filtering, Binary classification | Rule-based machine translation,Example-based machine translation,Phrase search,Binary classification,Computer science,Machine translation,Phrase,Natural language processing,Artificial intelligence,Pattern recognition,Filter (signal processing),Speech recognition,Decoding methods | Journal |
Volume | Issue | ISSN |
9 | 12 | 1796-203X |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chenggang Mi | 1 | 0 | 4.39 |
Yating Yang | 2 | 1 | 5.14 |
Xi Zhou | 3 | 2 | 6.83 |
Lei Wang | 4 | 6 | 6.89 |
Xiao Li | 5 | 17 | 7.86 |
Eziz Tursun | 6 | 1 | 0.75 |