Title
A Phrase Table Filtering Model Based On Binary Classification For Uyghur-Chinese Machine Translation
Abstract
In statistical machine translation, large amount of unreasonable phrase pairs in a phrase table can affect the decoding efficiency and the overall translation performance, especially in Uyghur-Chinese machine translation. In this paper, we present a novel phrase table filtering model based on binary classification, which consider differences between Uyghur and Chinese, and draw lessons from binary classification in machine learning. In our model, four features are considered: 1) Difference in length between source and target phrase; 2) Proportion of translated words in phrase pairs; 3) Proportion of symbol words; 4) Average number of co-occurrence words in training corpus. We use this model to generate a filtered phrase table. Experimental results show that this new filtering model can improve the performance and efficiency of our current Uygur-Chinese machine translation system.
Year
DOI
Venue
2014
10.4304/jcp.9.12.2780-2786
JOURNAL OF COMPUTERS
Keywords
Field
DocType
Uyghur-Chinese machine translation, Phrase table filtering, Binary classification
Rule-based machine translation,Example-based machine translation,Phrase search,Binary classification,Computer science,Machine translation,Phrase,Natural language processing,Artificial intelligence,Pattern recognition,Filter (signal processing),Speech recognition,Decoding methods
Journal
Volume
Issue
ISSN
9
12
1796-203X
Citations 
PageRank 
References 
0
0.34
0
Authors
6
Name
Order
Citations
PageRank
Chenggang Mi104.39
Yating Yang215.14
Xi Zhou326.83
Lei Wang466.89
Xiao Li5177.86
Eziz Tursun610.75