Title
Using multiple edit distances to automatically rank machine translation output.
Abstract
This paper addresses the challenging problem of automatically evaluating output from machine translation (MT) systems in order to support the developers of these systems. Conventional approaches to the problem include methods that automatically assign a rank such as A, B, C, or D to MT output according to a single edit distance between this output and a correct translation example. The single edit distance can be differently designed, but changing its design makes assigning a certain rank more accurate, but another rank less accurate. This inhibits improving accuracy of rank assignment. To overcome this obstacle, this paper proposes an automatic ranking method that, by using multiple edit distances, encodes machine-translated sentences with a rank assigned by humans into multi-dimensional vectors from which a classifier of ranks is learned in the form of a decision tree (DT). The proposed method assigns a rank to MT output through the learned DT. The proposed method is evaluated using transcribed texts of real conversations in the travel arrangement domain. Experimental results show that the proposed method is more accurate than the single-edit-distance-based ranking methods, in both closed and open tests. Moreover, the proposed method could estimate MT quality within 3% error in some cases.
Year
Venue
Keywords
2001
MTSummit
multiple edit distances,automatic evaluation,machine learning,machine translation system,decision trees,edit distance,machine translation,decision tree
DocType
Citations 
PageRank 
Conference
33
5.66
References 
Authors
9
3
Name
Order
Citations
PageRank
Yasuhiro Akiba114324.43
Kenji Imamura2335.66
Eiichiro SUMITA31466190.87