Using multiple edit distances to automatically rank machine translation output. - Citegraph

Paper Info

Title
Using multiple edit distances to automatically rank machine translation output.

Abstract
This paper addresses the challenging problem of automatically evaluating output from machine translation (MT) systems in order to support the developers of these systems. Conventional approaches to the problem include methods that automatically assign a rank such as A, B, C, or D to MT output according to a single edit distance between this output and a correct translation example. The single edit distance can be differently designed, but changing its design makes assigning a certain rank more accurate, but another rank less accurate. This inhibits improving accuracy of rank assignment. To overcome this obstacle, this paper proposes an automatic ranking method that, by using multiple edit distances, encodes machine-translated sentences with a rank assigned by humans into multi-dimensional vectors from which a classifier of ranks is learned in the form of a decision tree (DT). The proposed method assigns a rank to MT output through the learned DT. The proposed method is evaluated using transcribed texts of real conversations in the travel arrangement domain. Experimental results show that the proposed method is more accurate than the single-edit-distance-based ranking methods, in both closed and open tests. Moreover, the proposed method could estimate MT quality within 3% error in some cases.

Year	Venue	Keywords
2001	MTSummit	multiple edit distances,automatic evaluation,machine learning,machine translation system,decision trees,edit distance,machine translation,decision tree
DocType	Citations	PageRank
Conference	33	5.66
References	Authors
9	3

Authors (3 rows)

Cited by (33 rows)

References (9 rows)

Name	Order	Citations	PageRank
Yasuhiro Akiba	1	143	24.43
Kenji Imamura	2	33	5.66
Eiichiro SUMITA	3	1466	190.87

1