Title
Improved Quality Estimation of Machine Translation with Pre-trained Language Representation
Abstract
Translation quality estimation (QE) is the task of estimating the quality of the output of an unknown machine translation (MT) system without reference translations, at various levels of granularity (sentence, word, or phrase). It has attracted much attention because of its potential to reduce human post-editing effort. However, QE suffers heavily from the fact that quality annotation data remain expensive and scarce. In this paper, we focus on the limited QE data problem and investigate how to use the high-level latent features learned by pre-trained language models to improve QE. Specifically, we explore three strategies for integrating pre-trained language representations into QE models: (1) a mixed integration model, where the pre-trained language features are mixed with other features for QE; (2) a direct integration model, which treats the pre-trained language model as the only feature-extracting component of the entire QE model; and (3) a constrained integration model, which adds a constraint mechanism to the direct integration model to optimize the quality prediction. Experiments and analysis presented in this paper demonstrate the effectiveness of our approaches on the QE task.
Year: 2019
DOI: 10.1007/978-3-030-32233-5_32
Venue: Lecture Notes in Artificial Intelligence
Keywords: Quality estimation, Machine translation, Pre-trained language model
DocType: Conference
Volume: 11838
ISSN: 0302-9743
Citations: 0
PageRank: 0.34
References: 0
Authors: 6
Name
Order
Citations
PageRank
Guoyi Miao101.69
Hui Di211.71
Jinan Xu300.34
Zhongcheng Yang400.34
Yufeng Chen53816.55
Kazushige Ouchi69513.52