Title
Translation model adaptation by resampling
Abstract
The translation model of statistical machine translation systems is trained on parallel data coming from various sources and domains. These corpora are usually concatenated, word alignments are calculated and phrases are extracted. This means that the corpora are not weighted according to their importance to the domain of the translation task. This is in contrast to the training of the language model for which well known techniques are used to weight the various sources of texts. On a smaller granularity, the automatic calculated word alignments differ in quality. This is usually not considered when extracting phrases either. In this paper we propose a method to automatically weight the different corpora and alignments. This is achieved with a resampling technique. We report experimental results for a small (IWSLT) and large (NIST) Arabic/English translation tasks. In both cases, significant improvements in the BLEU score were observed.
Year
Venue
Keywords
2010
WMT@ACL
translation task,different corpus,statistical machine translation system,language model,english translation task,translation model adaptation,translation model,various source,bleu score,word alignment,automatic calculated word alignment
Field
DocType
Citations 
Rule-based machine translation,Example-based machine translation,Computer science,Evaluation of machine translation,Machine translation,Speech recognition,NIST,Artificial intelligence,Concatenation,Natural language processing,Resampling,Language model
Conference
5
PageRank 
References 
Authors
0.48
12
3
Name
Order
Citations
PageRank
Kashif Shah110311.69
Loïc Barrault228422.91
Holger Schwenk32533228.83