Translation model adaptation by resampling - Citegraph

Paper Info

Title
Translation model adaptation by resampling

Abstract
The translation model of statistical machine translation systems is trained on parallel data coming from various sources and domains. These corpora are usually concatenated, word alignments are calculated and phrases are extracted. This means that the corpora are not weighted according to their importance to the domain of the translation task. This is in contrast to the training of the language model for which well known techniques are used to weight the various sources of texts. On a smaller granularity, the automatic calculated word alignments differ in quality. This is usually not considered when extracting phrases either. In this paper we propose a method to automatically weight the different corpora and alignments. This is achieved with a resampling technique. We report experimental results for a small (IWSLT) and large (NIST) Arabic/English translation tasks. In both cases, significant improvements in the BLEU score were observed.

Year	Venue	Keywords
2010	WMT@ACL	translation task,different corpus,statistical machine translation system,language model,english translation task,translation model adaptation,translation model,various source,bleu score,word alignment,automatic calculated word alignment
Field	DocType	Citations
Rule-based machine translation,Example-based machine translation,Computer science,Evaluation of machine translation,Machine translation,Speech recognition,NIST,Artificial intelligence,Concatenation,Natural language processing,Resampling,Language model	Conference	5
PageRank	References	Authors
0.48	12	3

Authors (3 rows)

Cited by (5 rows)

References (12 rows)

Name	Order	Citations	PageRank
Kashif Shah	1	103	11.69
Loïc Barrault	2	284	22.91
Holger Schwenk	3	2533	228.83

1