Title
Dynamic translation memory: using statistical machine translation to improve translation memory fuzzy matches
Abstract
Professional translators of technical documents often use Translation Memory (TM) systems to capitalize on the repetitions frequently observed in these documents. TM systems typically exploit not only complete matches between the source sentence to be translated and some previously translated sentence, but also so-called fuzzy matches, where the source sentence has substantial commonality with a previously translated sentence. These fuzzy matches can be a very worthwhile starting point for the human translator, but the translator must then manually edit the associated TM-based translation to accommodate the differences from the source sentence to be translated. If part of this process could be automated, the cost of human translation could be significantly reduced. The paper proposes to perform this automation as follows: a phrase-based Statistical Machine Translation (SMT) system (trained on a bilingual corpus in the same domain as the TM) is combined with the TM fuzzy match by extracting from the fuzzy match a large (possibly gapped) bi-phrase, which is dynamically added to the usual set of "static" bi-phrases used for decoding the source. We report experiments that show significant improvements in BLEU and NIST scores over both the translations produced by the stand-alone SMT system and the fuzzy-match translations proposed by the stand-alone TM system.
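To make the notion of a fuzzy match concrete, the following is a minimal Python sketch of a TM lookup. It is an illustration only: the abstract does not say which similarity measure the authors use, so the token-level ratio from the standard library's `difflib.SequenceMatcher` is an assumption, and the function names `fuzzy_match` and `mismatched_spans` as well as the example sentences are hypothetical. The second function lists the source tokens that the retrieved TM entry does not cover, which is roughly the part a translator (or, in the proposed approach, the SMT system) would still have to handle.

```python
from difflib import SequenceMatcher


def fuzzy_match(source, tm_entries):
    """Return (score, tm_source, tm_target) for the TM entry closest to `source`.

    Assumption: similarity is difflib's token-level ratio, not the
    measure used in the paper. Returns None for an empty TM.
    """
    src_tokens = source.split()
    best = None
    for tm_source, tm_target in tm_entries:
        matcher = SequenceMatcher(None, src_tokens, tm_source.split(), autojunk=False)
        score = matcher.ratio()
        if best is None or score > best[0]:
            best = (score, tm_source, tm_target)
    return best


def mismatched_spans(source, tm_source):
    """Return the token spans of `source` that differ from `tm_source`."""
    a, b = source.split(), tm_source.split()
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return [
        a[i1:i2]
        for tag, i1, i2, _j1, _j2 in matcher.get_opcodes()
        if tag in ("replace", "delete")  # tokens present in the source only
    ]


# Hypothetical toy translation memory (English source, French target).
tm = [
    ("press the red button to stop the engine",
     "appuyez sur le bouton rouge pour arrêter le moteur"),
    ("open the cover", "ouvrez le capot"),
]
source = "press the green button to stop the engine"

score, tm_source, tm_target = fuzzy_match(source, tm)
uncovered = mismatched_spans(source, tm_source)
```

Here the first TM entry is retrieved as the fuzzy match, and `uncovered` holds the single token span that differs from it. In the approach described in the abstract, the matching portion would supply a large, possibly gapped bi-phrase to the decoder rather than being post-edited by hand.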
Year
2008
Venue
CICLing
Keywords
translation memory fuzzy match, fuzzy match, dynamic translation memory, fuzzy-match translation, tm system, stand-alone smt system, so-called fuzzy match, statistical machine translation, stand-alone tm system, translation memory, tm fuzzy match, tm-based translation, source sentence
DocType
Conference
Volume
4919
ISSN
0302-9743
ISBN
3-540-78134-X
Citations
20
PageRank
1.33
References
11
Authors
2
Name            Order  Citations  PageRank
Ergun Biçici    1      133        13.23
Marc Dymetman   2      275        38.86