Abstract | ||
---|---|---|
English is a typical SVO (Subject-Verb-Object) language, while Japanese is a typical SOV language. Conventional Statistical Machine Translation (SMT) systems work well within each of these language families. However, SMT-based translation from an SVO language to an SOV language does not work well because their word orders are completely different. Recently, a few groups have proposed rule-based preprocessing methods to mitigate this problem (Xu et al., 2009; Hong et al., 2009). These methods rewrite SVO sentences to derive more SOV-like sentences by using a set of handcrafted rules. In this paper, we propose an alternative single reordering rule: Head Finalization. This is a syntax-based preprocessing approach that offers the advantage of simplicity. We do not have to be concerned about part-of-speech tags or rule weights because the powerful Enju parser allows us to implement the rule at a general level. Our experiments show that its result, Head Final English (HFE), follows almost the same order as Japanese. We also show that this rule improves automatic evaluation scores. |
Year | Venue | Keywords |
---|---|---|
2010 | WMT@ACL | language family,simple reordering rule,typical svo,head final english,head finalization,handcrafted rule,alternative single reordering rule,rule weight,svo language,sov language,svo sentence,typical sov language,rule based,word order,machine translation |
Field | DocType | Citations |
Computer science,Machine translation,Preprocessor,Artificial intelligence,Finalization,Natural language processing,Parsing,Syntax | Conference | 41 |
PageRank | References | Authors |
1.25 | 16 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hideki Isozaki | 1 | 934 | 64.50 |
Katsuhito Sudoh | 2 | 326 | 34.44 |
Hajime Tsukada | 3 | 449 | 29.46 |
Kevin Duh | 4 | 819 | 72.94 |