Abstract | ||
---|---|---|
Japanese sentences have completely different word orders from corresponding English sentences. Typical phrase-based statistical machine translation (SMT) systems such as Moses search for the best word permutation within a given distance limit (distortion limit). For English-to-Japanese translation, we need a large distance limit to obtain acceptable translations, and the number of translation candidates is extremely large. Therefore, SMT systems often fail to find acceptable translations within a limited time. To solve this problem, some researchers use rule-based preprocessing approaches, which reorder English words just like Japanese by using dozens of rules. Our idea is based on the following two observations: (1) Japanese is a typical head-final language, and (2) we can detect heads of English sentences by a head-driven phrase structure grammar (HPSG) parser. The main contributions of this article are twofold: First, we demonstrate how off-the-shelf, state-of-the-art HPSG parser enables us to write the reordering rules in an abstract level and can easily improve the quality of English-to-Japanese translation. Second, we also show that syntactic heads achieve better results than semantic heads. The proposed method outperforms the best system of NTCIR-7 PATMT EJ task. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1145/2334801.2334802 | ACM Trans. Asian Lang. Inf. Process. |
Keywords | Field | DocType |
corresponding english sentence,english-to-japanese translation,translation candidate,japanese sentence,acceptable translation,distortion limit,large distance limit,english sentence,hpsg-based preprocessing,distance limit,english word,machine translation,english,japanese,hpsg | Head-driven phrase structure grammar,Rule-based machine translation,Computer science,Permutation,Machine translation,Phrase structure grammar,Phrase,Speech recognition,Artificial intelligence,Natural language processing,Parsing,Syntax | Journal |
Volume | Issue | Citations |
11 | 3 | 8 |
PageRank | References | Authors |
0.50 | 32 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hideki Isozaki | 1 | 934 | 64.50 |
Katsuhito Sudoh | 2 | 326 | 34.44 |
Hajime Tsukada | 3 | 449 | 29.46 |
Kevin Duh | 4 | 819 | 72.94 |