Title
HPSG-Based Preprocessing for English-to-Japanese Translation
Abstract
Japanese word order differs radically from that of the corresponding English sentences. Typical phrase-based statistical machine translation (SMT) systems such as Moses search for the best word permutation within a given distance limit (the distortion limit). English-to-Japanese translation needs a large distortion limit to produce acceptable translations, which makes the number of translation candidates extremely large. As a result, SMT systems often fail to find an acceptable translation within a limited time. To solve this problem, some researchers use rule-based preprocessing, which reorders English words into a Japanese-like order by applying dozens of rules. Our idea is based on two observations: (1) Japanese is a typical head-final language, and (2) the heads of English sentences can be detected with a head-driven phrase structure grammar (HPSG) parser. The main contributions of this article are twofold. First, we demonstrate how an off-the-shelf, state-of-the-art HPSG parser enables us to write the reordering rules at an abstract level and can easily improve the quality of English-to-Japanese translation. Second, we show that syntactic heads achieve better results than semantic heads. The proposed method outperforms the best system of the NTCIR-7 PATMT EJ task.
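The two observations in the abstract can be illustrated with a minimal sketch. It assumes a binary parse tree in which every internal node records which child is its head, and it moves the head child to the end at each node. The `Node` class, the `head_finalize` function, and the hand-built toy tree are hypothetical names for illustration only; they are not the article's actual rule set, and the tree is not the output of a real HPSG parser.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """Binary parse-tree node (hypothetical structure for illustration)."""
    word: Optional[str] = None       # set for leaves only
    left: Optional["Node"] = None    # set for internal nodes only
    right: Optional["Node"] = None
    head: Optional[str] = None       # "left" or "right": which child is the head


def head_finalize(node: Node) -> List[str]:
    """Return the words of the tree with every head child placed last."""
    if node.word is not None:
        return [node.word]
    left_words = head_finalize(node.left)
    right_words = head_finalize(node.right)
    if node.head == "left":
        # The head precedes its dependent in English: swap to make it final.
        return right_words + left_words
    # The head is already final: keep the original order.
    return left_words + right_words


def toy_tree() -> Node:
    """Hand-annotated tree for 'John hit a ball' (head marks are assumptions)."""
    noun_phrase = Node(left=Node(word="a"), right=Node(word="ball"), head="right")
    verb_phrase = Node(left=Node(word="hit"), right=noun_phrase, head="left")
    return Node(left=Node(word="John"), right=verb_phrase, head="right")


if __name__ == "__main__":
    print(" ".join(head_finalize(toy_tree())))
```

On the toy tree, the verb moves behind its object, which gives a subject-object-verb order close to Japanese. After such reordering, a phrase-based decoder would need far less long-distance permutation, which is the motivation stated in the abstract.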
Year
2012
DOI
10.1145/2334801.2334802
Venue
ACM Trans. Asian Lang. Inf. Process.
Keywords
corresponding english sentence,english-to-japanese translation,translation candidate,japanese sentence,acceptable translation,distortion limit,large distance limit,english sentence,hpsg-based preprocessing,distance limit,english word,machine translation,english,japanese,hpsg
Field
Head-driven phrase structure grammar,Rule-based machine translation,Computer science,Permutation,Machine translation,Phrase structure grammar,Phrase,Speech recognition,Artificial intelligence,Natural language processing,Parsing,Syntax
DocType
Journal
Volume
11
Issue
3
Citations
8
PageRank
0.50
References
32
Authors
4
Name, Order, Citations, PageRank
Hideki Isozaki, 1, 934, 64.50
Katsuhito Sudoh, 2, 326, 34.44
Hajime Tsukada, 3, 449, 29.46
Kevin Duh, 4, 819, 72.94