Abstract | ||
---|---|---|
Named entity (NE) translation is a fundamental task in multilingual natural language processing. The performance of a machine translation system depends heavily on precise translation of the inclusive NEs. Furthermore, organization name (ON) is the most complex NE for translation among all the NEs. In this article, the structure formulation of ONs is investigated and a hierarchical structure-based ON translation model for Chinese-to-English translation system is presented. First, the model performs ON chunking; then both the translation of words within chunks and the process of chunk-reordering are achieved by synchronous context-free grammar (CFG). The CFG rules are extracted from bilingual ON pairs in a training program. The main contributions of this article are: (1) defining appropriate chunk-units for analyzing the internal structure of Chinese ONs; (2) making the chunk-based ON translation feasible and flexible via a hierarchical CFG derivation; and (3) proposing a training architecture to automatically learn the synchronous CFG for constructing ONs with chunk-units from aligned bilingual ON pairs. The experiments show that the proposed approach translates the Chinese ONs into English with an accuracy of 93.75% and significantly improves the performance of a baseline statistical machine translation (SMT) system. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1145/1330291.1330292 | ACM Trans. Asian Lang. Inf. Process. |
Keywords | Field | DocType |
rules extraction,alignment,chunk,hierarchical cfg derivation,chinese organization name translation,synchronous cfg,chinese ons,translation model,cfg rule,machine translation system,additional key words and phrases: machine translation,chinese-to-english translation system,structure-based model,hierarchical derivation,named entity,appropriate chunk-units,synchronous context-free grammar,organization name,baseline statistical machine translation,struc- tural analysis,precise translation,structural analysis,machine translation,natural language processing,context free grammar | Rule-based machine translation,Example-based machine translation,Evaluation of machine translation,Computer science,Machine translation,Synchronous context-free grammar,Machine translation software usability,Natural language processing,Transfer-based machine translation,Artificial intelligence,Computer-assisted translation | Journal |
Volume | Issue | Citations |
7 | 1 | 9 |
PageRank | References | Authors |
0.52 | 32 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yufeng Chen | 1 | 38 | 16.55 |
Chengqing Zong | 2 | 1004 | 102.38 |