Title
Syntax-Based Context Representation For Statistical Machine Translation
Abstract
Learning semantic representation for translation context is beneficial to statistical machine translation (SMT). Previous efforts have focused on implicitly encoding syntactic and semantic knowledge in translation context by neural networks, which are weak in capturing explicit structural syntax information. In this paper, we propose a new neural network with a tree-based convolutional architecture to explicitly learn structural syntax information in translation context, thus improving translation prediction. Specifically, we first convert parallel sentences with source parse trees into syntax-based linear sequences based on a minimum syntax subtree algorithm, and then define a tree-based convolutional network over the linear sequences to learn syntax-based context representation and translation prediction jointly. To verify the effectiveness, the proposed model is integrated into phrase-based SMT. Experiments on large-scale Chinese-to-English and German-to-English translation tasks show that the proposed approach can achieve a substantial and significant improvement over several baseline systems.
Year
DOI
Venue
2018
10.1587/transinf.2018EDP7209
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Keywords
Field
DocType
syntax context representation, tree-based neural network, translation prediction, statistical machine translation
Pattern recognition,Computer science,Machine translation,Natural language processing,Artificial intelligence,Syntax
Journal
Volume
Issue
ISSN
E101D
12
1745-1361
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
Kehai Chen14316.34
Tiejun Zhao2643102.68
Yang Muyun311229.50