Title
Semantically Smooth Bilingual Phrase Embeddings Based on Recursive Autoencoders
Abstract
In this paper, we propose Semantically Smooth Bilingual Recursive Autoencoders to learn bilingual phrase embeddings. The intuition behind our work is to exploit the intrinsic geometric structure of the embedding space and enforce the learned phrase embeddings to be semantically smooth. Specifically, we extend the conventional bilingual recursive autoencoders by preserving the translation and paraphrase probability distributions via regularization terms to simultaneously exploit richer explicit and implicit similarity constraints for bilingual phrase embeddings. To examine the effectiveness of our model, we incorporate two phrase-level similarity features based on the proposed model into a state-of-the-art phrase-based statistical machine translation system. Experiments on NIST Chinese–English test sets show that our model achieves substantial improvements over the baseline.
Year
DOI
Venue
2020
10.1007/s11063-020-10210-1
Neural Processing Letters
Keywords
DocType
Volume
Bilingual phrase embeddings, Similarity constraints, Machine translation
Journal
51
Issue
ISSN
Citations 
3
1370-4621
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Qian Lin19611.17
Jing Yang215858.81
Xiangwen Zhang3173.51
Hongji Wang491.12
Yaojie Lu500.68
Jinsong Su626041.51