Title
Phrase-based data selection for language model adaptation in spoken language translation
Abstract
In this paper, we propose an unsupervised phrase-based data selection model, address the problem of selecting no-domain-specific language model (LM) training data to build adapted LM for use. In spoken language translation (SLT) system, we aim at finding the LM training sentences which are similar to the translation task. Compared with the traditional bag-of-words models, the phrase-based data selection model is more effective because it captures contextual information in modeling the selection of phrase as a whole, rather than selection of single words in isolation. Large-scale experimental results demonstrate that our approach significantly outperforms the state-of-the-art approaches on both LM perplexity and translation performance, respectively.
Year
DOI
Venue
2012
10.1109/ISCSLP.2012.6423483
ISCSLP
Keywords
Field
DocType
speech processing,no-domain-specific language model,unsupervised phrase-based data selection model,spoken language translation system,contextual information,phrase-based data selection,language model adaptation,language translation,lm training sentence,spoken language translation,slt system
Speech processing,Perplexity,Cache language model,Language translation,Computer science,Machine translation,Phrase,Speech recognition,Natural language processing,Universal Networking Language,Artificial intelligence,Language model
Conference
Volume
Issue
ISBN
null
null
978-1-4673-2505-9
Citations 
PageRank 
References 
1
0.35
13
Authors
5
Name
Order
Citations
PageRank
Shixiang Lu1193.39
Wei Wei22220.02
Xiaoyin Fu3102.53
Lichun Fan431.07
Bo Xu524136.59