Title
Tunable discounting and visual exploration for language models.
Abstract
A language model is fundamental to many applications in natural language processing. Most language models are trained on a large amount of dataset and difficult to be adapted to other domains which may have only a small dataset available. Tuning discounting parameters for smoothing is one way to adapt language models for a new domain. In this work, we present novel language models based on tunable discounting mechanisms. The language models are trained on a large dataset, but their discounting parameters can be tuned to a target dataset afterwards. We explore tunable discounting and polynomial discounting functions based on the modified Kneser–Ney (mKN) models. Specifically, we propose the tunable mKN (TmKN) model, polymomial discounting mKN (PmKN) model, and tunable and polynomial discounting mKN (TPmKN) model. We test our proposed models and compared with the mKN model, improved KN model, and the tunable mKN with the interpolation model (mKN + interp). With the implementation, our language models achieve perplexity improvements in both in-domain and out-of-domain evaluation. Experimental results indicate that our new models significantly outperform the baseline model and our models are especially suitable for adapting to new domains. In addition, we use the visualization technique to depict the relationship between parameter settings and the language model performances for guiding our parameter optimization process. The exploratory visual analysis is then used to examine the performance of the proposed language models which will reveal the strength and characteristic of the models.
Year
DOI
Venue
2017
10.1016/j.neucom.2016.08.145
Neurocomputing
Keywords
Field
DocType
Language model,Tunable discounting,Polynomial discounting,Domain adaptation,Visualization
Perplexity,Discounting,Polynomial,Domain adaptation,Computer science,Visualization,Interpolation,Smoothing,Artificial intelligence,Language model,Machine learning
Journal
Volume
Issue
ISSN
269
C
0925-2312
Citations 
PageRank 
References 
0
0.34
19
Authors
5
Name
Order
Citations
PageRank
Junfei Guo100.34
Qi Han2114.90
Guangzhi Ma3304.36
Hong Liu49618.53
Seth van Hooland5518.71