Title
Word Similarity Based Label Smoothing in Rnnlm Training for ASR
Abstract
Label smoothing has been shown as an effective regularization approach for deep neural networks. Recently, a context-sensitive label smoothing approach was proposed for training RNNLMs that improved word error rates on speech recognition tasks. Despite the performance gains, its plausible candidate words for label smoothing were confined to n-grams observed in training data. To investigate the potential of label smoothing in model training with insufficient data, in this current work, we propose to utilize the similarity between word embeddings to build a candidate word set for each target word, where by doing so, plausible words outside the n-grams in training data may be found and introduced into candidate word sets for label smoothing. Moreover, we propose to combine the smoothing labels from the n-gram based and the word similarity based methods to improve the generalization capability of RNNLMs. Our proposed approach to RNNLM training has been evaluated for n-best list rescoring on speech recognition tasks of WSJ and AMI, with improved experimental results on word error rates confirming its effectiveness.
Year
DOI
Venue
2021
10.1109/SLT48900.2021.9383598
2021 IEEE Spoken Language Technology Workshop (SLT)
Keywords
DocType
ISSN
neural network,speech recognition,language modeling,label smoothing
Conference
2639-5479
ISBN
Citations 
PageRank 
978-1-7281-7067-1
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Minguang Song102.37
Yunxin Zhao2807121.74
Shaojun Wang300.34
Mei Han495257.87