Word Similarity Based Label Smoothing in Rnnlm Training for ASR - Citegraph

Paper Info

Title
Word Similarity Based Label Smoothing in Rnnlm Training for ASR

Abstract
Label smoothing has been shown as an effective regularization approach for deep neural networks. Recently, a context-sensitive label smoothing approach was proposed for training RNNLMs that improved word error rates on speech recognition tasks. Despite the performance gains, its plausible candidate words for label smoothing were confined to n-grams observed in training data. To investigate the potential of label smoothing in model training with insufficient data, in this current work, we propose to utilize the similarity between word embeddings to build a candidate word set for each target word, where by doing so, plausible words outside the n-grams in training data may be found and introduced into candidate word sets for label smoothing. Moreover, we propose to combine the smoothing labels from the n-gram based and the word similarity based methods to improve the generalization capability of RNNLMs. Our proposed approach to RNNLM training has been evaluated for n-best list rescoring on speech recognition tasks of WSJ and AMI, with improved experimental results on word error rates confirming its effectiveness.

Year	DOI	Venue
2021	10.1109/SLT48900.2021.9383598	2021 IEEE Spoken Language Technology Workshop (SLT)
Keywords	DocType	ISSN
neural network,speech recognition,language modeling,label smoothing	Conference	2639-5479
ISBN	Citations	PageRank
978-1-7281-7067-1	0	0.34
References	Authors
0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Minguang Song	1	0	2.37
Yunxin Zhao	2	807	121.74
Shaojun Wang	3	0	0.34
Mei Han	4	952	57.87

1