Title
Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data
Abstract
More recently, a novel objective function of discriminative acoustic model training, namely lattice-free MMI (LF-MMI), has been proposed and achieved the new state-of-the-art in automatic speech recognition (ASR). Although LF-MMI shows excellent performance in a wide array of ASR tasks with supervised training settings, there is a dearth of work on investigating its effectiveness in the scenario of unsupervised or semi-supervised training. On the other hand, semi-supervised (or self-training) of acoustic model suffers from the problem that it is hard to estimate a good model when only a limited amount of correctly transcribed data is made available. It is also generally acknowledged that the performance of discriminative training is vulnerable to correctness of speech transcripts employed for training. In view of the above, this paper explores two novel extensions to LF-MMI. The first one is to distill knowledge (acoustic training statistics) from a large amount of out-of-domain data to better estimate the seed models for use in semi-supervised training. The second one is to make effective selection of the untranscribed target domain data for semi-supervised training. A series of experiments conducted on the AMI benchmark corpus demonstrate the gains from these two extensions are pronounced and additive, which also reveals their effectiveness and viability.
Year
DOI
Venue
2019
10.1109/APSIPAASC47483.2019.9023040
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Keywords
DocType
ISSN
acoustic training statistics,out-of-domain data,semisupervised training,discriminative acoustic model training,lattice-free MMI,LF-MMI,supervised training settings,discriminative training,objective function,automatic speech recognition
Conference
2640-009X
ISBN
Citations 
PageRank 
978-1-7281-3249-5
0
0.34
References 
Authors
12
2
Name
Order
Citations
PageRank
Tien-Hong Lo113.74
Berlin Chen215134.59