Title | ||
---|---|---|
I-vector based deep neural network acoustic model adaptation using multilingual language resource. |
Abstract | ||
---|---|---|
I-vector adaptation of DNN-HMM acoustic models has shown clear performance improvement for speech recognition. In this paper, we study this technique on Babel task. we use Swahili as target language (training data of 50 hours) and another 6 languages as multilingual resources to train i-vector extractors respectively. Our study shows that i-vector extractors trained with more multilingual data only produce slightly improved results. Moreover, we compared two i-vectors adaptation methods, 1) concatenate i-vectors with spectral features; 2) predict a bias term adding it to spectral features from i-vectors using a NN. When DNN is trained from scratch, the two methods perform similarly. However, only the second method is appropriate in a cross-lingual transfer learning scenario. We investigate it as well, and results show further word error rate reduction can be gained. |
Year | Venue | Keywords |
---|---|---|
2016 | Asia-Pacific Signal and Information Processing Association Annual Summit and Conference | I-vector,deep neural network,adaptation,multilingual,speech recognition |
Field | DocType | ISSN |
Data modeling,Computer science,Transfer of learning,Word error rate,Speech recognition,Feature extraction,Natural language processing,Artificial intelligence,Artificial neural network,Hidden Markov model,Acoustic model,Performance improvement | Conference | 2309-9402 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Haihua Xu | 1 | 55 | 11.41 |
Wei Rao | 2 | 67 | 8.73 |
Xiong Xiao | 3 | 281 | 34.97 |
Hao Huang | 4 | 0 | 1.01 |
Eng Siong Chng | 5 | 970 | 106.33 |
Haizhou Li | 6 | 3678 | 334.61 |