Title
I-vector based deep neural network acoustic model adaptation using multilingual language resource.
Abstract
I-vector adaptation of DNN-HMM acoustic models has shown clear performance improvement for speech recognition. In this paper, we study this technique on Babel task. we use Swahili as target language (training data of 50 hours) and another 6 languages as multilingual resources to train i-vector extractors respectively. Our study shows that i-vector extractors trained with more multilingual data only produce slightly improved results. Moreover, we compared two i-vectors adaptation methods, 1) concatenate i-vectors with spectral features; 2) predict a bias term adding it to spectral features from i-vectors using a NN. When DNN is trained from scratch, the two methods perform similarly. However, only the second method is appropriate in a cross-lingual transfer learning scenario. We investigate it as well, and results show further word error rate reduction can be gained.
Year
Venue
Keywords
2016
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
I-vector,deep neural network,adaptation,multilingual,speech recognition
Field
DocType
ISSN
Data modeling,Computer science,Transfer of learning,Word error rate,Speech recognition,Feature extraction,Natural language processing,Artificial intelligence,Artificial neural network,Hidden Markov model,Acoustic model,Performance improvement
Conference
2309-9402
Citations 
PageRank 
References 
0
0.34
0
Authors
6
Name
Order
Citations
PageRank
Haihua Xu15511.41
Wei Rao2678.73
Xiong Xiao328134.97
Hao Huang401.01
Eng Siong Chng5970106.33
Haizhou Li63678334.61