I-vector based deep neural network acoustic model adaptation using multilingual language resource. - Citegraph

Paper Info

Title
I-vector based deep neural network acoustic model adaptation using multilingual language resource.

Abstract
I-vector adaptation of DNN-HMM acoustic models has shown clear performance improvement for speech recognition. In this paper, we study this technique on Babel task. we use Swahili as target language (training data of 50 hours) and another 6 languages as multilingual resources to train i-vector extractors respectively. Our study shows that i-vector extractors trained with more multilingual data only produce slightly improved results. Moreover, we compared two i-vectors adaptation methods, 1) concatenate i-vectors with spectral features; 2) predict a bias term adding it to spectral features from i-vectors using a NN. When DNN is trained from scratch, the two methods perform similarly. However, only the second method is appropriate in a cross-lingual transfer learning scenario. We investigate it as well, and results show further word error rate reduction can be gained.

Year	Venue	Keywords
2016	Asia-Pacific Signal and Information Processing Association Annual Summit and Conference	I-vector,deep neural network,adaptation,multilingual,speech recognition
Field	DocType	ISSN
Data modeling,Computer science,Transfer of learning,Word error rate,Speech recognition,Feature extraction,Natural language processing,Artificial intelligence,Artificial neural network,Hidden Markov model,Acoustic model,Performance improvement	Conference	2309-9402
Citations	PageRank	References
0	0.34	0
Authors
6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Haihua Xu	1	55	11.41
Wei Rao	2	67	8.73
Xiong Xiao	3	281	34.97
Hao Huang	4	0	1.01
Eng Siong Chng	5	970	106.33
Haizhou Li	6	3678	334.61

1