Title
Acoustic modeling for hindi speech recognition in low-resource settings
Abstract
We propose an approach for acoustic modeling of Hindi speech by borrowing from English data, for the purpose of Hindi LVCSR. Hindi, like many Indian languages, has a significant speaker base but there have not been a lot of resources to obtain large amounts of transcribed Hindi data for LVCSR. We compare a baseline Gaussian model-sharing approach with DNN training. A widely used data-borrowing method with DNN is to firstly train a DNN with English, for which a large amount of training data is available; then the whole DNN, except the last layer, is fine-tuned by using the target Hindi data. We propose to do phonetic mapping between Hindi and English in the first stage, training Hindi acoustic models by sharing data between Hindi-English phone pairs in the second stage, and finally fine-tuning the acoustic model by using the Hindi data. We evaluate and compare these approaches with experiments using 1 hour of transcribed Hindi data and 15 hours of Wall Street Journal English data. Experiments show that the proposed method significantly outperforms conventional baseline models in a low-resource setting for phone recognition tasks.
Year
DOI
Venue
2014
10.1109/ICALIP.2014.7009923
Audio, Language and Image Processing
Keywords
DocType
Citations 
gaussian processes,acoustic signal processing,feedforward neural nets,hidden markov models,learning (artificial intelligence),natural language processing,speaker recognition,speech processing,dnn training,gmm,gaussian mixture models,hmm,hindi lvcsr,hindi speech recognition,hindi-english phone pairs,indian languages,wall street journal english data,acoustic modeling,baseline gaussian model-sharing approach,data sharing,deep neural network,feed-forward network,low-resource settings,phone recognition tasks,phonetic mapping,hindi lvscr,data borrowing,low resource,phone mapping,feature extraction,speech,speech recognition,data models,acoustics
Conference
0
PageRank 
References 
Authors
0.34
6
3
Name
Order
Citations
PageRank
Anik Dey1143.23
Weibin Zhang23110.03
Pascale Fung3653135.24