Title
Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation
Abstract
In the work presented in this paper, we have explored the effect of speaking-rate adaptation on children's speech recognition using acoustic models trained on adults' speech. It is well known that, the shape of the vocal organs, pitch and speaking-rates are significantly different for adult and child speakers. Consequently, the recognition performance for children's speech in such mismatched setup is reported to be extremely poor. To address the acoustic mismatch resulting from the differences in pitch and vocal-tract geometry, a large number of studies have been reported that have presented a myriad of techniques. But, only a few works have studied the role of speaking-rate adaptation on children's speech recognition. Furthermore, those studies were performed on systems employing Gaussian mixture models. Motivated by these facts, we have explored speaking-rate adaptation in the context of systems employing deep neural network based acoustic modeling. Timescale modification using an approach based on phase-independent iterative spectrogram inversion is employed for speaking-rate adaptation. Significant reductions in errors are noted by adapting the speaking-rates. Furthermore, the effect of combining speaking-rate adaptation with vocal-tract length normalization and pitch scaling is also studied. Additive improvements are obtained by combining the explored techniques with speaking-rate adaptation.
Year
DOI
Venue
2018
10.1109/SPCOM.2018.8724465
2018 International Conference on Signal Processing and Communications (SPCOM)
Keywords
Field
DocType
Speech recognition,Acoustics,Histograms,Hidden Markov models,Adaptation models,Spectrogram,Discrete cosine transforms
Computer science,Speech recognition
Conference
ISSN
ISBN
Citations 
2474-9168
978-1-5386-3821-7
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Hemant Kumar Kathania1194.27
S. Shahnawazuddin26417.34
Waquar Ahmad385.90
Adiga, N.4103.60
S. K. Jana500.34
A. B. Samaddar610.69