Title | ||
---|---|---|
Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation |
Abstract | ||
---|---|---|
In the work presented in this paper, we have explored the effect of speaking-rate adaptation on children's speech recognition using acoustic models trained on adults' speech. It is well known that, the shape of the vocal organs, pitch and speaking-rates are significantly different for adult and child speakers. Consequently, the recognition performance for children's speech in such mismatched setup is reported to be extremely poor. To address the acoustic mismatch resulting from the differences in pitch and vocal-tract geometry, a large number of studies have been reported that have presented a myriad of techniques. But, only a few works have studied the role of speaking-rate adaptation on children's speech recognition. Furthermore, those studies were performed on systems employing Gaussian mixture models. Motivated by these facts, we have explored speaking-rate adaptation in the context of systems employing deep neural network based acoustic modeling. Timescale modification using an approach based on phase-independent iterative spectrogram inversion is employed for speaking-rate adaptation. Significant reductions in errors are noted by adapting the speaking-rates. Furthermore, the effect of combining speaking-rate adaptation with vocal-tract length normalization and pitch scaling is also studied. Additive improvements are obtained by combining the explored techniques with speaking-rate adaptation. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/SPCOM.2018.8724465 | 2018 International Conference on Signal Processing and Communications (SPCOM) |
Keywords | Field | DocType |
Speech recognition,Acoustics,Histograms,Hidden Markov models,Adaptation models,Spectrogram,Discrete cosine transforms | Computer science,Speech recognition | Conference |
ISSN | ISBN | Citations |
2474-9168 | 978-1-5386-3821-7 | 0 |
PageRank | References | Authors |
0.34 | 0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hemant Kumar Kathania | 1 | 19 | 4.27 |
S. Shahnawazuddin | 2 | 64 | 17.34 |
Waquar Ahmad | 3 | 8 | 5.90 |
Adiga, N. | 4 | 10 | 3.60 |
S. K. Jana | 5 | 0 | 0.34 |
A. B. Samaddar | 6 | 1 | 0.69 |