Title | ||
---|---|---|
Modified Splice And Its Extension To Non-Stereo Data For Noise Robust Speech Recognition |
Abstract | ||
---|---|---|
In this paper, a modification to the training process of the popular SPLICE algorithm has been proposed for noise robust speech recognition. The modification is based on feature correlations, and enables this stereo-based algorithm to improve the performance in all noise conditions, especially in unseen cases. Further, the modified framework is extended to work for non-stereo datasets where clean and noisy training utterances, but not stereo counterparts, are required. Finally, an MLLR-based computationally efficient run-time noise adaptation method in SPLICE framework has been proposed. The modified SPLICE shows 8.6% absolute improvement over SPLICE in Test C of Aurora-2 database, and 2.93% overall. Non-stereo method shows 10.37% and 6.93% absolute improvements over Aurora-2 and Aurora-4 baseline models respectively. Run-time adaptation shows 9.89% absolute improvement in modified framework as compared to SPLICE for Test C, and 4.96% overall w.r.t. standard MLLR adaptation on HMMs. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/ASRU.2013.6707725 | 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) |
Keywords | DocType | Volume |
Robust speech recognition, SPLICE, stereo data, feature normalisation, MFCC | Conference | abs/1307.4048 |
Citations | PageRank | References |
0 | 0.34 | 5 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
D. S. Pavan Kumar | 1 | 0 | 0.34 |
N Vishnu Prasad | 2 | 2 | 1.12 |
Vikas Joshi | 3 | 17 | 4.51 |
Srinivasan Umesh | 4 | 93 | 16.31 |