Title |
---|
Evaluation of Modulation Spectrum Equalization Techniques for Large Vocabulary Robust Speech Recognition |
Abstract |
---|
Previous approaches for modulation spectrum equalization were evaluated only for the Aurora 2 small vocabulary task. We further apply these approaches on the Aurora 4 large vocabulary task. In the spectral histogram equalization (SHE) approach, we equalize the histogram of the modulation spectrum for each utterance to a reference histogram obtained from clean training data. In the magnitude ratio equalization (MRE) approach, we equalize the magnitude ratio of lower to higher frequency components on the modulation spectrum to a reference value also obtained from clean training data. Experimental test results indicate significant performance improvements using these approaches when cascaded with cepstral mean and variance normalization (CMVN). Cascading MRE with more advanced feature normalization approaches such as histogram equalization (HEQ) and higher-order cepstral moment normalization (HOCMN) yielded additional performance improvements. |
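
The MRE idea in the abstract can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' implementation: the modulation spectrum is taken to be the DFT of one feature's time trajectory, the DC component is left out of the ratio, and the ratio is equalized by rescaling only the higher-frequency magnitudes. The function names and the parameters `cutoff` and `reference_ratio` are hypothetical; in the paper the reference value comes from clean training data.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse DFT, returning only the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def magnitude_ratio(spectrum, cutoff):
    """Ratio of summed magnitudes: bins 1..cutoff over bins cutoff+1..n//2.

    Assumption: the DC bin (k = 0) is excluded from the lower band.
    """
    half = len(spectrum) // 2
    low = sum(abs(spectrum[k]) for k in range(1, cutoff + 1))
    high = sum(abs(spectrum[k]) for k in range(cutoff + 1, half + 1))
    return low / high


def equalize_magnitude_ratio(trajectory, reference_ratio, cutoff):
    """Rescale the higher band so the magnitude ratio equals reference_ratio.

    Assumption: only the higher-frequency magnitudes are scaled; phases
    and the lower band are left untouched.
    """
    spectrum = dft(trajectory)
    n = len(spectrum)
    gain = magnitude_ratio(spectrum, cutoff) / reference_ratio
    for k in range(cutoff + 1, n // 2 + 1):
        spectrum[k] *= gain
        mirror = n - k  # conjugate-symmetric bin, keeps the output real
        if mirror != k:
            spectrum[mirror] *= gain
    return idft(spectrum)


# Usage: a toy trajectory with one slow and one fast modulation component.
toy = [math.sin(2 * math.pi * t / 16) + 0.5 * math.sin(2 * math.pi * 5 * t / 16)
       for t in range(16)]
equalized = equalize_magnitude_ratio(toy, reference_ratio=4.0, cutoff=2)
print(magnitude_ratio(dft(toy), 2), magnitude_ratio(dft(equalized), 2))
```

Scaling the higher band by the ratio of the observed value to the reference value brings the utterance's magnitude ratio to the reference while leaving the slow modulation components unchanged. Whether the original method scales the higher band, the lower band, or both is not stated in the abstract.
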
Year | Venue | Keywords |
---|---|---|
2008 | INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | temporal filtering, modulation spectrum, feature normalization, robust feature extraction |
Field | DocType | Citations |
---|---|---|
Speech processing, Pattern recognition, Equalization (audio), Voice activity detection, Computer science, Speech recognition, Artificial intelligence, Vocabulary, Modulation spectrum | Conference | 0 |
PageRank | References | Authors |
---|---|---|
0.34 | 1 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Liang-Che Sun | 1 | 36 | 3.43 |
Chang-Wen Hsu | 2 | 39 | 2.50 |
Lin-shan Lee | 3 | 1525 | 182.03 |