Abstract | ||
---|---|---|
This paper describes a novel modification of Histogram Equalization (HEQ) approach to robust speech recognition. We propose separate equalization of the high frequency (HF) and low frequency (LF) bands. We study different combinations of the sub-band equalization and obtain best results when we perform a two-stage equalization. First, conventional HEQ is performed on the cepstral features, which does not completely equalize HF and LF bands, even though the overall histogram equalization is good. In the second stage, an equalization is done separately on the HF and the LF components of the above equalized cepstra. We refer to this approach as Sub-band Histogram Equalization (S-HEQ). The new set of features has better equalization of the sub-bands as well as the overall cepstral histogram. Recognition results show a relative improvement of 12% and 15% over conventional HEQ in WER on Aurora-2 and Aurora-4 databases respectively. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | Histogram Equalization, S-HEQ, Noise robust speech recognition |
Field | DocType | Citations |
Histogram,Equalization (audio),Pattern recognition,Computer science,Cepstrum,Histogram matching,Speech recognition,Adaptive histogram equalization,Artificial intelligence,Histogram equalization,Color normalization | Conference | 6 |
PageRank | References | Authors |
0.47 | 4 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vikas Joshi | 1 | 17 | 4.51 |
Raghavendra Bilgi | 2 | 14 | 1.67 |
Srinivasan Umesh | 3 | 93 | 16.31 |
Luz García | 4 | 63 | 9.48 |
M. Carmen Benítez | 5 | 303 | 25.05 |