Abstract | ||
---|---|---|
This paper analyzes the benefits and drawbacks of PEQ (Parametric Non-linear Equalization), a features normalization technique based on the parametric equalization of the MFCC parameters to match a reference probability distribution. Two limitations have been outlined: the distortion intrinsic to the normalization process and the lack of accuracy in estimating normalization statistics on short sentences. Two evolutions of PEQ are presented as solutions to the limitations encountered. The effects of the proposed evolutions are evaluated on three speech corpora, namely WSJ0, AURORA-3 and HIWIRE cockpit databases, with different mismatch conditions given by convolutional and/or additive noise and non-native speakers. The obtained results show that the encountered limitations can be overcome by the newly introduced techniques. |
Year | Venue | Keywords |
---|---|---|
2009 | INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | Histogram Equalization, Parametric Equalization, Feature Normalization, Robust Speech Recognition |
Field | DocType | Citations |
Nonlinear system,Pattern recognition,Equalization (audio),Computer science,Speech recognition,Parametric statistics,Artificial intelligence | Conference | 2 |
PageRank | References | Authors |
0.40 | 5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Luz García | 1 | 63 | 9.48 |
Roberto Gemello | 2 | 228 | 32.16 |
Franco Mana | 3 | 196 | 25.23 |
José C. Segura | 4 | 481 | 38.14 |