Parametric Cepstral Mean Normalization For Robust Speech Recognition - Citegraph

Paper Info

Title
Parametric Cepstral Mean Normalization For Robust Speech Recognition

Abstract
This paper proposes a new channel normalization algorithm, parametric cepstral mean normalization (PCMN), to make speech recognition more robust to varying acoustic conditions. Rather than using a simple average of the input speech features as the channel estimate, as traditional CMN does, PCMN weights the running average of the input speech frames in a frequency-dependent manner. These weights are optimized jointly with the acoustic model parameters during training. Experimental results show that, in contrast to traditional CMN, which degrades performance on clean data, PCMN provides a 5% relative improvement on clean data and an 11.2% relative improvement on far-field test data. We also propose an adaptive version of PCMN, called aPCMN, in which both the input speech features and the channel estimates are weighted. These weights are computed at run time and change dynamically with the input speech. aPCMN provides a 13.0% relative improvement on the far-field test set while maintaining the 5% relative improvement on clean data.
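The abstract's contrast between plain CMN and a frequency-dependent weighting can be illustrated with a minimal sketch. This is not the paper's implementation: the exact form y[t][k] = x[t][k] - w[k] * mean[t][k], the cumulative running mean, the function names, and the example weights are all assumptions made for illustration. In the paper the weights are learned jointly with the acoustic model, which is not shown here.

```python
def running_mean(frames):
    """Cumulative mean of the frames seen so far, one estimate per frame."""
    dim = len(frames[0])
    total = [0.0] * dim
    means = []
    for t, frame in enumerate(frames, start=1):
        total = [s + x for s, x in zip(total, frame)]
        means.append([s / t for s in total])
    return means


def pcmn(frames, weights):
    """Subtract a per-coefficient weighted running mean from each frame.

    Assumed form (hypothetical, for illustration only):
        y[t][k] = x[t][k] - weights[k] * mean[t][k]
    """
    means = running_mean(frames)
    return [
        [x - w * m for x, w, m in zip(frame, weights, mean)]
        for frame, mean in zip(frames, means)
    ]


# Three toy 2-dimensional feature frames (made-up values).
frames = [[1.0, 10.0], [3.0, 10.0], [5.0, 10.0]]

# All-ones weights reduce the sketch to running-mean CMN.
cmn_like = pcmn(frames, [1.0, 1.0])

# Hypothetical fixed weights; a zero weight leaves that coefficient untouched.
weighted = pcmn(frames, [0.5, 0.0])
```

Under these assumptions, setting every weight to 1 recovers running-mean CMN and setting a weight to 0 disables normalization for that coefficient, so the weights interpolate per coefficient between the two behaviours.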

Year	DOI	Venue
2019	10.1109/icassp.2019.8683674	2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords	Field	DocType
Robust automatic speech recognition, cepstral mean normalization, channel normalization	Normalization (statistics),Pattern recognition,Computer science,Communication channel,Robustness (computer science),Speech recognition,Parametric statistics,Artificial intelligence,Test data,Moving average,Test set,Acoustic model	Conference
ISSN	Citations	PageRank
1520-6149	0	0.34
References	Authors
0	3

Authors (3 rows)

Name	Order	Citations	PageRank
Ozlem Kalinli	1	1	3.39
Gautam Bhattacharya	2	62	6.98
Chao Weng	3	113	19.75
