Title
Variational Model Composition For Robust Speech Recognition With Time-Varying Background Noise
Abstract
This paper proposes a novel model composition method to improve speech recognition performance in time-varying background noise conditions. It is suggested that each order of the cepstral coefficients represents the frequency degree of changing components in the envelope of the log-spectrum. With this motivation, in the proposed method, variational noise models are generated by selectively applying perturbation factors to a basis model, resulting in a collection of various types of spectral patterns in the log-spectral domain. The basis noise model is obtained from the silent duration segments of the input speech. The proposed Variational Model Composition (VMC) method is employed to generate multiple environmental models for our previously proposed feature compensation method. Experimental results prove that the proposed method is considerably more effective at increasing speech recognition performance in time-varying background noise conditions with 30.34% and 9.02% average relative improvements in word error rate for speech babble and background music conditions respectively, compared to an existing single model-based method.
Year
Venue
Keywords
2009
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5
variational model composition (VMC), time-varying noise, feature compensation, multiple environmental models, robust speech recognition
Field
DocType
Citations 
Background noise,Pattern recognition,Computer science,Variational model,Speech recognition,Artificial intelligence,Feature compensation
Conference
1
PageRank 
References 
Authors
0.36
1
2
Name
Order
Citations
PageRank
Wooil Kim112016.95
John H. L. Hansen23215365.75