Title
Feature domain compensation of nonstationary noise for robust speech recognition
Abstract
One of the key issues in practical speech recognition is achieving robustness against environmental mismatches caused by background noise or differing channels. Most conventional approaches compensate for the effects of such mismatches under the assumption that the environmental characteristics are stationary, an assumption that rarely holds in real conditions. In this paper, we propose an approach to cope with time-varying environmental characteristics. By directly modeling the environment evolution process and the clean speech feature distribution, we construct a set of multiple linear state space models. Suboptimal state estimation under the given model structure can be performed efficiently with the interacting multiple model (IMM) algorithm. In addition to providing a comprehensive description of the compensation technique, we propose an adaptive Kalman filtering approach with which nonstationary noise evolution characteristics can be tracked. Moreover, we propose a novel method for fixed-interval smoothing within the IMM framework. The performance of the presented compensation technique under both slowly and rapidly varying noise conditions is evaluated through a number of continuous digit recognition experiments.
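To make the IMM idea concrete, the following is a minimal sketch of one generic IMM cycle for scalar linear-Gaussian models. It is not the paper's implementation: the function name `imm_step`, the model keys `a`, `q`, `h`, `b`, `r`, and the scalar state are all assumptions chosen for illustration. Each cycle mixes the previous per-model estimates, runs one Kalman filter per model, updates the model probabilities from the innovation likelihoods, and combines the results.

```python
import math


def imm_step(y, means, variances, mode_probs, models, transition):
    """One IMM cycle for scalar linear-Gaussian models (illustrative sketch).

    Each model is a dict with hypothetical keys a, q, h, b, r:
        x_t = a * x_{t-1} + w,    w ~ N(0, q)
        y_t = h * x_t + b + v,    v ~ N(0, r)
    transition[i][j] is the probability of switching from model i to model j.
    """
    m = len(models)

    # 1. Mixing: predicted mode probabilities and mixed initial conditions.
    c = [sum(transition[i][j] * mode_probs[i] for i in range(m)) for j in range(m)]
    new_means, new_vars, likelihoods = [], [], []
    for j, mdl in enumerate(models):
        w = [transition[i][j] * mode_probs[i] / c[j] for i in range(m)]
        x0 = sum(w[i] * means[i] for i in range(m))
        p0 = sum(w[i] * (variances[i] + (means[i] - x0) ** 2) for i in range(m))

        # 2. Mode-matched Kalman filter: predict, then correct with y.
        x_pred = mdl["a"] * x0
        p_pred = mdl["a"] ** 2 * p0 + mdl["q"]
        innov = y - (mdl["h"] * x_pred + mdl["b"])
        s = mdl["h"] ** 2 * p_pred + mdl["r"]  # innovation variance
        gain = p_pred * mdl["h"] / s
        new_means.append(x_pred + gain * innov)
        new_vars.append((1.0 - gain * mdl["h"]) * p_pred)
        likelihoods.append(
            math.exp(-0.5 * innov ** 2 / s) / math.sqrt(2.0 * math.pi * s)
        )

    # 3. Mode probability update from the innovation likelihoods.
    unnorm = [likelihoods[j] * c[j] for j in range(m)]
    total = sum(unnorm)
    new_probs = [u / total for u in unnorm]

    # 4. Combination: moment-matched overall estimate and variance.
    x = sum(new_probs[j] * new_means[j] for j in range(m))
    p = sum(new_probs[j] * (new_vars[j] + (new_means[j] - x) ** 2) for j in range(m))
    return x, p, new_means, new_vars, new_probs
```

Calling `imm_step` once per frame, feeding the returned per-model means, variances, and probabilities back in, gives a running estimate. How the paper maps noise and clean speech onto the individual models is not specified in the abstract, so the observation model here is only a placeholder.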
Year: 2002
DOI: 10.1016/S0167-6393(01)00013-9
Venue: Speech Communication
Keywords: Robust speech recognition, Nonstationary noise, Interacting multiple model, Fixed-interval smoothing
Field: Pattern recognition, Computer science, Communication channel, Robustness (computer science), Speech recognition, Kalman filter, Smoothing, Artificial intelligence, Digit recognition, State space
DocType: Journal
Volume: 37
Issue: 3
ISSN: 0167-6393
Citations: 13
PageRank: 0.96
References: 13
Authors: 1
Name: Nam Soo Kim
Order: 1
Citations: 275
PageRank: 29.16