Title
Single-Channel Speech Enhancement with Sequentially Trained DNN System
Abstract
A recent approach to speech enhancement, especially in the monaural case, is to learn the mapping function between a noisy speech mixture and the clean speech signal with a trained deep neural network (DNN). Such a model, however, often overfits the training data and performs poorly on noise and interference that were unseen during training. To address this issue and improve the generalization ability of the model, we propose an enhancement system with two sequentially trained DNNs. The two DNNs use different training targets: one removes the noise interference, and the other further improves the quality with a time-frequency (T-F) mask. The TIMIT corpus and the non-speech noise and NOISEX datasets are used to generate the training and testing data. Evaluations using perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI) and signal-to-distortion ratio (SDR) show that the proposed method outperforms the state-of-the-art method.
Year
2019
DOI
10.1109/ICSPCS47537.2019.9008699
Venue
2019 13th International Conference on Signal Processing and Communication Systems (ICSPCS)
Keywords
speech enhancement, mapping-based, sequentially, time-frequency mask
DocType
Conference
ISBN
978-1-7281-2195-6
Citations
0
PageRank
0.34
References
13
Authors
4
Name                 Order    Citations    PageRank
Yang Sun             1        46           15.21
Yang Xian            2        0            1.01
Wenwu Wang           3        333          52.60
Syed Mohsen Naqvi    4        27           8.01