Title
PSD and Signal Approximation-LSTM Based Speech Enhancement
Abstract
Monaural speech enhancement is a challenging problem because the desired signal is estimated from singlechannel recordings. Numbers of methods have been proposed, however, due to the ignored pertinence of the specific frequency range of speech signals, the performance of the current approaches is limited. In this paper, we divide the speech mixture into two subbands and extract the desired speech signal from each frequency band based on the power spectral density (PSD) of noise mixtures. The proposed method trains two long short-term memory (LSTM) recurrent neural networks (RNNs) in parallel for the subband short time Fourier transform (STFT) of speech segments. The proposed LSTM RNN-based signal approximation (SA) method is evaluated with the IEEE and the TIMIT datasets with various noise interferences from the NOISEX dataset. The evaluation results confirm that the proposed method outperforms the state-of-the-art.
Year
DOI
Venue
2019
10.1109/ICSPCS47537.2019.9008731
2019 13th International Conference on Signal Processing and Communication Systems (ICSPCS)
Keywords
DocType
ISBN
Monaural speech enhancement,power spectral density,long short-term memory,recurrent neural network,short time Fourier transform
Conference
978-1-7281-2195-6
Citations 
PageRank 
References 
0
0.34
13
Authors
3
Name
Order
Citations
PageRank
Yi Li121.40
Yang Sun24615.21
Syed Mohsen Naqvi341755.49