PSD and Signal Approximation-LSTM Based Speech Enhancement - Citegraph

Paper Info

Title
PSD and Signal Approximation-LSTM Based Speech Enhancement

Abstract
Monaural speech enhancement is a challenging problem because the desired signal is estimated from singlechannel recordings. Numbers of methods have been proposed, however, due to the ignored pertinence of the specific frequency range of speech signals, the performance of the current approaches is limited. In this paper, we divide the speech mixture into two subbands and extract the desired speech signal from each frequency band based on the power spectral density (PSD) of noise mixtures. The proposed method trains two long short-term memory (LSTM) recurrent neural networks (RNNs) in parallel for the subband short time Fourier transform (STFT) of speech segments. The proposed LSTM RNN-based signal approximation (SA) method is evaluated with the IEEE and the TIMIT datasets with various noise interferences from the NOISEX dataset. The evaluation results confirm that the proposed method outperforms the state-of-the-art.

Year	DOI	Venue
2019	10.1109/ICSPCS47537.2019.9008731	2019 13th International Conference on Signal Processing and Communication Systems (ICSPCS)
Keywords	DocType	ISBN
Monaural speech enhancement,power spectral density,long short-term memory,recurrent neural network,short time Fourier transform	Conference	978-1-7281-2195-6
Citations	PageRank	References
0	0.34	13
Authors
3

Authors (3 rows)

Cited by (0 rows)

References (13 rows)

Name	Order	Citations	PageRank
Yi Li	1	2	1.40
Yang Sun	2	46	15.21
Syed Mohsen Naqvi	3	417	55.49

1