Monaural Source Separation Based On Sequentially Trained Lstms In Real Room Environments - Citegraph

Paper Info

Title
Monaural Source Separation Based On Sequentially Trained Lstms In Real Room Environments

Abstract
In recent studies on Monaural Source Separation (MSS), the long short-term memory (LSTM) network has been introduced to solve this problem, however, its performance is still limited particularly in real room environments. According to the training objectives, the LSTM-based MSS is categorized into three aspects, namely mapping, masking and signal approximation (SA) based methods. In this paper, we introduce dereverberation mask (DM) and establish a system to train two SA-LSTMs sequentially, which dereverberate speech mixture and improve the separation performance. The DM is exploited as the training target of the first LSTM. Then, the enhanced ratio mask (ERM) is proposed and set as the training target of the second LSTM. We evaluate the proposed method with the IEEE and the TIMIT datasets with real room impulse responses and noise interferences from the NOISEX dataset. The detailed evaluations confirm that the proposed method outperforms the state-of-the-art.

Year	DOI	Venue
2019	10.23919/EUSIPCO.2019.8902640	2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)
Keywords	Field	DocType
Monaural source separation, long short-term memory, signal approximation, dereverberation mask, enhanced ratio mask	TIMIT,Masking (art),Computer science,Long short term memory,Speech recognition,Impulse (physics),Monaural,Source separation	Conference
ISSN	Citations	PageRank
2076-1465	0	0.34
References	Authors
0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yi Li	1	2	1.40
Yang Sun	2	46	15.21
Syed Mohsen Naqvi	3	27	8.01

1