Title
Monaural Source Separation Based On Sequentially Trained Lstms In Real Room Environments
Abstract
In recent studies on Monaural Source Separation (MSS), the long short-term memory (LSTM) network has been introduced to solve this problem, however, its performance is still limited particularly in real room environments. According to the training objectives, the LSTM-based MSS is categorized into three aspects, namely mapping, masking and signal approximation (SA) based methods. In this paper, we introduce dereverberation mask (DM) and establish a system to train two SA-LSTMs sequentially, which dereverberate speech mixture and improve the separation performance. The DM is exploited as the training target of the first LSTM. Then, the enhanced ratio mask (ERM) is proposed and set as the training target of the second LSTM. We evaluate the proposed method with the IEEE and the TIMIT datasets with real room impulse responses and noise interferences from the NOISEX dataset. The detailed evaluations confirm that the proposed method outperforms the state-of-the-art.
Year
DOI
Venue
2019
10.23919/EUSIPCO.2019.8902640
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)
Keywords
Field
DocType
Monaural source separation, long short-term memory, signal approximation, dereverberation mask, enhanced ratio mask
TIMIT,Masking (art),Computer science,Long short term memory,Speech recognition,Impulse (physics),Monaural,Source separation
Conference
ISSN
Citations 
PageRank 
2076-1465
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Yi Li121.40
Yang Sun24615.21
Syed Mohsen Naqvi3278.01