Title | ||
---|---|---|
Monaural Source Separation Based On Sequentially Trained Lstms In Real Room Environments |
Abstract | ||
---|---|---|
In recent studies on Monaural Source Separation (MSS), the long short-term memory (LSTM) network has been introduced to solve this problem, however, its performance is still limited particularly in real room environments. According to the training objectives, the LSTM-based MSS is categorized into three aspects, namely mapping, masking and signal approximation (SA) based methods. In this paper, we introduce dereverberation mask (DM) and establish a system to train two SA-LSTMs sequentially, which dereverberate speech mixture and improve the separation performance. The DM is exploited as the training target of the first LSTM. Then, the enhanced ratio mask (ERM) is proposed and set as the training target of the second LSTM. We evaluate the proposed method with the IEEE and the TIMIT datasets with real room impulse responses and noise interferences from the NOISEX dataset. The detailed evaluations confirm that the proposed method outperforms the state-of-the-art. |
Year | DOI | Venue |
---|---|---|
2019 | 10.23919/EUSIPCO.2019.8902640 | 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) |
Keywords | Field | DocType |
Monaural source separation, long short-term memory, signal approximation, dereverberation mask, enhanced ratio mask | TIMIT,Masking (art),Computer science,Long short term memory,Speech recognition,Impulse (physics),Monaural,Source separation | Conference |
ISSN | Citations | PageRank |
2076-1465 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yi Li | 1 | 2 | 1.40 |
Yang Sun | 2 | 46 | 15.21 |
Syed Mohsen Naqvi | 3 | 27 | 8.01 |