Title
GLMSnet: Single Channel Speech Separation Framework in Noisy and Reverberant Environments
Abstract
In real noisy and reverberant environments, the performance of current single channel speech separation algorithms decreases significantly. Given this situation, this paper proposes a novel speech separation framework, called Graph convolution and Leading global Multi-scale separation network (GLMSnet). The graph convolution network (GCN) is introduced on high-level features for modeling global context and incorporating long-range information, and it can be arbitrarily inserted into the desired position. Furthermore, Global multi-scale convolution is proposed to aggregate different levels features and improve the audio quality of separation. The leading factor is applied to increase valid information of target speech. We evaluate our method on WHAMR! Database. The results show that our proposed method can obtain state-of-the-art speech separation effect in the presence of noise and reverberation. Compared with the most advanced model before, the performance is improved by 22.7%.
Year
DOI
Venue
2021
10.1109/ASRU51503.2021.9688217
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
Keywords
DocType
ISBN
Speech separation,speech enhancement,cock-tail party problem,reverberation
Conference
978-1-6654-3740-0
Citations 
PageRank 
References 
0
0.34
0
Authors
5
Name
Order
Citations
PageRank
Huiyu Shi100.34
Xi Chen233370.76
Tianlong Kong300.34
shouyi yin457999.95
Peng Ouyang500.34