Convolutional maxout neural networks for speech separation. - Citegraph

Paper Info

Title
Convolutional maxout neural networks for speech separation.

Abstract
Speech separation based on deep neural networks (DNNs) has been widely studied recently, and has achieved considerable success. However, previous studies are mostly based on fully-connected neural networks. In order to capture the local information of speech signals, we propose to use convolutional maxout neural networks (CMNNs) to separate speech and noise by estimating the ideal ratio mask of the time-frequency units. In our work the proposed CMNN is applied in the frequency domain. By using local filtering and max-pooling, convolutional neural networks can model the local structure of speech signals. Instead of sigmoid function, maxout is selected to address the saturation problem. In addition, dropout is integrated into the network to get better generalization ability. The proposed system outperforms a traditional DNN-based system in both objective speech quality and intelligibility.

Year	DOI	Venue
2015	10.1109/ISSPIT.2015.7394335	ISSPIT
Keywords	Field	DocType
convolutional maxout neural network,speech separation,deep neural network,local information capture,time-frequency unit,local filtering,max-pooling,objective speech quality,objective speech intelligibility	Speech processing,Pattern recognition,Computer science,Voice activity detection,Convolutional neural network,Speech recognition,Time delay neural network,Artificial intelligence,Deep learning,Artificial neural network,Acoustic model,Intelligibility (communication)	Conference
Citations	PageRank	References
3	0.47	13
Authors
6

Authors (6 rows)

Cited by (3 rows)

References (13 rows)

Name	Order	Citations	PageRank
Like Hui	1	8	2.92
Meng Cai	2	68	8.24
Cong Guo	3	3	0.47
Liang He	4	67	17.35
Wei-Qiang Zhang	5	136	31.22
Jia Liu	6	277	50.34

1