Title | ||
---|---|---|
Generalized Method For Solving The Permutation Problem In Frequency-Domain Blind Source Separation Of Convolved Speech Signals |
Abstract | ||
---|---|---|
The blind speech separation of convolutive mixtures can be performed in the time-frequency domain. The separation problem becomes to a set of instantaneous mixing problems, one for each frequency bin, that can be solved independently by any appropiated instantaneous ICA algorithm. However, the arbitrary order of the estimated sources in each frequency, known as permutation problem, has to be solved to succesfully recover the original sources. This paper deals with the permutation problem in the general case of N sources and N observations. The proposed method combines a correlation approach based on the amplitude correlation property of speech signals, and an optimal pairing scheme to align the permuted solutions. Our method is robust to artificially permuted speech signals. Experimental results on simulated convolutive mixtures show the effectiveness of the proposed method in terms of quality of separated signals by objective and perceptually measures. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | convolutive mixtures, permutation correction, frequency domain separation |
Field | DocType | Citations |
Frequency domain,Pattern recognition,Computer science,Convolution,Permutation,Speech recognition,Artificial intelligence,Blind signal separation | Conference | 3 |
PageRank | References | Authors |
0.39 | 4 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Auxiliadora Sarmiento | 1 | 28 | 4.64 |
Iván Durán-Díaz | 2 | 21 | 3.85 |
Sergio Cruces | 3 | 206 | 19.05 |
Pablo Aguilera-Bonet | 4 | 20 | 3.15 |