Abstract | ||
---|---|---|
A perceptually motivated method is proposed for solving the permutation ambiguity of frequency-domain independent component analysis when the mixing environment is noisy and reverberant. In this method, perceptually irrelevant frequencies are removed from the speech spectrum using block based perceptual masking (simultaneous frequency masking) and then independent component analysis is applied. After source separation in frequency domain, a physical property of the mixing matrix, i.e., the coherency in adjacent frequencies, is utilized to solve the permutation ambiguity. From the simulation results it appears that the perceptual masking avoids the permutation problem. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/ICASSP.2005.1416293 | 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING |
Keywords | Field | DocType |
physical properties,frequency domain,audio signal processing,spectrum,speech recognition,speech coding,independent component analysis,frequency domain analysis,filter bank,simulation,acoustic noise,blind source separation,speech intelligibility | Frequency domain,Masking (art),Computer science,Permutation,Speech recognition,Independent component analysis,Audio signal processing,Blind signal separation,Source separation,Perceptual Masking | Conference |
ISSN | Citations | PageRank |
1520-6149 | 2 | 0.43 |
References | Authors | |
4 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ram Mohana Reddy Guddeti | 1 | 48 | 8.76 |
Bernard Mulgrew | 2 | 724 | 85.23 |