Title
3D Localization of Multiple Simultaneous Speakers with Discrete Wavelet Transform and Proposed 3D Nested Microphone Array
Abstract
Multiple sound source localization is one of the important topic in speech processing. GCC function is used as a traditional algorithm for sound source localization. This function estimates DOA for multiple speakers by calculation the cross-correlation between microphone signals but its accuracy decreases in adverse conditions. The aim of proposed method in this paper is localization of multiple simultaneous speakers in undesirable condition. The proposed method is based on novel 3D nested microphone array in combination with obtained information of Discrete Wavelet Transform (DWT) and subband processing. The proposed 3D nested microphone array prepares the condition for 3D localization and eliminates the spatial aliasing between microphone signals. Also, we propose the DWT for extraction the information of speech signal. Since, the spectral information of speech signal concentrates on low frequencies, we propose a structure of filter bank based on DWT to increase the frequency resolution on low frequencies. The performed evaluation on real and simulated data shows the superiority of our proposed method in comparison with Fullband and subband processing with uniform filters and uniform microphone array.
Year
DOI
Venue
2018
10.23919/EUSIPCO.2018.8553471
2018 26th European Signal Processing Conference (EUSIPCO)
Keywords
Field
DocType
Simultaneous sound source localization,Wavelet Transform,Generalized Cross-Correlation,Nested microphone array,Subband processing
Speech processing,Computer vision,3d localization,Computer science,Filter bank,Microphone array,Aliasing,Artificial intelligence,Discrete wavelet transform,Microphone,Acoustic source localization
Conference
ISSN
ISBN
Citations 
2219-5491
978-1-5386-3736-4
0
PageRank 
References 
Authors
0.34
4
4
Name
Order
Citations
PageRank
ali dehghan firoozabadi165.86
Hugo Durney273.00
Ismael Soto33616.67
Miguel Sanhueza-Olave401.35