Title
SSLIDE: SOUND SOURCE LOCALIZATION FOR INDOORS BASED ON DEEP LEARNING
Abstract
This paper presents SSLIDE, Sound Source Localization for Indoors using DEep learning, which applies deep neural networks (DNNs) with encoder-decoder structure to localize sound sources with random positions in a continuous space. The spatial features of sound signals received by each microphone are extracted and represented as likelihood surfaces for the sound source locations in each point. Our DNN consists of an encoder network followed by two decoders. The encoder obtains a compressed representation of the input likelihoods. One decoder resolves the multipath caused by reverberation, and the other decoder estimates the source location. Experiments based on both the simulated and experimental data show that our method can not only outperform multiple signal classification (MUSIC), steered response power with phase transform (SRP-PHAT), sparse Bayesian learning (SBL), and a competing convolutional neural network (CNN) approach in the reverberant environment but also achieve a good generalization performance.
Year
DOI
Venue
2021
10.1109/ICASSP39728.2021.9415109
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021)
Keywords
DocType
Citations 
Indoor sound source localization, multipath, encoder-decoder structure, deep neural networks
Conference
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Yifan Wu100.34
Roshan Sai Ayyalasomayajula251.87
Michael Lo Bianco343.35
Dinesh Bharadia482247.06
Peter Gerstoft58622.34