Title
Multimodal fusion for indoor sound source localization
Abstract
•We propose a novel solution based on fusing visual and acoustic models to accurately identify the localization information of sound localization.•We develop a HMM-based method for separation of the acoustic transfer function (ATF) to describe clean speech sound.•We propose a new Fourier domain method for fast implementation of the HOG-type polar feature descriptor.•The proposed method has rotation-invariant capabilities and also preserves the discriminative power of extracted features.
Year
DOI
Venue
2021
10.1016/j.patcog.2021.107906
Pattern Recognition
Keywords
DocType
Volume
Sound source localization,Acoustic transfer function,HMM,Polar HOG,SVM
Journal
115
Issue
ISSN
Citations 
1
0031-3203
1
PageRank 
References 
Authors
0.35
35
7
Name
Order
Citations
PageRank
Jinhui Chen110.35
Ryoichi Takashima29512.16
Xingchen Guo311.36
Zhihong Zhang410015.85
Xuexin Xu511.02
Tetsuya Takiguchi6858.77
Edwin R. Hancock75432462.92