Title
Exploiting Deep Neural Networks And Head Movements For Binaural Localisation Of Multiple Speakers In Reverberant Conditions
Abstract
This paper presents a novel machine-hearing system that exploits deep neural networks (DNNs) and head movements for binaural localisation of multiple speakers in reverberant conditions. DNNs are used to map binaural features, consisting of the complete cross-correlation function (CCF) and interaural level differences (ILDs), to the source azimuth. Our approach was evaluated using a localisation task in which sources were located in a full 360-degree azimuth range. As a result, front back confusions often occurred due to the similarity of binaural features in the front and rear hemifields. To address this, a head movement strategy was incorporated in the DNN-based model to help reduce the front-back errors. Our experiments show that, compared to a system based on a Gaussian mixture model (GMM) classifier, the proposed DNN system substantially reduces localisation errors under challenging acoustic scenarios in which multiple speakers and room reverberation are present.
Year
Venue
Keywords
2015
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5
Binaural source localisation, deep neural networks, head movements, machine hearing, reverberation
Field
DocType
Citations 
Reverberation,Pattern recognition,Computer science,Head movements,Azimuth,Speech recognition,Artificial intelligence,Binaural recording,Classifier (linguistics),Mixture model,Deep neural networks
Conference
5
PageRank 
References 
Authors
0.55
8
3
Name
Order
Citations
PageRank
Ning Ma122417.86
Guy J. Brown276097.54
Tobias May3434.97