Title
Robust Localisation Of Multiple Speakers Exploiting Head Movements And Multi-Conditional Training Of Binaural Cues
Abstract
This paper addresses the problem of localising multiple competing speakers in the presence of room reverberation, where sound sources can be positioned at any azimuth on the horizontal plane. To reduce the number of front-back confusions, which can occur due to the similarity of interaural time differences (ITDs) and interaural level differences (ILDs) in the front and rear hemifields, a machine hearing system is presented which combines supervised learning of binaural cues using multi-conditional training (MCT) with a head movement strategy. A systematic evaluation showed that this approach substantially reduced the number of front-back confusions in challenging acoustic scenarios. Moreover, the system was able to generalise to a variety of different acoustic conditions not seen during training.
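The front-back ambiguity mentioned in the abstract, and why a head rotation can resolve it, can be illustrated with a minimal sketch. This is not the paper's system (which learns binaural cues with MCT); it assumes a simple far-field model of two receivers with no head shadowing, and the ear spacing, speed of sound, and function name `itd_seconds` are illustrative assumptions.

```python
import math

EAR_DISTANCE_M = 0.18    # assumed spacing between the two receivers
SPEED_OF_SOUND = 343.0   # m/s, assumed

def itd_seconds(azimuth_deg, head_rotation_deg=0.0):
    """Far-field two-receiver ITD for a source at azimuth_deg.

    0 deg is straight ahead, 90 deg is to one side, 180 deg is behind.
    head_rotation_deg turns the head in the same angular direction.
    """
    relative = math.radians(azimuth_deg - head_rotation_deg)
    return (EAR_DISTANCE_M / SPEED_OF_SOUND) * math.sin(relative)

front, rear = 30.0, 150.0  # mirror-image positions about the interaural axis

# With the head still, both positions give (numerically) the same ITD.
print(itd_seconds(front), itd_seconds(rear))

# After a 20 degree head rotation the two ITDs move in opposite directions,
# so the front and rear hypotheses can be told apart.
print(itd_seconds(front, 20.0), itd_seconds(rear, 20.0))
```

Under this simplified model the ITD depends only on the sine of the relative azimuth, so a source at 30 degrees and one at 150 degrees are indistinguishable until the head turns.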
Year
2015
Venue
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP)
Keywords
binaural sound source localisation, head movements, multi-conditional training, generalisation
Field
Reverberation, Computer science, Generalization, Azimuth, Auditory system, Supervised learning, Robustness (computer science), Speech recognition, Binaural recording, Horizontal plane
DocType
Conference
ISSN
1520-6149
Citations
9
PageRank
0.73
References
9
Authors
3
Name | Order | Citations | PageRank
Tobias May | 1 | 43 | 4.97
Ning Ma | 2 | 21 | 2.66
Guy J. Brown | 3 | 31 | 3.38