Title
Towards Fast And Convenient End-To-End HRTF Personalization
Abstract
Incorporating individualized head-related transfer functions (HRTFs) into a high fidelity sound engine can further improve the perceived quality and realism of binaurally-rendered spatial audio. Traditional methods to measure individual HRTFs tend to be cumbersome, expensive and require physical access to the subject. To address these issues, we develop a convolutional neural network model that, given a single photo of an ear, predicts pinna landmarks that can be used to extract anthropometric features commonly used for HRTF personalization, and match to a database of subjects whose HRTFs and pictures are available. We propose and evaluate a system utilizing this model to generate an individualized HRTF using a minimal set of easily obtainable measurements: single photographs of both ears, as well as head and ear scale for matching interaural time difference (ITD). To extend the reach of our database we employ ideas from Kendall shape theory to match ears non-dimensionally, match all ears to right ears, and make corresponding changes to the database HRIRs. We also apply HAT models to the HRIRs to provide better matching.
Year
DOI
Venue
2022
10.1109/ICASSP43922.2022.9746315
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords
DocType
ISSN
HRTFs,Landmark matching,Kendall Shape,Inter Aural time differences,HAT Model
Conference
1520-6149
ISBN
Citations 
PageRank 
978-1-6654-0541-6
0
0.34
References 
Authors
5
3
Name
Order
Citations
PageRank
Bowen Zhi100.34
Dmitry N. Zotkin217119.06
Ramani Duraiswami31721161.98