Towards Fast And Convenient End-To-End HRTF Personalization - Citegraph

Paper Info

Title
Towards Fast And Convenient End-To-End HRTF Personalization

Abstract
Incorporating individualized head-related transfer functions (HRTFs) into a high fidelity sound engine can further improve the perceived quality and realism of binaurally-rendered spatial audio. Traditional methods to measure individual HRTFs tend to be cumbersome, expensive and require physical access to the subject. To address these issues, we develop a convolutional neural network model that, given a single photo of an ear, predicts pinna landmarks that can be used to extract anthropometric features commonly used for HRTF personalization, and match to a database of subjects whose HRTFs and pictures are available. We propose and evaluate a system utilizing this model to generate an individualized HRTF using a minimal set of easily obtainable measurements: single photographs of both ears, as well as head and ear scale for matching interaural time difference (ITD). To extend the reach of our database we employ ideas from Kendall shape theory to match ears non-dimensionally, match all ears to right ears, and make corresponding changes to the database HRIRs. We also apply HAT models to the HRIRs to provide better matching.

Year	DOI	Venue
2022	10.1109/ICASSP43922.2022.9746315	ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords	DocType	ISSN
HRTFs,Landmark matching,Kendall Shape,Inter Aural time differences,HAT Model	Conference	1520-6149
ISBN	Citations	PageRank
978-1-6654-0541-6	0	0.34
References	Authors
5	3

Authors (3 rows)

Cited by (0 rows)

References (5 rows)

Name	Order	Citations	PageRank
Bowen Zhi	1	0	0.34
Dmitry N. Zotkin	2	171	19.06
Ramani Duraiswami	3	1721	161.98

1