Title | ||
---|---|---|
Immersive Audio Coding For Virtual Reality Using A Metadata-Assisted Extension Of The 3gpp Evs Codec |
Abstract | ||
---|---|---|
Virtual Reality (VR) audio scenes may be composed of a very large number of audio elements, including dynamic audio objects, fixed audio channels and scene-based audio elements such as Higher Order Ambisonics (HOA). Potentially, the subjective listening experience may be replicated using a compact spatial format with a set number of dynamic objects and scene-based elements, retaining only the perceptual essence of the audio scene. The compact format would further enable a reduction in the complexity of subsequent compression and rendering. This paper investigates these hypotheses by exploring the use of a compact format that consists of up to four dynamic objects and nine HOA channels, with the Enhanced Voice Services (EVS) codec being applied to a 4-channel down-mix of the compact format. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/icassp.2019.8683712 | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Keywords | Field | DocType |
Audio Coding, Virtual Reality, Spatial Audio, Immersive Audio, Ambisonics | Metadata,Virtual reality,Pattern recognition,Computer science,Ambisonics,Communication channel,Coding (social sciences),Immersion (virtual reality),Artificial intelligence,Rendering (computer graphics),Multimedia,Codec | Conference |
ISSN | Citations | PageRank |
1520-6149 | 0 | 0.34 |
References | Authors | |
0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
D. McGrath | 1 | 0 | 0.34 |
S. Bruhn | 2 | 18 | 2.02 |
H. Purnhagen | 3 | 0 | 0.34 |
M. Eckert | 4 | 0 | 0.34 |
J. Torres | 5 | 0 | 0.34 |
Robert Brown | 6 | 27 | 1.07 |
D. Darcy | 7 | 0 | 0.34 |