Title
Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology
Abstract
Acoustic events have a hierarchical structure analogous to a tree (or a directed acyclic graph). In this work, we propose a structure-aware semi-supervised learning framework for acoustic event classification (AEC). Our hypothesis is that the audio label structure contains useful information that is not available in audios and plain tags. We show that by organizing audio representations with a human-curated tree ontology, we can improve the quality of the learned audio representations for downstream AEC tasks. We use consistency training to use large amounts of unlabeled data for structured representation manifold learning. Experimental results indicate that our framework learns high quality representations which enable us to achieve comparable performance in discriminative tasks as fully supervised baselines. Moreover, our framework can better handle audios with unseen tags by confidently assigning a super-category (internal node like "animal" in Fig. 1) tag to the audio.
Year
DOI
Venue
2022
10.1109/ICASSP43922.2022.9746266
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords
DocType
ISSN
Acoustic event classification,Representation learning,Audio ontology,Decision tree
Conference
1520-6149
ISBN
Citations 
PageRank 
978-1-6654-0541-6
0
0.34
References 
Authors
4
8
Name
Order
Citations
PageRank
Arman Zharmagambetov100.34
Qingming Tang200.34
Chieh-Chi Kao300.34
Qin Zhang400.34
Ming Sun59116.25
Viktor Rozgic6829.22
Jasha Droppo786168.35
Chao Wang8895190.04