Title
Deep Embeddings For Rare Audio Event Detection With Imbalanced Data
Abstract
In this paper, we present a method to handle data imbalance for classification with neural networks, and apply it to acoustic event detection (AED) problem. The common approach to tackle data imbalance is to use class-weights in the objective function while training. An existing more sophisticated approach is to map the input to clusters in an embedding space, so that learning is locally balanced by incorporating inter-cluster and inter-class margins. On these lines, we propose a method to learn the embedding using a novel objective function, called triple-header cross entropy. Our scheme integrates in a simple way with back-propagation based training, and is computationally more efficient than general hinge-loss based embedding learning schemes. The empirical evaluation results demonstrate the effectiveness of the proposed method for AED with imbalanced training data.
Year
DOI
Venue
2019
10.1109/icassp.2019.8682395
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords
Field
DocType
Data imbalance, embedding space, acoustic event detection, neural networks, classification
Cross entropy,Training set,Embedding,Pattern recognition,Computer science,Data imbalance,Acoustic event detection,Artificial intelligence,Artificial neural network,Hidden Markov model,Machine learning
Conference
ISSN
Citations 
PageRank 
1520-6149
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Vipul Arora1326.24
Ming Sun29116.25
Chao Wang3895190.04