Title
Insights into Audio-Based Multimedia Event Classification with Neural Networks
Abstract
Multimedia Event Detection (MED) aims to identify events-also called scenes-in videos, such as a flash mob or a wedding ceremony. Audio content information complements cues such as visual content and text. In this paper, we explore the optimization of neural networks (NNs) for audio-based multimedia event classification, and discuss some insights towards more effectively using this paradigm for MED. We explore different architectures, in terms of number of layers and number of neurons. We also assess the performance impact of pre-training with Restricted Boltzmann Machines (RBMs) in contrast with random initialization, and explore the effect of varying the context window for the input to the NNs. Lastly, we compare the performance of Hidden Markov Models (HMMs) with a discriminative classifier for the event classification. We used the publicly available event-annotated YLI-MED dataset. Our results showed a performance improvement of more than 6% absolute accuracy compared to the latest results reported in the literature. Interestingly, these results were obtained with a single-layer neural network with random initialization, suggesting that standard approaches with deep learning and RBM pre-training are not fully adequate to address the high-level video event-classification task.
Year
DOI
Venue
2015
10.1145/2814815.2814816
MMCommons@ACM Multimedia
Keywords
Field
DocType
Multimedia Event Classification,Multimedia Event Detection,Audio,Video,Web Video,Context Windows,Neural Networks,Hidden Markov Models
Boltzmann machine,Computer science,Artificial intelligence,Deep learning,Artificial neural network,Classifier (linguistics),Discriminative model,Computer vision,Speech recognition,Initialization,Hidden Markov model,Multimedia,Machine learning,Performance improvement
Conference
Citations 
PageRank 
References 
4
0.44
10
Authors
4
Name
Order
Citations
PageRank
Mirco Ravanelli118517.87
Benjamin Elizalde235922.38
Julia Bernd3194.98
Gerald Friedland4112796.23