Title
(Deep) Learning from Frames
Abstract
Learning content from videos is not an easy task and traditional machine learning approaches for computer vision have difficulties in doing it satisfactorily. However, in the past couple of years the machine learning community has seen the rise of deep learning methods that significantly improve the accuracy of several computer vision applications, e.g., Convolutional Neural Networks (ConvNets). In this paper, we explore the suitability of ConvNets for the movie trailers genre classification problem. Assigning genres to movies is particularly challenging because genre is an immaterial feature that is not physically present in a movie frame, so off-the-shelf image detection models cannot be directly applied to this context. Hence, we propose a novel classification method that encapsulates multiple distinct ConvNets to perform genre classification, namely CoNNECT, where each ConvNet learns features that capture distinct aspects from the movie frames. We compare our novel approach with the current state-of-the-art techniques for movie classification, which make use of well-known image descriptors and low-level handcrafted features. Results show that CoNNECT significantly outperforms the state-of-the-art approaches in this task, moving towards effectively solving the genre classification problem.
Year
DOI
Venue
2016
10.1109/BRACIS.2016.012
2016 5th Brazilian Conference on Intelligent Systems (BRACIS)
Keywords
Field
DocType
Movie Genre Classification,Convolutional Neural Networks,Deep Learning,Computer Vision,Video Classification
Computer science,Convolutional neural network,Image detection,Visual descriptors,Artificial intelligence,Deep learning,Machine learning
Conference
ISBN
Citations 
PageRank 
978-1-5090-3567-0
0
0.34
References 
Authors
15
5
Name
Order
Citations
PageRank
Jonatas Wehrmann1305.42
Rodrigo C. Barros244832.54
Gabriel S. Simoes350.77
Thomas S. Paula400.34
Duncan Dubugras Alcoba Ruiz514615.54