Title
Deep Spatio-Temporal Representation Learning for Multi-Class Imbalanced Data Classification
Abstract
Deep learning, particularly Convolutional Neural Networks (CNNs), has significantly improved visual data processing. In recent years, video classification has attracted significant attention in the multimedia and deep learning community. It is one of the most challenging tasks since both visual and temporal information should be processed effectively. Existing techniques either disregard temporal information between video sequences or generate very complex and computationally expensive models to integrate the spatio-temporal data. In addition, most deep learning techniques do not automatically consider the data imbalance problem. This paper presents an effective deep learning framework for imbalanced video classification by utilizing both spatial and temporal information. This framework includes a spatio-temporal synthetic oversampling to handle data with a skewed distribution, a pre-trained CNN model for spatial sequence feature extraction, followed by a residual bidirectional Long Short Term Memory (LSTM) to capture temporal knowledge in video datasets. Experimental results on two imbalanced video datasets demonstrate the superiority of the proposed framework compared to the state-of-the-art approaches.
Year
DOI
Venue
2018
10.1109/IRI.2018.00064
2018 IEEE International Conference on Information Reuse and Integration (IRI)
Keywords
Field
DocType
Deep learning,spatio-temporal learning,multi-class imbalanced data,video classification,CNN,LSTM
Data modeling,Data processing,Oversampling,Convolutional neural network,Computer science,Feature extraction,Artificial intelligence,Data classification,Deep learning,Machine learning,Feature learning
Conference
ISBN
Citations 
PageRank 
978-1-5386-2660-3
0
0.34
References 
Authors
24
3
Name
Order
Citations
PageRank
Samira Pouyanfar114113.06
Shu-Ching Chen21978182.74
Mei-Ling Shyu31863141.25