Title
Learning a sparse dictionary of video structure for activity modeling
Abstract
We present an approach which incorporates spatiotemporal features as well as the relationships between them, into a sparse dictionary learning framework for activity recognition. We propose that the dictionary learning framework can be adapted to learning complex relationships between features in an unsupervised manner. From a set of training videos, a dictionary is learned for individual features, as well as the relationships between them using a stacked predictive sparse decomposition framework. This combined dictionary provides a representation of the structure of the video and is spatio-temporally pooled in a local manner to obtain descriptors. The descriptors are then combined using a multiple kernel learning framework to design classifiers. Experiments have been conducted on two popular activity recognition datasets to demonstrate the superior performance of our approach on single person as well as multi-person activities.
Year
DOI
Venue
2014
10.1109/ICIP.2014.7025991
Image Processing
Keywords
Field
DocType
feature extraction,image classification,signal processing,spatiotemporal phenomena,unsupervised learning,video signal processing,activity modeling,activity recognition datasets,multiple kernel learning framework,sparse dictionary learning framework,spatiotemporal features,stacked predictive sparse decomposition framework,video structure,video training,Sparse coding,activity recognition,multiple kernel learning
Computer vision,Dictionary learning,Activity recognition,K-SVD,Pattern recognition,Neural coding,Computer science,Sparse approximation,Multiple kernel learning,Artificial intelligence,Machine learning
Conference
ISSN
Citations 
PageRank 
1522-4880
0
0.34
References 
Authors
15
2
Name
Order
Citations
PageRank
Nandita M. Nayak1784.68
Amit K. Roy Chowdhury2115373.96