Abstract | ||
---|---|---|
We present an approach which incorporates spatiotemporal features as well as the relationships between them, into a sparse dictionary learning framework for activity recognition. We propose that the dictionary learning framework can be adapted to learning complex relationships between features in an unsupervised manner. From a set of training videos, a dictionary is learned for individual features, as well as the relationships between them using a stacked predictive sparse decomposition framework. This combined dictionary provides a representation of the structure of the video and is spatio-temporally pooled in a local manner to obtain descriptors. The descriptors are then combined using a multiple kernel learning framework to design classifiers. Experiments have been conducted on two popular activity recognition datasets to demonstrate the superior performance of our approach on single person as well as multi-person activities. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/ICIP.2014.7025991 | Image Processing |
Keywords | Field | DocType |
feature extraction,image classification,signal processing,spatiotemporal phenomena,unsupervised learning,video signal processing,activity modeling,activity recognition datasets,multiple kernel learning framework,sparse dictionary learning framework,spatiotemporal features,stacked predictive sparse decomposition framework,video structure,video training,Sparse coding,activity recognition,multiple kernel learning | Computer vision,Dictionary learning,Activity recognition,K-SVD,Pattern recognition,Neural coding,Computer science,Sparse approximation,Multiple kernel learning,Artificial intelligence,Machine learning | Conference |
ISSN | Citations | PageRank |
1522-4880 | 0 | 0.34 |
References | Authors | |
15 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nandita M. Nayak | 1 | 78 | 4.68 |
Amit K. Roy Chowdhury | 2 | 1153 | 73.96 |