Title
Dronecaps: Recognition Of Human Actions In Drone Videos Using Capsule Networks With Binary Volume Comparisons
Abstract
Understanding human actions from videos captured by drones is a challenging task in computer vision due to the unfamiliar viewpoints of individuals and changes in their size due to the camera's location and motion. This work proposes DroneCaps, a capsule network architecture for multi-label human action recognition (HAR) in videos captured by drones. DroneCaps uses features computed by 3D convolution neural networks plus a new set of features computed by a novel Binary Volume Comparison layer. All these features, in conjunction with the learning power of CapsNets, allow understanding and abstracting the different viewpoints and poses of the depicted individuals very efficiently, thus improving multi-label HAR. The evaluation of the DroneCaps architecture's performance for multi-label classification shows that it outperforms state-of-the-art methods on the Okutama-Action dataset.
Year
DOI
Venue
2020
10.1109/ICIP40778.2020.9190864
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)
Keywords
DocType
ISSN
Capsule networks, EM Routing, Dynamic Routing, drone videos, Human Action Recognition
Conference
1522-4880
Citations 
PageRank 
References 
1
0.34
0
Authors
3
Name
Order
Citations
PageRank
Abdullah M. Algamdi110.68
Victor Sanchez214431.22
Chang-Tsun Li3245.11