Abstract | ||
---|---|---|
In this paper we are interested in recognizing human actions from sequences of 3D skeleton data. For this purpose we combine a 3D Convolutional Neural Network with body representations based on Euclidean Distance Matrices (EDMs), which have been recently shown to be very effective to capture the geometric structure of the human pose. One inherent limitation of the EDMs, however, is that they are defined up to a permutation of the skeleton joints, i.e., randomly shuffling the ordering of the joints yields many different representations. In oder to address this issue we introduce a novel architecture that simultaneously, and in an end-to-end manner, learns an optimal transformation of the joints, while optimizing the rest of parameters of the convolutional network. The proposed approach achieves state-of-the-art results on 3 benchmarks, including the recent NTU RGB-D dataset, for which we improve on previous LSTM-based methods by more than 10 percentage points, also surpassing other CNN-based methods while using almost 1000 times fewer parameters.
|
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3123266.3123299 | MM '17: ACM Multimedia Conference
Mountain View
California
USA
October, 2017 |
Field | DocType | ISBN |
Computer vision,Activity recognition,Distance matrices in phylogeny,Computer science,Convolutional neural network,Matrix (mathematics),Permutation,Euclidean distance,Shuffling,Artificial intelligence,Deep learning | Conference | 978-1-4503-4906-2 |
Citations | PageRank | References |
4 | 0.38 | 22 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alejandro Hernandez Ruiz | 1 | 4 | 0.38 |
Lorenzo Porzi | 2 | 120 | 11.79 |
Samuel Rota Bulò | 3 | 564 | 33.69 |
Francesc Moreno-Noguer | 4 | 1647 | 93.46 |