Title
First-Person Action Decomposition and Zero-Shot Learning
Abstract
In this work, we decompose a first-person action into verb and noun. We then study how the coupling of an action's constituent verb and noun affects the learners' ability to learn them separately and to combine them to perform recognition. We compare different information fusion methods on conventional action recognition and zero-shot learning, of which the latter is a strong indication of the feature's ability to capture one concept (verb/noun) and not be confounded by the other. To achieve the decoupling of verb/noun concepts, we extract features that are specialized for each of them. Specifically, we use improved dense trajectories and convolutional neural network activations. We show that by constructing specialized features for the decomposed concepts, our method succeeds in zero-shot learning. More surprisingly, it also outperforms previous results in conventional action recognition when the performance gaps of different features on verb/noun concepts are significant.
Year
DOI
Venue
2017
10.1109/WACV.2017.21
2017 IEEE Winter Conference on Applications of Computer Vision (WACV)
Keywords
Field
DocType
first-person action decomposition,zero-shot learning,dense trajectories,convolutional neural network,action recognition
Computer vision,Verb,Computer science,Visualization,Convolutional neural network,Zero shot learning,Action recognition,Noun,Feature extraction,Natural language processing,Artificial intelligence,Information fusion
Conference
ISSN
ISBN
Citations 
2472-6737
978-1-5090-4823-6
0
PageRank 
References 
Authors
0.34
38
3
Name
Order
Citations
PageRank
Yun C. Zhang100.34
Yin Li279735.85
James M. Rehg35259474.66