Title
Watch-n-patch: Unsupervised understanding of actions and relations
Abstract
We focus on modeling human activities comprising multiple actions in a completely unsupervised setting. Our model learns the high-level action co-occurrence and temporal relations between the actions in the activity video. We consider the video as a sequence of short-term action clips, called action-words, and an activity is about a set of action-topics indicating which actions are present in the video. Then we propose a new probabilistic model relating the action-words and the action-topics. It allows us to model long-range action relations that commonly exist in the complex activity, which is challenging to capture in the previous works. We apply our model to unsupervised action segmentation and recognition, and also to a novel application that detects forgotten actions, which we call action patching. For evaluation, we also contribute a new challenging RGB-D activity video dataset recorded by the new Kinect v2, which contains several human daily activities as compositions of multiple actions interacted with different objects. The extensive experiments show the effectiveness of our model.
Year
DOI
Venue
2015
10.1109/CVPR.2015.7299065
IEEE Conference on Computer Vision and Pattern Recognition
Field
DocType
Volume
Computer vision,Pattern recognition,Segmentation,Computer science,Statistical model,Artificial intelligence,RGB color model,Machine learning
Conference
2015
Issue
ISSN
Citations 
1
1063-6919
30
PageRank 
References 
Authors
0.73
33
4
Name
Order
Citations
PageRank
Chenxia Wu11276.86
Jiemi Zhang2964.55
Silvio Savarese33975161.69
Ashutosh Saxena44575227.88