Title
Mining deep And-Or object structures via cost-sensitive question-answer-based active annotations.
Abstract
This paper presents a cost-sensitive active Question-Answering (QA) framework for learning a nine-layer And-Or graph (AOG) from web images. The AOG explicitly represents object categories, poses/viewpoints, parts, and detailed structures within the parts in a compositional hierarchy. The QA framework is designed to minimize an overall risk, which trades off the loss and query costs. The loss is defined for nodes in all layers of the AOG, including the generative loss (measuring the likelihood of the images) and the discriminative loss (measuring the fitness to human answers). The cost comprises both the human labor of answering questions and the computational cost of model learning. The cost-sensitive QA framework iteratively selects different storylines of questions to update different nodes in the AOG. Experiments showed that our method required much less human supervision (e.g. labeling parts on 3–10 training objects for each category) and achieved better performance than baseline methods.
Year
DOI
Venue
2018
10.1016/j.cviu.2018.09.008
Computer Vision and Image Understanding
Keywords
Field
DocType
41A05,41A10,65D05,65D17
Graph,Viewpoints,Question answer,Artificial intelligence,Generative grammar,Hierarchy,Discriminative model,Machine learning,Mathematics,Model learning
Journal
Volume
Issue
ISSN
176
1
1077-3142
Citations 
PageRank 
References 
0
0.34
43
Authors
4
Name
Order
Citations
PageRank
Quanshi Zhang128826.67
Ying Nian Wu21652267.72
Hao Zhang320364.03
Song-Chun Zhu46580741.75