Title
Can categories and attributes be learned in a multi-task way?
Abstract
Intuitively, we can think of object recognition and attribute prediction as correlated tasks. However, they appeared to conflict in a simple two-branch multi-task framework (a category branch and an attribute branch) with a shared backbone part (convolutional layers and pooling layers). The performance dropped along with the iterative training steps. This result might have been caused by the noncoherent feature distribution between the object recognition features and the attribute prediction features. Recognition features are discriminative for different categories and are not sensitive to intracategory variations, while attribute prediction features are discriminative for different attributes, although these attributes can exist in objects from the same category. Thus, a conflict occurs when we force the network to learn the two kinds of distinct features simultaneously. To address this problem, we propose the category and attribute prediction network (CAP-net), in which a category-constrained attribute prediction structure is introduced to transfer the object recognition knowledge and avoid the conflict between two features. The CAP-net parameters can be learned easily with a regularization method. Extensive experimental results show that the CAP-net outperforms the state-of-the-art methods on object recognition and attribute prediction tasks.
Year
DOI
Venue
2019
10.1109/TMM.2019.2919469
IEEE Transactions on Multimedia
Keywords
Field
DocType
Task analysis,Object recognition,Birds,Training,Dogs,Cats,Predictive models
Pattern recognition,Computer science,Pooling,Regularization (mathematics),Artificial intelligence,Discriminative model,Cognitive neuroscience of visual object recognition
Journal
Volume
Issue
ISSN
21
12
1520-9210
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Shu Yang100.68
Yaowei Wang213429.62
Yemin Shi3379.48
Zesong Fei469986.33