Title
End-to-End Human Object Interaction Detection with HOI Transformer
Abstract
We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple HOI task into separated stages of object detection and interaction classification or introduce surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts HOI instances in parallel. A quintuple matching loss is introduced to force HOI predictions in a unified way. Our method is conceptually much simpler and demonstrates improved accuracy. Without bells and whistles, HOI Transformer achieves 26:61% AP on HICO-DET and 52:9% AProle on V-COCO, surpassing previous methods with the advantage of being much simpler. We hope our approach will serve as a simple and effective alternative for HOI tasks.
Year
DOI
Venue
2021
10.1109/CVPR46437.2021.01165
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021
DocType
ISSN
Citations 
Conference
1063-6919
0
PageRank 
References 
Authors
0.34
11
11
Name
Order
Citations
PageRank
Cheng Zou100.68
Bohan Wang285.85
Yue Hu300.34
Junqi Liu400.34
Qian Wu500.34
Yu Zhao600.68
Boxun Li757131.13
Zhang Chenguang832.78
Chi Zhang9202.47
Yichen Wei10207467.87
Jian Sun1125842956.90