CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation - Citegraph

Paper Info

Title
CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Abstract
Video instance segmentation is a complex task in which we need to detect, segment, and track each object in any given video. Previous approaches utilize only single-frame features for the detection, segmentation, and tracking of objects, and they suffer in the video scenario due to several distinct challenges such as motion blur and drastic appearance change. To eliminate ambiguities introduced by using only single-frame features, we propose a novel comprehensive feature aggregation approach (CompFeat) to refine features at both the frame level and the object level with temporal and spatial context information. The aggregation process is carefully designed with a new attention mechanism which significantly increases the discriminative power of the learned features. We further improve the tracking capability of our model through a Siamese design by incorporating both feature similarities and spatial similarities. Experiments conducted on the YouTube-VIS dataset validate the effectiveness of the proposed CompFeat.

Year	Venue	DocType
2021	THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE	Conference
Volume	ISSN	Citations
35	2159-5399	0
PageRank	References	Authors
0.34	7	5

Authors (5 rows)

Name	Order	Citations	PageRank
Yang Fu	1	0	0.68
Linjie Yang	2	34	6.31
Ding Liu	3	611	32.97
Thomas S. Huang	4	27815	2618.42
Honghui Shi	5	183	20.24