Title
Improving Multiscale Object Detection With Off-Centered Semantics Refinement
Abstract
Feature Pyramid (FP) is typically a fundamental component for detecting multi-scale objects. However, as the network deepens, FP faces two problems: (1) Information loss caused by channel reduction. (2) The insufficient effective receptive field due to convolution with the sliding window mode. We found that the above problems can be alleviated by increasing the semantics extraction weights of the off-centered feature map. In this paper, a new feature pyramid architecture named Off-Centered Semantics Refinement Feature Pyramid Network (OSR-FPN) is proposed. Specifically, OSR-FPN contains two components exploiting the Off-Centered Semantics Refinement (OSR) mechanism: Features Supplement Module (FSM) and Receptive Field Enlargement Module (RFEM). FSM and RFEM are respectively designed to complement the lost context at the highest pyramid level and enrich the semantics by expanding the receptive field. In addition, we propose the Sigmoid-interpolation Padding method to enhance our OSR. Experiments on MS COCO dataset and UAVDT object detection benchmarks demonstrate the effectiveness of our method. As a result, OSR-FPN achieves a better accuracy of complex object detection.
Year
DOI
Venue
2022
10.1109/TCSVT.2022.3173960
IEEE Transactions on Circuits and Systems for Video Technology
Keywords
DocType
Volume
Multi-scale object detection,feature pyramid,off-centered semantics,features supplement,receptive field
Journal
32
Issue
ISSN
Citations 
10
1051-8215
0
PageRank 
References 
Authors
0.34
11
6
Name
Order
Citations
PageRank
Xianlun Tang174.86
Qiao Yang200.34
Deyi Xiong300.34
Ying Xie400.34
Huiming Wang51665103.97
Rui Li600.34