Abstract | ||
---|---|---|
Feature Pyramid (FP) is typically a fundamental component for detecting multi-scale objects. However, as the network deepens, FP faces two problems: (1) Information loss caused by channel reduction. (2) The insufficient effective receptive field due to convolution with the sliding window mode. We found that the above problems can be alleviated by increasing the semantics extraction weights of the off-centered feature map. In this paper, a new feature pyramid architecture named Off-Centered Semantics Refinement Feature Pyramid Network (OSR-FPN) is proposed. Specifically, OSR-FPN contains two components exploiting the Off-Centered Semantics Refinement (OSR) mechanism: Features Supplement Module (FSM) and Receptive Field Enlargement Module (RFEM). FSM and RFEM are respectively designed to complement the lost context at the highest pyramid level and enrich the semantics by expanding the receptive field. In addition, we propose the Sigmoid-interpolation Padding method to enhance our OSR. Experiments on MS COCO dataset and UAVDT object detection benchmarks demonstrate the effectiveness of our method. As a result, OSR-FPN achieves a better accuracy of complex object detection. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/TCSVT.2022.3173960 | IEEE Transactions on Circuits and Systems for Video Technology |
Keywords | DocType | Volume |
Multi-scale object detection,feature pyramid,off-centered semantics,features supplement,receptive field | Journal | 32 |
Issue | ISSN | Citations |
10 | 1051-8215 | 0 |
PageRank | References | Authors |
0.34 | 11 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xianlun Tang | 1 | 7 | 4.86 |
Qiao Yang | 2 | 0 | 0.34 |
Deyi Xiong | 3 | 0 | 0.34 |
Ying Xie | 4 | 0 | 0.34 |
Huiming Wang | 5 | 1665 | 103.97 |
Rui Li | 6 | 0 | 0.34 |