Title
Enhance the recognition ability to occlusions and small objects with Robust Faster R-CNN.
Abstract
Recognizing objects with vastly different size scales and objects with occlusions is a fundamental challenge in computer vision. This paper addresses this issue by proposing a novel approach denoted as Robust Faster R-CNN for detecting objects in multi-label images. Robust Faster R-CNN employs a cascaded network structure based on the Faster R-CNN architecture to extract features from objects with different size scales. However, the proposed design provides greater robustness than Faster R-CNN by replacing the RoIPooling operation with RoIAligns to eliminate the harsh quantization conducted by RoIPooling, and we design a multi-scale RoIAligns operation by adding multiple pool sizes for adapting the detection ability of the network to objects with different sizes. Furthermore, we combine an adversarial network with the proposed network to generate training samples with occlusions significantly affecting the classification ability of the model, which improves its robustness to occlusions. Experimental results for the PASCAL VOC 2012 and 2007 datasets demonstrate the superiority of the proposed object detection approach relative to several state-of-the-art approaches.
Year
DOI
Venue
2019
10.1007/s13042-019-01006-4
International Journal of Machine Learning and Cybernetics
Keywords
Field
DocType
Object detection, Robust Faster R-CNN, Multi-cascaded network, Adversarial network, Feature fusion
Computer vision,Object detection,Feature fusion,Computer science,Robustness (computer science),Artificial intelligence,Quantization (signal processing),Network structure
Journal
Volume
Issue
ISSN
10
11
1868-8071
Citations 
PageRank 
References 
2
0.37
0
Authors
3
Name
Order
Citations
PageRank
Tao Zhou131.39
Zhixin Li211124.43
Canlong Zhang362.51