Title | ||
---|---|---|
Enhance the recognition ability to occlusions and small objects with Robust Faster R-CNN. |
Abstract | ||
---|---|---|
Recognizing objects with vastly different size scales and objects with occlusions is a fundamental challenge in computer vision. This paper addresses this issue by proposing a novel approach denoted as Robust Faster R-CNN for detecting objects in multi-label images. Robust Faster R-CNN employs a cascaded network structure based on the Faster R-CNN architecture to extract features from objects with different size scales. However, the proposed design provides greater robustness than Faster R-CNN by replacing the RoIPooling operation with RoIAligns to eliminate the harsh quantization conducted by RoIPooling, and we design a multi-scale RoIAligns operation by adding multiple pool sizes for adapting the detection ability of the network to objects with different sizes. Furthermore, we combine an adversarial network with the proposed network to generate training samples with occlusions significantly affecting the classification ability of the model, which improves its robustness to occlusions. Experimental results for the PASCAL VOC 2012 and 2007 datasets demonstrate the superiority of the proposed object detection approach relative to several state-of-the-art approaches. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/s13042-019-01006-4 | International Journal of Machine Learning and Cybernetics |
Keywords | Field | DocType |
Object detection, Robust Faster R-CNN, Multi-cascaded network, Adversarial network, Feature fusion | Computer vision,Object detection,Feature fusion,Computer science,Robustness (computer science),Artificial intelligence,Quantization (signal processing),Network structure | Journal |
Volume | Issue | ISSN |
10 | 11 | 1868-8071 |
Citations | PageRank | References |
2 | 0.37 | 0 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tao Zhou | 1 | 3 | 1.39 |
Zhixin Li | 2 | 111 | 24.43 |
Canlong Zhang | 3 | 6 | 2.51 |