Enhance the recognition ability to occlusions and small objects with Robust Faster R-CNN. - Citegraph

Paper Info

Title
Enhance the recognition ability to occlusions and small objects with Robust Faster R-CNN.

Abstract
Recognizing objects with vastly different size scales and objects with occlusions is a fundamental challenge in computer vision. This paper addresses this issue by proposing a novel approach denoted as Robust Faster R-CNN for detecting objects in multi-label images. Robust Faster R-CNN employs a cascaded network structure based on the Faster R-CNN architecture to extract features from objects with different size scales. However, the proposed design provides greater robustness than Faster R-CNN by replacing the RoIPooling operation with RoIAligns to eliminate the harsh quantization conducted by RoIPooling, and we design a multi-scale RoIAligns operation by adding multiple pool sizes for adapting the detection ability of the network to objects with different sizes. Furthermore, we combine an adversarial network with the proposed network to generate training samples with occlusions significantly affecting the classification ability of the model, which improves its robustness to occlusions. Experimental results for the PASCAL VOC 2012 and 2007 datasets demonstrate the superiority of the proposed object detection approach relative to several state-of-the-art approaches.

Year	DOI	Venue
2019	10.1007/s13042-019-01006-4	International Journal of Machine Learning and Cybernetics
Keywords	Field	DocType
Object detection, Robust Faster R-CNN, Multi-cascaded network, Adversarial network, Feature fusion	Computer vision,Object detection,Feature fusion,Computer science,Robustness (computer science),Artificial intelligence,Quantization (signal processing),Network structure	Journal
Volume	Issue	ISSN
10	11	1868-8071
Citations	PageRank	References
2	0.37	0
Authors
3

Authors (3 rows)

Cited by (2 rows)

References (0 rows)

Name	Order	Citations	PageRank
Tao Zhou	1	3	1.39
Zhixin Li	2	111	24.43
Canlong Zhang	3	6	2.51

1