Title
Disentangling Monocular 3d Object Detection
Abstract
In this paper we propose an approach for monocular 3D object detection from a single RGB image, which leverages a novel disentangling transformation for 2D and 3D detection losses and a novel, self-supervised confidence score for 3D bounding boxes. Our proposed loss disentanglement has the twofold advantage of simplifying the training dynamics in the presence of losses with complex interactions of parameters, and sidestepping the issue of balancing independent regression terms. Our solution overcomes these issues by isolating the contribution made by groups of parameters to a given loss, without changing its nature. We further apply loss disentanglement to another novel, signed Intersection-over-Union criterion-driven loss for improving 2D detection results. Besides our methodological innovations, we critically review the AP metric used in KITTI3D, which emerged as the most important dataset for comparing 3D detection results. We identify and resolve a flaw in the 11-point interpolated AP metric, affecting all previously published detection results and particularly biases the results of monocular 3D detection. We provide extensive experimental evaluations and ablation studies and set a new state-of-the-art on the KITTI3D Car class.
Year
DOI
Venue
2019
10.1109/ICCV.2019.00208
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019)
DocType
Volume
Issue
Conference
2019
1
ISSN
Citations 
PageRank 
1550-5499
10
1.32
References 
Authors
2
5
Name
Order
Citations
PageRank
Andrea Simonelli1122.76
Samuel Rota Bulò256433.69
Lorenzo Porzi312011.79
Manuel Lopez-Antequera4256.22
Peter Kontschieder537621.10