Diffuser: Multi-View 2D-to-3D Label Diffusion for Semantic Scene Segmentation - Citegraph

Paper Info

Title
Diffuser: Multi-View 2D-to-3D Label Diffusion for Semantic Scene Segmentation

Abstract
Semantic 3D scene understanding is a fundamental problem in computer vision and robotics. Despite recent advances in deep learning, its application to multi-domain 3D semantic segmentation typically suffers from the lack of extensive enough annotated 3D datasets. On the contrary, 2D neural networks benefit from existing large amounts of training data and can be applied to a wider variety of environments, sometimes even without need for retraining. In this paper, we present 'Diffuser', a novel and efficient multi-view fusion framework that leverages 2D semantic segmentation of multiple image views of a scene to produce a consistent and refined 3D segmentation. We formulate the 3D segmentation task as a transductive label diffusion problem on a graph, where multi-view and 3D geometric properties are used to propagate semantic labels from the 2D image space to the 3D map. Experiments conducted on indoor and outdoor challenging datasets demonstrate the versatility of our approach, as well as its effectiveness for both global 3D scene labeling and single RGB-D frame segmentation. Furthermore, we show a significant increase in 3D segmentation accuracy compared to probabilistic fusion methods employed in several state-of-the-art multi-view approaches, with little computational overhead.

Year	DOI	Venue
2021	10.1109/ICRA48506.2021.9561801	2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021)
DocType	Volume	Issue
Conference	2021	1
ISSN	Citations	PageRank
1050-4729	0	0.34
References	Authors
2	3

Authors (3 rows)

Cited by (0 rows)

References (2 rows)

Name	Order	Citations	PageRank
Ruben Mascaro	1	0	1.01
Lucas Teixeira	2	30	6.93
Margarita Chli	3	1283	53.59

1