A Learned Stereo Depth System for Robotic Manipulation in Homes - Citegraph

Paper Info

Title
A Learned Stereo Depth System for Robotic Manipulation in Homes

Abstract
We present a passive stereo depth system that produces dense and accurate point clouds optimized for human environments, including dark, textureless, thin, reflective and specular surfaces and objects, at 2560 x 2048 resolution, with 384 disparities, in 30 ms. The system consists of an algorithm combining learned stereo matching with engineered filtering, a training and data-mixing methodology, and a sensor hardware design. Our architecture is 15x faster than approaches that perform similarly on the Middlebury and Flying Things Stereo Benchmarks. To effectively supervise the training of this model, we combine real data labelled using off-the-shelf depth sensors, as well as a number of different rendered, simulated labeled datasets. We demonstrate the efficacy of our system by presenting a large number of qualitative results in the form of depth maps and point-clouds, experiments validating the metric accuracy of our system and comparisons to other sensors on challenging objects and scenes. We also show the competitiveness of our algorithm compared to state-of-the-art learned models using the Middlebury and FlyingThings datasets.

Year	DOI	Venue
2022	10.1109/LRA.2022.3143895	IEEE ROBOTICS AND AUTOMATION LETTERS
Keywords	DocType	Volume
RGB-D perception, deep learning for visual perception, perception for grasping and manipulation	Journal	7
Issue	ISSN	Citations
2	2377-3766	0
PageRank	References	Authors
0.34	0	5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Krishna Shankar	1	26	3.56
Mark Tjersland	2	0	0.34
Jeremy Ma	3	181	9.93
Stone Kevin	4	0	1.69
Max Bajracharya	5	224	18.15

1