Title
Monocular Depth Estimation With Augmented Ordinal Depth Relationships
Abstract
AbstractMost existing algorithms for depth estimation from single monocular images need large quantities of metric ground-truth depths for supervised learning. We show that relative depth can be an informative cue for metric depth estimation and can be easily obtained from vast stereo videos. Acquiring metric depths from stereo videos are sometimes impracticable due to the absence of camera parameters. In this paper, we propose to improve the performance of metric depth estimation with relative depths collected from stereo movie videos using existing stereo matching algorithm. We introduce a new “relative depth in stereo” (RDIS) dataset densely labeled with relative depths. We first pretrain a ResNet model on our RDIS dataset. Then, we finetune the model on RGB-D datasets with metric ground-truth depths. During our finetuning, we formulate depth estimation as a classification task. This re-formulation scheme enables us to obtain the confidence of a depth prediction in the form of probability distribution. With this confidence, we propose an information gain loss to make use of the predictions that are close to ground-truth. We evaluate our approach on both indoor and outdoor benchmark RGB-D datasets and achieve the state-of-the-art performance.
Year
DOI
Venue
2020
10.1109/TCSVT.2019.2929202
Periodicals
Keywords
DocType
Volume
Estimation, Measurement, Videos, Training, Motion pictures, Task analysis, Labeling, Depth estimation, RGB-D dataset, ordinal relationship, deep network
Journal
30
Issue
ISSN
Citations 
8
1051-8215
2
PageRank 
References 
Authors
0.36
20
6
Name
Order
Citations
PageRank
Yuanzhouhan Cao1553.03
Tianqi Zhao272.27
Ke Xian3558.99
Chunhua Shen44817234.19
Zhiguo Cao531444.17
shugong xu61582147.73