A Large RGB-D Dataset for Semi-supervised Monocular Depth Estimation

04/23/2019
by   Jaehoon Cho, et al.
0

The recent advance of monocular depth estimation is largely based on deeply nested convolutional networks, combined with supervised training. However, it still remains arduous to collect large-scale ground truth depth (or disparity) maps for supervising the networks. This paper presents a simple yet effective semi-supervised approach for monocular depth estimation. Inspired by the human visual system, we propose a student-teacher strategy in which a shallow student network is trained with the auxiliary information obtained from a deeper and accurate teacher network. Specifically, we first train the stereo teacher network fully utilizing the binocular perception of 3D geometry, and then use depth predictions of the teacher network for supervising the student network for monocular depth inference. This enables us to exploit all available depth data from massive unlabeled stereo pairs that are relatively easier-to-obtain. We further introduce a data ensemble strategy that merges multiple depth predictions of the teacher network to improve the training samples for the student network. Additionally, stereo confidence maps are provided to avoid inaccurate depth estimates being used when supervising the student network. Our new training data, consisting of 1 million outdoor stereo images taken using hand-held stereo cameras, is hosted at the project webpage. Lastly, we demonstrate that the monocular depth estimation network provides feature representations that are suitable for some high-level vision tasks such as semantic segmentation and road detection. Extensive experiments demonstrate the effectiveness and flexibility of the proposed method in various outdoor scenarios.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 9

page 10

page 11

page 12

research
05/18/2022

Learning Monocular Depth Estimation via Selective Distillation of Stereo Knowledge

Monocular depth estimation has been extensively explored based on deep l...
research
09/27/2020

Adaptive confidence thresholding for semi-supervised monocular depth estimation

Self-supervised monocular depth estimation has become an appealing solut...
research
03/27/2023

Pushing the Envelope for Depth-Based Semi-Supervised 3D Hand Pose Estimation with Consistency Training

Despite the significant progress that depth-based 3D hand pose estimatio...
research
04/23/2019

Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More

In this paper, we investigate a novel deep-model reusing task. Our goal ...
research
05/11/2019

Monocular Depth Estimation with Directional Consistency by Deep Networks

As processing power has become more available, more human-like artificia...
research
07/08/2022

BlindSpotNet: Seeing Where We Cannot See

We introduce 2D blind spot estimation as a critical visual task for road...

Please sign up or login with your details

Forgot password? Click here to reset