Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation

02/27/2020
by   Yunhan Zhao, et al.
12

Leveraging synthetically rendered data offers great potential to improve monocular depth estimation, but closing the synthetic-real domain gap is a non-trivial and important task. While much recent work has focused on unsupervised domain adaptation, we consider a more realistic scenario where a large amount of synthetic training data is supplemented by a small set of real images with ground-truth. In this setting we find that existing domain translation approaches are difficult to train and offer little advantage over simple baselines that use a mix of real and synthetic data. A key failure mode is that real-world images contain novel objects and clutter not present in synthetic training. This high-level domain shift isn't handled by existing image translation models. Based on these observations, we develop an attentional module that learns to identify and remove (hard) out-of-domain regions in real images in order to improve depth prediction for a model trained primarily on synthetic data. We carry out extensive experiments to validate our attend-remove-complete approach (ARC) and find that it significantly outperforms state-of-the-art domain adaptation methods for depth prediction. Visualizing the removed regions provides interpretable insights into the synthetic-real domain gap.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 13

page 15

research
05/10/2020

Domain Adaptation for Image Dehazing

Image dehazing using learning-based methods has achieved state-of-the-ar...
research
05/19/2020

Focus on defocus: bridging the synthetic to real domain gap for depth estimation

Data-driven depth estimation methods struggle with the generalization ou...
research
05/04/2022

ShoeRinsics: Shoeprint Prediction for Forensics with Intrinsic Decomposition

Shoe tread impressions are one of the most common types of evidence left...
research
04/02/2021

S2R-DepthNet: Learning a Generalizable Depth-specific Structural Representation

Human can infer the 3D geometry of a scene from a sketch instead of a re...
research
07/16/2019

Learning Depth from Monocular Videos Using Synthetic Data: A Temporally-Consistent Domain Adaptation Approach

Majority of state-of-the-art monocular depth estimation methods are supe...
research
01/19/2017

Synthetic to Real Adaptation with Generative Correlation Alignment Networks

Synthetic images rendered from 3D CAD models are useful for augmenting t...
research
06/03/2020

From Real to Synthetic and Back: Synthesizing Training Data for Multi-Person Scene Understanding

We present a method for synthesizing naturally looking images of multipl...

Please sign up or login with your details

Forgot password? Click here to reset