Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

07/26/2020
by   Weicheng Kuo, et al.
0

Object recognition has seen significant progress in the image domain, with focus primarily on 2D perception. We propose to leverage existing large-scale datasets of 3D models to understand the underlying 3D structure of objects seen in an image by constructing a CAD-based representation of the objects and their poses. We present Mask2CAD, which jointly detects objects in real-world images and for each detected object, optimizes for the most similar CAD model and its pose. We construct a joint embedding space between the detected regions of an image corresponding to an object and 3D CAD models, enabling retrieval of CAD models for an input RGB image. This produces a clean, lightweight representation of the objects in an image; this CAD-based representation ensures a valid, efficient shape representation for applications such as content creation or interactive scenarios, and makes a step towards understanding the transformation of real-world imagery to a synthetic domain. Experiments on real-world images from Pix3D demonstrate the advantage of our approach in comparison to state of the art. To facilitate future research, we additionally propose a new image-to-3D baseline on ScanNet which features larger shape diversity, real-world occlusions, and challenging image views.

READ FULL TEXT

page 2

page 9

page 11

page 12

page 19

page 20

page 21

page 22

research
08/20/2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image

3D perception of object shapes from RGB image input is fundamental towar...
research
10/19/2020

Learning to Reconstruct and Segment 3D Objects

To endow machines with the ability to perceive the real-world in a three...
research
12/03/2021

ROCA: Robust CAD Model Retrieval and Alignment from a Single Image

We present ROCA, a novel end-to-end approach that retrieves and aligns 3...
research
11/10/2021

Leveraging Geometry for Shape Estimation from a Single RGB Image

Predicting 3D shapes and poses of static objects from a single RGB image...
research
06/22/2014

3D ShapeNets: A Deep Representation for Volumetric Shapes

3D shape is a crucial but heavily underutilized cue in today's computer ...
research
08/18/2016

IM2CAD

Given a single photo of a room and a large database of furniture CAD mod...
research
08/12/2017

Calipso: Physics-based Image and Video Editing through CAD Model Proxies

We present Calipso, an interactive method for editing images and videos ...

Please sign up or login with your details

Forgot password? Click here to reset