O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model

08/18/2023
by   Yubin Hu, et al.

Occlusion is a common issue in 3D reconstruction from RGB-D videos, often blocking the complete reconstruction of objects and presenting an ongoing problem. In this paper, we propose a novel framework, empowered by a 2D diffusion-based in-painting model, to reconstruct complete surfaces for the hidden parts of objects. Specifically, we utilize a pre-trained diffusion model to fill in the hidden areas of 2D images. We then use these in-painted images to optimize a neural implicit surface representation of each instance for 3D reconstruction. Since creating the in-painting masks needed for this process is tricky, we adopt a human-in-the-loop strategy that requires very little human engagement to generate high-quality masks. Moreover, some parts of objects can be totally hidden because the videos are usually shot from limited perspectives. To recover these invisible areas, we develop a cascaded network architecture for predicting the signed distance field, which makes use of different frequency bands of positional encoding while maintaining overall smoothness. Besides the commonly used rendering loss, Eikonal loss, and silhouette loss, we adopt a CLIP-based semantic consistency loss to guide the surface from unseen camera angles. Experiments on ScanNet scenes show that our proposed framework achieves state-of-the-art accuracy and completeness in object-level reconstruction from scene-level RGB-D videos.
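Two of the ingredients named in the abstract, the Eikonal loss and positional encoding at several frequency bands, can be illustrated with a minimal sketch. This is not the authors' implementation: the analytic sphere stands in for a learned signed distance network, gradients are taken by finite differences rather than automatic differentiation, and the names `sphere_sdf`, `numerical_gradient`, `eikonal_loss`, and `positional_encoding` are hypothetical.

```python
import math


def sphere_sdf(p, radius=1.0):
    """Signed distance from point p to a sphere centred at the origin.

    Stand-in for a learned SDF network (an assumption for illustration).
    """
    return math.sqrt(sum(c * c for c in p)) - radius


def numerical_gradient(sdf, p, eps=1e-4):
    """Central-difference gradient of sdf at point p."""
    grad = []
    for i in range(len(p)):
        hi = list(p)
        lo = list(p)
        hi[i] += eps
        lo[i] -= eps
        grad.append((sdf(hi) - sdf(lo)) / (2 * eps))
    return grad


def eikonal_loss(sdf, points):
    """Mean of (||grad f(p)|| - 1)^2 over the sample points.

    A true signed distance field has unit-norm gradient almost everywhere,
    so this penalty is close to zero for it.
    """
    total = 0.0
    for p in points:
        g = numerical_gradient(sdf, p)
        norm = math.sqrt(sum(c * c for c in g))
        total += (norm - 1.0) ** 2
    return total / len(points)


def positional_encoding(p, num_bands):
    """sin/cos features at frequencies 2^0 .. 2^(num_bands - 1).

    Features are ordered band by band, so a coarse encoding is a prefix
    of a finer one.
    """
    feats = []
    for k in range(num_bands):
        freq = (2.0 ** k) * math.pi
        for c in p:
            feats.append(math.sin(freq * c))
            feats.append(math.cos(freq * c))
    return feats


# Sample points kept away from the origin, where the sphere SDF is not smooth.
sample_points = [(0.5, 0.2, -0.3), (1.5, -0.7, 0.4), (-0.9, 1.1, 0.6)]

loss_true_sdf = eikonal_loss(sphere_sdf, sample_points)
loss_scaled = eikonal_loss(lambda p: 2.0 * sphere_sdf(p), sample_points)

coarse = positional_encoding(sample_points[0], num_bands=2)
fine = positional_encoding(sample_points[0], num_bands=4)
```

Here `loss_true_sdf` is close to zero, while the scaled field, whose gradient has norm 2, is penalized. Because the features are ordered band by band, `coarse` equals the first entries of `fine`; a cascaded design could in principle feed low bands to an early stage and add higher bands later, though the exact arrangement used in the paper is not specified in this abstract.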


research
08/15/2023

ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

In recent years, neural implicit surface reconstruction has emerged as a...
research
03/15/2022

Animatable Neural Implicit Surfaces for Creating Avatars from Videos

This paper aims to reconstruct an animatable human model from a video of...
research
06/09/2023

RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models

The emergence of Neural Radiance Fields (NeRF) has promoted the developm...
research
03/16/2023

Learning a Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation

Implicit neural rendering, which uses signed distance function (SDF) rep...
research
03/16/2023

DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars

We present DINAR, an approach for creating realistic rigged fullbody ava...
research
05/25/2023

Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos

The analysis and use of egocentric videos for robotic tasks is made chal...
research
07/06/2023

IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model

Generating complete 360-degree panoramas from narrow field of view image...
