IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement

06/29/2021
by Jie Li, et al.

3D semantic scene completion and 2D semantic segmentation are two tightly correlated tasks that are both essential for indoor scene understanding, because they predict the same semantic classes and rely on positively correlated high-level features. Current methods take 2D features extracted from early-fused RGB-D images for 2D segmentation and use them to improve 3D scene completion. We argue that this sequential scheme does not ensure that the two tasks fully benefit each other, and we present an Iterative Mutual Enhancement Network (IMENet) that solves them jointly by interactively refining both tasks at the late prediction stage. Specifically, two refinement modules are developed under a unified framework. The first is a 2D Deformable Context Pyramid (DCP) module, which receives the projection of the current 3D predictions to refine the 2D predictions. In turn, a 3D Deformable Depth Attention (DDA) module leverages the reprojected 2D predictions to update the coarse 3D predictions. This iterative fusion operates on the stable high-level features of both tasks at a late stage. Extensive experiments on the NYU and NYUCAD datasets verify the effectiveness of the proposed iterative late fusion scheme, and our approach outperforms the state of the art on both 3D semantic scene completion and 2D semantic segmentation.
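The alternation described in the abstract (project 3D predictions to 2D, refine 2D, reproject 2D predictions to 3D, refine 3D) can be illustrated with a toy sketch. This is not the authors' implementation: the DCP and DDA modules are replaced here by a plain averaging step (`blend`), and the `voxel_to_pixel` mapping, the function names, and the list-based score layout are all assumptions made for illustration only.

```python
from typing import Dict, List, Optional

Scores = List[float]  # per-class scores for one pixel or one voxel


def blend(a: Scores, b: Scores, weight: float = 0.5) -> Scores:
    """Stand-in for a learned refinement module (NOT DCP or DDA):
    a convex combination of two score vectors."""
    return [(1.0 - weight) * x + weight * y for x, y in zip(a, b)]


def project_3d_to_2d(voxel_scores: List[Scores],
                     voxel_to_pixel: Dict[int, int],
                     num_pixels: int) -> List[Optional[Scores]]:
    """Average the scores of all voxels that map to each pixel.
    Pixels that no voxel maps to get None."""
    num_classes = len(voxel_scores[0])
    sums = [[0.0] * num_classes for _ in range(num_pixels)]
    counts = [0] * num_pixels
    for v, p in voxel_to_pixel.items():
        counts[p] += 1
        for c in range(num_classes):
            sums[p][c] += voxel_scores[v][c]
    return [[s / counts[p] for s in sums[p]] if counts[p] else None
            for p in range(num_pixels)]


def reproject_2d_to_3d(pixel_scores: List[Scores],
                       voxel_to_pixel: Dict[int, int],
                       num_voxels: int) -> List[Optional[Scores]]:
    """Copy each pixel's scores to every voxel that maps to it.
    Voxels with no associated pixel get None."""
    return [pixel_scores[voxel_to_pixel[v]] if v in voxel_to_pixel else None
            for v in range(num_voxels)]


def iterative_mutual_enhancement(pixel_scores: List[Scores],
                                 voxel_scores: List[Scores],
                                 voxel_to_pixel: Dict[int, int],
                                 iterations: int = 2):
    """Alternate 3D-to-2D and 2D-to-3D refinement for a fixed number of rounds."""
    for _ in range(iterations):
        # 3D -> 2D: refine the 2D predictions with the projected 3D predictions.
        projected = project_3d_to_2d(voxel_scores, voxel_to_pixel, len(pixel_scores))
        pixel_scores = [blend(p, q) if q is not None else p
                        for p, q in zip(pixel_scores, projected)]
        # 2D -> 3D: refine the 3D predictions with the reprojected 2D predictions.
        reprojected = reproject_2d_to_3d(pixel_scores, voxel_to_pixel, len(voxel_scores))
        voxel_scores = [blend(v, r) if r is not None else v
                        for v, r in zip(voxel_scores, reprojected)]
    return pixel_scores, voxel_scores
```

In this sketch each round updates the 2D scores first and the 3D scores second, so every refinement sees the other task's most recent prediction; voxels or pixels without a counterpart are left unchanged.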

