NeuralDiff: Segmenting 3D objects that move in egocentric videos

by Vadim Tschernezki, et al.

Given a raw video sequence taken from a freely-moving camera, we study the problem of decomposing the observed 3D scene into a static background and a dynamic foreground containing the objects that move in the video sequence. This task is reminiscent of the classic background subtraction problem, but is significantly harder because all parts of the scene, static and dynamic, generate a large apparent motion due to the camera's large viewpoint change. In particular, we consider egocentric videos and further separate the dynamic component into objects and the actor that observes and moves them. We achieve this factorization by reconstructing the video via a triple-stream neural rendering network that explains the different motions based on corresponding inductive biases. We demonstrate that our method can successfully separate the different types of motion, outperforming recent neural rendering baselines at this task, and can accurately segment moving objects. We do so by assessing the method empirically on challenging videos from the EPIC-KITCHENS dataset, which we augment with appropriate annotations to create a new benchmark for the task of dynamic object segmentation on unconstrained video sequences in complex 3D environments.
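To make the idea of a multi-stream decomposition concrete, here is a minimal sketch of how several radiance streams could be composited along one camera ray, and how the per-stream contributions could then drive a segmentation. It is not the authors' implementation: it assumes standard NeRF-style volume rendering with additive densities, and the function name `composite_ray`, the stream order (static, object, actor) and the array shapes are all hypothetical choices made for illustration.

```python
import numpy as np


def composite_ray(sigmas, colors, deltas):
    """Composite S radiance streams along a single ray (illustrative sketch).

    sigmas: (S, N) non-negative densities, one row per stream
            (assumed order: static, object, actor).
    colors: (S, N, 3) RGB values predicted by each stream at each sample.
    deltas: (N,) distances between consecutive samples on the ray.

    Returns the rendered RGB value and an (S,) vector giving how much
    each stream contributed to the pixel.
    """
    # Assumption: densities of the streams add up at every sample.
    sigma_total = sigmas.sum(axis=0)                          # (N,)
    alpha = 1.0 - np.exp(-sigma_total * deltas)               # (N,)

    # Transmittance: probability that the ray reaches each sample.
    trans = np.cumprod(np.concatenate([[1.0], 1.0 - alpha[:-1]]))
    weights = trans * alpha                                   # (N,)

    # Each stream's share of the density at each sample.
    share = sigmas / np.maximum(sigma_total, 1e-10)           # (S, N)

    # Density-weighted colour mix per sample, then integrate along the ray.
    mixed = (share[..., None] * colors).sum(axis=0)           # (N, 3)
    rgb = (weights[:, None] * mixed).sum(axis=0)              # (3,)

    # Per-stream contribution to the pixel; usable as a soft mask value.
    stream_weights = (weights * share).sum(axis=1)            # (S,)
    return rgb, stream_weights
```

Under these assumptions, a pixel could be labelled as dynamic when the combined contribution of the object and actor streams (`stream_weights[1] + stream_weights[2]`) exceeds a chosen threshold; the threshold itself would be another free parameter of the sketch.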




D^2NeRF: Self-Supervised Decoupling of Dynamic and Static Objects from a Monocular Video

Given a monocular video, segmenting and decoupling dynamic objects while...

EPIC Fields: Marrying 3D Geometry and Video Understanding

Neural rendering is fuelling a unification of learning, 3D geometry and ...

Turning an Urban Scene Video into a Cinemagraph

This paper proposes an algorithm that turns a regular video capturing ur...

HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video

We introduce HOSNeRF, a novel 360 free-viewpoint rendering method that r...

CVABS: Moving Object Segmentation with Common Vector Approach for Videos

Background modelling is a fundamental step for several real-time compute...

Reconstructing Small 3D Objects in front of a Textured Background

We present a technique for a complete 3D reconstruction of small objects...

Towards Unbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation

Video portrait segmentation (VPS), aiming at segmenting prominent foregr...
