MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos

12/26/2022
by   Fengrui Tian, et al.
0

In this paper, we target at the problem of learning a generalizable dynamic radiance field from monocular videos. Different from most existing NeRF methods that are based on multiple views, monocular videos only contain one view at each timestamp, thereby suffering from ambiguity along the view direction in estimating point features and scene flows. Previous studies such as DynNeRF disambiguate point features by positional encoding, which is not transferable and severely limits the generalization ability. As a result, these methods have to train one independent model for each scene and suffer from heavy computational costs when applying to increasing monocular videos in real-world applications. To address this, We propose MonoNeRF to simultaneously learn point features and scene flows with point trajectory and feature correspondence constraints across frames. More specifically, we learn an implicit velocity field to estimate point trajectory from temporal features with Neural ODE, which is followed by a flow-based feature aggregation module to obtain spatial features along the point trajectory. We jointly optimize temporal and spatial features by training the network in an end-to-end manner. Experiments show that our MonoNeRF is able to learn from multiple scenes and support new applications such as scene editing, unseen frame synthesis, and fast novel scene adaptation.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

research
04/04/2023

Decoupling Dynamic Monocular Videos for Dynamic View Synthesis

The challenge of dynamic view synthesis from dynamic monocular videos, i...
research
06/03/2023

Context-TAP: Tracking Any Point Demands Spatial Context Features

We tackle the problem of Tracking Any Point (TAP) in videos, which speci...
research
05/13/2021

Dynamic View Synthesis from Dynamic Monocular Video

We present an algorithm for generating novel views at arbitrary viewpoin...
research
11/20/2022

DynIBaR: Neural Dynamic Image-Based Rendering

We address the problem of synthesizing novel views from a monocular vide...
research
03/28/2019

Multifaceted 4D Feature Segmentation and Extraction in Point and Field-based Datasets

The use of large-scale multifaceted data is common in a wide variety of ...
research
04/06/2023

Visualizing Skiers' Trajectories in Monocular Videos

Trajectories are fundamental to winning in alpine skiing. Tools enabling...
research
04/21/2023

Factored Neural Representation for Scene Understanding

A long-standing goal in scene understanding is to obtain interpretable a...

Please sign up or login with your details

Forgot password? Click here to reset