DynamicStereo: Consistent Dynamic Depth from Stereo Videos

05/03/2023
by   Nikita Karaev, et al.
1

We consider the problem of reconstructing a dynamic scene observed from a stereo camera. Most existing methods for depth from stereo treat different stereo frames independently, leading to temporally inconsistent depth predictions. Temporal consistency is especially important for immersive AR or VR scenarios, where flickering greatly diminishes the user experience. We propose DynamicStereo, a novel transformer-based architecture to estimate disparity for stereo videos. The network learns to pool information from neighboring frames to improve the temporal consistency of its predictions. Our architecture is designed to process stereo videos efficiently through divided attention layers. We also introduce Dynamic Replica, a new benchmark dataset containing synthetic videos of people and animals in scanned environments, which provides complementary training and evaluation data for dynamic stereo closer to real applications than existing datasets. Training with this dataset further improves the quality of predictions of our proposed DynamicStereo as well as prior methods. Finally, it acts as a benchmark for consistent stereo methods.

READ FULL TEXT

page 1

page 2

page 4

page 6

page 7

research
11/17/2021

Temporally Consistent Online Depth Estimation in Dynamic Scenes

Temporally consistent depth estimation is crucial for real-time applicat...
research
04/21/2022

A New Dataset and Transformer for Stereoscopic Video Super-Resolution

Stereo video super-resolution (SVSR) aims to enhance the spatial resolut...
research
06/19/2020

Consistency Guided Scene Flow Estimation

We present Consistency Guided Scene Flow Estimation (CGSF), a framework ...
research
09/03/2016

Towards Segmenting Consumer Stereo Videos: Benchmark, Baselines and Ensembles

Are we ready to segment consumer stereo videos? The amount of this data ...
research
09/30/2019

Depth Estimation in Nighttime using Stereo-Consistent Cyclic Translations

Most existing methods of depth from stereo are designed for daytime scen...
research
04/18/2023

Saliency-aware Stereoscopic Video Retargeting

Stereo video retargeting aims to resize an image to a desired aspect rat...
research
06/02/2023

Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis

This paper presents a novel approach, TeFS (Temporal-controlled Frame Sw...

Please sign up or login with your details

Forgot password? Click here to reset