ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

08/17/2023
by   Song Zhang, et al.
0

Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision which aims to reconstruct a scene using multi-view images with known camera parameters. However, the mainstream approaches represent the scene with a fixed all-pixel depth range and equal depth interval partition, which will result in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse-to-fine framework to achieve adaptive all-pixel depth range and depth interval. We predict a coarse depth map in the first stage, then an Adaptive Depth Range Prediction module is proposed in the second stage to zoom in the scene by leveraging the reference image and the obtained depth map in the first stage and predict a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve adaptive variable interval partition for pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which can allocate dense depth hypothesis planes around the potential ground truth depth value and vice versa to achieve more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and yields competitive generalization ability. Particularly, our method achieves the highest Acc and Overall on the DTU dataset, while attaining the highest Recall and F_1-score on the Tanks and Temples intermediate and advanced dataset. Moreover, our method also achieves the lowest e_1 and e_3 on the BlendedMVS dataset and the highest Acc and F_1-score on the ETH 3D dataset, surpassing all listed methods.Project website: https://github.com/zs670980918/ARAI-MVSNet

READ FULL TEXT

page 13

page 16

page 42

research
03/26/2021

DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range

To obtain high-resolution depth maps, some previous learning-based multi...
research
04/17/2019

Multi-Scale Geometric Consistency Guided Multi-View Stereo

In this paper, we propose an efficient multi-scale geometric consistency...
research
10/14/2017

An Adaptive Framework for Missing Depth Inference Using Joint Bilateral Filter

Depth imaging has largely focused on sensor and intrinsics properties. H...
research
12/15/2021

Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Multi-view depth estimation methods typically require the computation of...
research
07/18/2023

Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells

Learning-based multi-view stereo (MVS) methods deal with predicting accu...
research
12/04/2021

Generalized Binary Search Network for Highly-Efficient Multi-View Stereo

Multi-view Stereo (MVS) with known camera parameters is essentially a 1D...
research
03/21/2023

HRDFuse: Monocular 360°Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions

Depth estimation from a monocular 360 image is a burgeoning problem owin...

Please sign up or login with your details

Forgot password? Click here to reset