EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization

09/14/2023
by Minjung Kim, et al.

Visual localization is the task of estimating the 6-DoF camera pose of a query image within a provided 3D reference map. Thanks to recent advances in 3D sensors, 3D point clouds are becoming a more accurate and affordable option for building the reference map, but matching the points of a 3D point cloud to the pixels of a 2D image for visual localization remains challenging. Existing approaches that jointly learn 2D-3D feature matching suffer from a low number of inliers because of representational differences between the two modalities, and methods that sidestep this problem by recasting matching as classification suffer from poor refinement. In this work, we propose EP2P-Loc, a novel large-scale visual localization method that mitigates this appearance discrepancy and enables end-to-end training for pose estimation. To increase the number of inliers, we propose a simple algorithm to remove 3D points that are invisible in the image, and we find all 2D-3D correspondences without keypoint detection. To reduce memory usage and search complexity, we take a coarse-to-fine approach: we extract patch-level features from 2D images, perform 2D patch classification for each 3D point, and then obtain the exact corresponding 2D pixel coordinates through positional encoding. Finally, for the first time in this task, we employ a differentiable PnP solver for end-to-end training. In experiments on newly curated large-scale indoor and outdoor benchmarks based on 2D-3D-S and KITTI, our method achieves state-of-the-art performance compared with existing visual localization and image-to-point cloud registration methods.
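
To make the coarse-to-fine idea concrete, here is a minimal sketch of how a 3D point could be assigned a coarse 2D patch label and a fine pixel coordinate. It is an illustration under stated assumptions, not the authors' implementation: it assumes a pinhole camera, points already expressed in the camera frame, and a regular grid of square patches. The names `project_point` and `assign_patches` are hypothetical. The visibility test is only a simple in-front-of-camera and in-bounds check; it does not handle occlusion, and the abstract does not describe the paper's own invisible-point removal algorithm.

```python
from math import ceil
from typing import List, Optional, Sequence, Tuple

Pixel = Tuple[float, float]


def project_point(point_cam: Sequence[float],
                  fx: float, fy: float, cx: float, cy: float) -> Optional[Pixel]:
    """Project one camera-frame 3D point with a pinhole model.

    Returns None for points at or behind the camera plane.
    """
    x, y, z = point_cam
    if z <= 0:
        return None
    return fx * x / z + cx, fy * y / z + cy


def assign_patches(points_cam: Sequence[Sequence[float]],
                   intrinsics: Tuple[float, float, float, float],
                   image_size: Tuple[int, int],
                   patch_size: int) -> List[Optional[Tuple[int, Pixel]]]:
    """Give each point a (patch index, pixel coordinate) pair, or None.

    The patch index is the coarse label; the pixel coordinate is the
    fine target inside that patch. Points outside the image get None.
    """
    fx, fy, cx, cy = intrinsics
    width, height = image_size
    # Ceiling division so a partial patch at the right edge keeps its own column.
    patches_per_row = ceil(width / patch_size)

    labels: List[Optional[Tuple[int, Pixel]]] = []
    for point in points_cam:
        uv = project_point(point, fx, fy, cx, cy)
        if uv is None or not (0 <= uv[0] < width and 0 <= uv[1] < height):
            labels.append(None)  # assumption: frustum check only, no occlusion test
            continue
        col = int(uv[0] // patch_size)
        row = int(uv[1] // patch_size)
        labels.append((row * patches_per_row + col, uv))
    return labels
```

In a learned pipeline, labels of this kind would plausibly serve as classification targets for each 3D point, with the pixel coordinate supervising the fine step. How the paper actually builds its targets is not stated in the abstract.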


research
04/22/2019

2D3D-MatchNet: Learning to Match Keypoints Across 2D Image and 3D Point Cloud

Large-scale point cloud generated from 3D sensors is more accurate than ...
research
08/10/2023

2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds

The commonly adopted detect-then-match approach to registration finds di...
research
07/09/2019

Sparse-to-Dense Hypercolumn Matching for Long-Term Visual Localization

We propose a novel approach to feature point matching, suitable for robu...
research
10/24/2020

Improving the generalization of network based relative pose regression: dimension reduction as a regularizer

Visual localization occupies an important position in many areas such as...
research
04/02/2021

End-to-end learning of keypoint detection and matching for relative pose estimation

We propose a new method for estimating the relative pose between two ima...
research
03/01/2021

P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching

Accurately describing and detecting 2D and 3D keypoints is crucial to es...
research
03/14/2023

PATS: Patch Area Transportation with Subdivision for Local Feature Matching

Local feature matching aims at establishing sparse correspondences betwe...
