Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

01/10/2023
by Xindi Wu, et al.

Self-driving vehicles rely on urban street maps for autonomous navigation. In this paper, we introduce Pix2Map, a method for inferring urban street map topology directly from ego-view images, as needed to continually update and expand existing maps. This is a challenging task, as we need to infer a complex urban road topology directly from raw image data. The main insight of this paper is that this problem can be posed as cross-modal retrieval by learning a joint, cross-modal embedding space for images and existing maps, represented as discrete graphs that encode the topological layout of the visual surroundings. We conduct our experimental evaluation using the Argoverse dataset and show that it is indeed possible to accurately retrieve street maps corresponding to both seen and unseen roads solely from image data. Moreover, we show that our retrieved maps can be used to update or expand existing maps and even show proof-of-concept results for visual localization and image retrieval from spatial graphs.
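
The retrieval step described in the abstract can be illustrated with a minimal sketch. It assumes that an image encoder and a graph encoder (not shown, and not specified here) have already mapped an ego-view image and a library of street-map graphs into a shared embedding space. The function names, the use of cosine similarity, and the toy vectors are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale vectors along the last axis to unit length."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def retrieve_graphs(image_emb, graph_embs, k=1):
    """Return indices and cosine similarities of the k library graphs
    closest to the image embedding (hypothetical helper)."""
    query = l2_normalize(np.asarray(image_emb, dtype=float))
    library = l2_normalize(np.asarray(graph_embs, dtype=float))
    sims = library @ query          # cosine similarity to each graph
    order = np.argsort(-sims)[:k]   # highest similarity first
    return order, sims[order]


# Toy data: three graph embeddings and one image embedding (made up).
graph_embs = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])
image_emb = np.array([0.1, 0.9, 0.2])

top_idx, top_sims = retrieve_graphs(image_emb, graph_embs, k=2)
print(top_idx, top_sims)
```

In this toy setup the image embedding lies closest to the second graph, so that graph is ranked first. In a real system the library would hold embeddings of existing map graphs, and the retrieved graph would serve as the inferred local street map.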


