Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation

03/15/2023
by   Zipeng Qi, et al.
0

The mainstream CNN-based remote sensing (RS) image semantic segmentation approaches typically rely on massive labeled training data. Such a paradigm struggles with the problem of RS multi-view scene segmentation with limited labeled views due to the lack of considering 3D information within the scene. In this paper, we propose ”Implicit Ray-Transformer (IRT)” based on Implicit Neural Representation (INR), for RS scene semantic segmentation with sparse labels (such as 4-6 labels per 100 images). We explore a new way of introducing multi-view 3D structure priors to the task for accurate and view-consistent semantic segmentation. The proposed method includes a two-stage learning process. In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene based on multi-view images. In the second stage, we design a Ray Transformer to leverage the relations between the neural field 3D features and 2D texture features for learning better semantic representations. Different from previous methods that only consider 3D prior or 2D features, we incorporate additional 2D texture information and 3D prior by broadcasting CNN features to different point features along the sampled ray. To verify the effectiveness of the proposed method, we construct a challenging dataset containing six synthetic sub-datasets collected from the Carla platform and three real sub-datasets from Google Maps. Experiments show that the proposed method outperforms the CNN-based methods and the state-of-the-art INR-based segmentation methods in quantitative and qualitative metrics.

READ FULL TEXT

page 1

page 2

page 4

page 7

page 9

page 10

page 11

page 12

research
05/18/2022

Remote Sensing Novel View Synthesis with Implicit Multiplane Representations

Novel view synthesis of remote sensing scenes is of great significance f...
research
06/06/2018

Deep Vessel Segmentation By Learning Graphical Connectivity

We propose a novel deep-learning-based system for vessel segmentation. E...
research
09/28/2020

RS-MetaNet: Deep meta metric learning for few-shot remote sensing scene classification

Training a modern deep neural network on massive labeled samples is the ...
research
08/10/2023

A Comparative Assessment of Multi-view fusion learning for Crop Classification

With a rapidly increasing amount and diversity of remote sensing (RS) da...
research
04/04/2022

RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo

Learning-based multi-view stereo (MVS) has by far centered around 3D con...
research
04/16/2022

GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation

Birds-eye-view (BEV) semantic segmentation is critical for autonomous dr...
research
04/12/2019

Adaptive Weighting Multi-Field-of-View CNN for Semantic Segmentation in Pathology

Automated digital histopathology image segmentation is an important task...

Please sign up or login with your details

Forgot password? Click here to reset