SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery

07/17/2022
by   Yezhen Cong, et al.
15

Unsupervised pre-training methods for large vision models have shown to enhance performance on downstream supervised tasks. Developing similar techniques for satellite imagery presents significant opportunities as unlabelled data is plentiful and the inherent temporal and multi-spectral structure provides avenues to further improve existing pre-training strategies. In this paper, we present SatMAE, a pre-training framework for temporal or multi-spectral satellite imagery based on Masked Autoencoder (MAE). To leverage temporal information, we include a temporal embedding along with independently masking image patches across time. In addition, we demonstrate that encoding multi-spectral data as groups of bands with distinct spectral positional encodings is beneficial. Our approach yields strong improvements over previous state-of-the-art techniques, both in terms of supervised learning performance on benchmark datasets (up to ↑ 7%), and transfer learning performance on downstream remote sensing tasks, including land cover classification (up to ↑ 14%) and semantic segmentation.

READ FULL TEXT

page 2

page 5

page 9

page 20

page 21

research
05/22/2023

Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

Research in self-supervised learning (SSL) with natural images has progr...
research
10/12/2020

Spectral Synthesis for Satellite-to-Satellite Translation

Earth observing satellites carrying multi-spectral sensors are widely us...
research
04/23/2023

SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models

Interpreting remote sensing imagery enables numerous downstream applicat...
research
05/07/2019

Learning to Interpret Satellite Images in Global Scale Using Wikipedia

Despite recent progress in computer vision, finegrained interpretation o...
research
08/03/2022

Unsupervised Discovery of Semantic Concepts in Satellite Imagery with Style-based Wavelet-driven Generative Models

In recent years, considerable advancements have been made in the area of...
research
11/16/2022

Fair contrastive pre-training for geographic images

Contrastive representation learning is widely employed in visual recogni...
research
10/22/2022

Spectrum-BERT: Pre-training of Deep Bidirectional Transformers for Spectral Classification of Chinese Liquors

Spectral detection technology, as a non-invasive method for rapid detect...

Please sign up or login with your details

Forgot password? Click here to reset