Skip to main navigation Skip to search Skip to main content

SSL4EO-S12: A large-scale multimodal, multitemporal dataset for self-supervised learning in Earth observation [Software and Data Sets]

  • Yi Wang
  • , Nassim Ait Ali Braham
  • , Zhitong Xiong
  • , Chenying Liu
  • , Conrad M. Albrecht
  • , Xiao Xiang Zhu
  • Technical University of Munich
  • Deutsches Zentrum für Luft- und Raumfahrt (DLR)

Research output: Contribution to journalArticlepeer-review

102 Scopus citations

Abstract

Self-supervised pretraining bears the potential to generate expressive representations from large-scale Earth observation (EO) data without human annotation. However, most existing pretraining in the field is based on ImageNet or medium-sized, labeled remote sensing (RS) datasets. In this article, we share an unlabeled dataset Self-Supervised Learning for Earth Observation-Sentinel-1/2 (SSL4EO-S12) to assemble a large-scale, global, multimodal, and multiseasonal corpus of satellite imagery. We demonstrate SSL4EO-S12 to succeed in self-supervised pretraining for a set of representative methods: momentum contrast (MoCo), self-distillation with no labels (DINO), masked autoencoders (MAE), and data2vec, and multiple downstream applications, including scene classification, semantic segmentation, and change detection. Our benchmark results prove the effectiveness of SSL4EO-S12 compared to existing datasets. The dataset, related source code, and pretrained models are available at https://github.com/zhu-xlab/SSL4EO-S12.

Original languageEnglish
Pages (from-to)98-106
Number of pages9
JournalIEEE Geoscience and Remote Sensing Magazine
Volume11
Issue number3
DOIs
StatePublished - 1 Sep 2023

Fingerprint

Dive into the research topics of 'SSL4EO-S12: A large-scale multimodal, multitemporal dataset for self-supervised learning in Earth observation [Software and Data Sets]'. Together they form a unique fingerprint.

Cite this