Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Core-S2L1C-SSL4EO 🟥🟩🟦🟧🟨🟪 🛰️
Dataset | Modality | Number of Embeddings | Sensing Type | Total Comments | Source Dataset | Source Model | Size
Core-S2L1C-SSL4EO | Sentinel-2 (Level 1C) | 56,147,150 | Multi-Spectral | General-Purpose Global | Core-S2L1C | SSL4EO-ResNet50-DINO | 252.9 GB
Content
Field | Type | Description
unique_id | string | hash generated from geometry, time, product_id, and embedding model
embedding | array | raw embedding array
grid_cell | string | Major TOM… See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-S2L1C-SSL4EO.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
SSL4EO-S12-downstream
Welcome to the SSL4EO-S12-downstream dataset. This dataset is used in the Embed2Scale Challenge. SSL4EO-S12-downstream is an Earth Observation (EO) dataset of downstream tasks. It is released as a standalone dataset together with the NeuCo-Bench neural compression benchmarking framework. Parts of the SSL4EO-S12-downstream dataset were used in the 2025 CVPR EarthVision data challenge, and the dev and eval phases of the challenge can be recreated. For instructions… See the full description on the dataset page: https://huggingface.co/datasets/embed2scale/SSL4EO-S12-downstream.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Core-S1RTC-SSL4EO 📡⚡🛰️
Dataset | Modality | Number of Embeddings | Sensing Type | Total Comments | Source Dataset | Source Model | Size
Core-S1RTC-SSL4EO | Sentinel-1 RTC | 36,748,875 | SAR | General-Purpose Global | Core-S1RTC | SSL4EO-ResNet50-MOCO | 332.5 GB
Content
Field | Type | Description
unique_id | string | hash generated from geometry, time, product_id, and embedding model
embedding | array | raw embedding array
grid_cell | string | Major TOM cell
grid_row_u | int | Major… See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-S1RTC-SSL4EO.
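The field tables above describe each record as a unique_id, an embedding array, and a Major TOM grid_cell. As a rough illustration of how records with this shape might be compared, here is a minimal sketch of a cosine-similarity lookup. The records, their values, the three-element vectors, and the helper names cosine_similarity and nearest are all hypothetical; real embeddings would be read from the dataset files and are much longer.

```python
import math

# Hypothetical records that mirror the listed fields (unique_id, embedding,
# grid_cell). The values are invented for illustration only.
records = [
    {"unique_id": "a1", "embedding": [1.0, 0.0, 0.0], "grid_cell": "cell_A"},
    {"unique_id": "b2", "embedding": [0.6, 0.8, 0.0], "grid_cell": "cell_B"},
    {"unique_id": "c3", "embedding": [0.0, 0.0, 1.0], "grid_cell": "cell_C"},
]


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # A zero vector has no direction; treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_u * norm_v)


def nearest(query, candidates):
    """Return the candidate record whose embedding is most similar to query."""
    return max(candidates, key=lambda r: cosine_similarity(query, r["embedding"]))


match = nearest([0.5, 0.9, 0.0], records)
print(match["unique_id"], match["grid_cell"])
```

A linear scan like this is only practical for small samples; at the scale listed above (tens of millions of embeddings), an approximate nearest-neighbour index would be the more usual choice.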
https://choosealicense.com/licenses/cc0-1.0/
SSL4EO-L: Self-Supervised Learning for Earth Observation for the Landsat family of satellites.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset details: https://github.com/DLR-MF-DAS/SSL4EO-S12-v1.1
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for NOLDO-S12 Dataset
NoLDO-S12 is a multi-modal dataset for remote sensing image segmentation from Sentinel-1&2 images, which contains two splits: SSL4EO-S12@NoL with noisy labels for pretraining, and two downstream datasets, SSL4EO-S12@DW and SSL4EO-S12@OSM, with exact labels for transfer learning.
Dataset Details
Curated by: Chenying Liu, Conrad M Albrecht, Yi Wang, Xiao Xiang Zhu
License: MIT
Repository: More details at… See the full description on the dataset page: https://huggingface.co/datasets/vikki23/NoLDO-S12.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset Card for Copernicus-Pretrain
Copernicus-Pretrain is a large-scale EO pretraining dataset with 18.7M aligned images covering all major Sentinel missions (S1, S2, S3, S5P). It is officially named Copernicus-Pretrain and is also referred to as SSL4EO-S ("S" stands for Sentinel), since it extends SSL4EO-S12 to the whole Sentinel series.
Dataset Details
Copernicus-Pretrain contains 18.7M aligned imagery from all major Sentinel missions in operation (Sentinel-1 SAR, Sentinel-2… See the full description on the dataset page: https://huggingface.co/datasets/wangyi111/Copernicus-Pretrain.