7 datasets found
  1. Core-S2L1C-SSL4EO

    • huggingface.co
    Updated Dec 10, 2024
    Cite
    Major TOM (2024). Core-S2L1C-SSL4EO [Dataset]. http://doi.org/10.57967/hf/5241
    Dataset updated
    Dec 10, 2024
    Dataset authored and provided by
    Major TOM
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Core-S2L1C-SSL4EO 🟥🟩🟦🟧🟨🟪 🛰️

    Dataset: Core-S2L1C-SSL4EO
    Modality: Sentinel-2 (Level 1C)
    Number of Embeddings: 56,147,150
    Sensing Type: Multi-Spectral
    Total Comments: General-Purpose Global
    Source Dataset: Core-S2L1C
    Source Model: SSL4EO-ResNet50-DINO
    Size: 252.9 GB

    Content

    Field | Type | Description
    unique_id | string | hash generated from geometry, time, product_id, and embedding model
    embedding | array | raw embedding array
    grid_cell | string | Major TOM…

    See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-S2L1C-SSL4EO.

  2. SSL4EO-S12-downstream

    • huggingface.co
    Updated Mar 10, 2025
    Cite
    Embed2Scale (2025). SSL4EO-S12-downstream [Dataset]. http://doi.org/10.57967/hf/5912
    Dataset updated
    Mar 10, 2025
    Dataset authored and provided by
    Embed2Scale
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SSL4EO-S12-downstream

    Welcome to the SSL4EO-S12-downstream dataset, which is used in the Embed2Scale Challenge. SSL4EO-S12-downstream is an Earth Observation (EO) dataset of downstream tasks, released as a standalone dataset together with the NeuCo-Bench neural compression benchmarking framework. Parts of the dataset were used in the 2025 CVPR EarthVision data challenge, and the dev and eval phases of that challenge can be recreated. For instructions… See the full description on the dataset page: https://huggingface.co/datasets/embed2scale/SSL4EO-S12-downstream.

  3. Core-S1RTC-SSL4EO

    • huggingface.co
    Updated Dec 10, 2024
    Cite
    Major TOM (2024). Core-S1RTC-SSL4EO [Dataset]. http://doi.org/10.57967/hf/5244
    Dataset updated
    Dec 10, 2024
    Dataset authored and provided by
    Major TOM
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Core-S1RTC-SSL4EO 📡⚡🛰️

    Dataset: Core-S1RTC-SSL4EO
    Modality: Sentinel-1 RTC
    Number of Embeddings: 36,748,875
    Sensing Type: SAR
    Total Comments: General-Purpose Global
    Source Dataset: Core-S1RTC
    Source Model: SSL4EO-ResNet50-MOCO
    Size: 332.5 GB

    Content

    Field | Type | Description
    unique_id | string | hash generated from geometry, time, product_id, and embedding model
    embedding | array | raw embedding array
    grid_cell | string | Major TOM cell
    grid_row_u | int | Major…

    See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-S1RTC-SSL4EO.

  4. ssl4eo_l

    • huggingface.co
    Updated Jun 7, 2023
    Cite
    TorchGeo (2023). ssl4eo_l [Dataset]. https://huggingface.co/datasets/torchgeo/ssl4eo_l
    Dataset updated
    Jun 7, 2023
    Dataset authored and provided by
    TorchGeo
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    SSL4EO-L: Self-Supervised Learning for Earth Observation for the Landsat family of satellites.

  5. SSL4EO-S12-v1.1

    • huggingface.co
    Updated Jul 23, 2025
    Cite
    Embed2Scale (2025). SSL4EO-S12-v1.1 [Dataset]. https://huggingface.co/datasets/embed2scale/SSL4EO-S12-v1.1
    Dataset updated
    Jul 23, 2025
    Dataset authored and provided by
    Embed2Scale
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

  6. NoLDO-S12

    • huggingface.co
    Updated May 31, 2025
    Cite
    Chenying Liu (2025). NoLDO-S12 [Dataset]. https://huggingface.co/datasets/vikki23/NoLDO-S12
    Dataset updated
    May 31, 2025
    Authors
    Chenying Liu
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for NoLDO-S12

    NoLDO-S12 is a multi-modal dataset for remote sensing image segmentation from Sentinel-1 and Sentinel-2 images. It contains two splits: SSL4EO-S12@NoL, with noisy labels for pretraining, and two downstream datasets, SSL4EO-S12@DW and SSL4EO-S12@OSM, with exact labels for transfer learning.

    Dataset Details

    Curated by: Chenying Liu, Conrad M Albrecht, Yi Wang, Xiao Xiang Zhu
    License: MIT
    Repository: More details at…

    See the full description on the dataset page: https://huggingface.co/datasets/vikki23/NoLDO-S12.

  7. Copernicus-Pretrain

    • huggingface.co
    Updated Mar 31, 2025
    Cite
    Yi Wang (2025). Copernicus-Pretrain [Dataset]. https://huggingface.co/datasets/wangyi111/Copernicus-Pretrain
    Dataset updated
    Mar 31, 2025
    Authors
    Yi Wang
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Copernicus-Pretrain

    Copernicus-Pretrain is a large-scale EO pretraining dataset with 18.7M aligned images covering all major Sentinel missions (S1, S2, S3, S5P). The dataset is officially named Copernicus-Pretrain and is also referred to as SSL4EO-S ("S" stands for Sentinel), since it extends SSL4EO-S12 to the whole Sentinel series.

    Dataset Details

    Copernicus-Pretrain contains 18.7M aligned images from all major Sentinel missions in operation (Sentinel-1 SAR, Sentinel-2… See the full description on the dataset page: https://huggingface.co/datasets/wangyi111/Copernicus-Pretrain.


