7 datasets found
  1. h

    cloudsen12

    • huggingface.co
    Updated Jan 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    tacofoundation (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/tacofoundation/cloudsen12
    Explore at:
    Dataset updated
    Jan 4, 2025
    Dataset authored and provided by
    tacofoundation
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    This dataset follows the TACO specification.

      cloudsen12plus
    

    Website: https://cloudsen12.github.io/ version: 1.1.2 The largest dataset of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 CloudSEN12+ version 1.1.0 is a significant extension of the CloudSEN12 dataset, which doubles the number of expert-reviewed labels, making it, by a large margin, the largest cloud detection dataset to date for Sentinel-2. All labels from the previous version have… See the full description on the dataset page: https://huggingface.co/datasets/tacofoundation/cloudsen12.

  2. h

    CloudSEN12-nolabel

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cesar Aybar, CloudSEN12-nolabel [Dataset]. https://huggingface.co/datasets/csaybar/CloudSEN12-nolabel
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Cesar Aybar
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    🚨 New Dataset Version Released! We are excited to announce the release of Version [1.1] of our dataset! This update includes: [L2A & L1C support]. [Temporal support]. [Check the data without downloading (Cloud-optimized properties)]. 📥 Go to: https://huggingface.co/datasets/tacofoundation/cloudsen12 and follow the instructions in colab

      CloudSEN12 NOLABEL
    
    
    
    
    
      A Benchmark Dataset for Cloud Semantic Understanding
    

    CloudSEN12 is a LARGE dataset (~1 TB) for cloud semantic… See the full description on the dataset page: https://huggingface.co/datasets/csaybar/CloudSEN12-nolabel.

  3. S

    CloudSEN12 - a global dataset for semantic understanding of cloud and cloud...

    • scidb.cn
    • produccioncientifica.usal.es
    • +1more
    Updated Nov 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cesar Luis; Luis Ysuhuaylas; Jhomira Loja; Karen Gonzales; Fernando Herrera; Lesly Bautista; Roy Yali; Angie Flores; Lissette Diaz; Nicole Cuenca; Wendy Espinoza; Fernando Prudencio; Joselyn Inga; Valeria Llactayo; David Montero; Martin Sudmanns; Dirk Tiede; Gonzalo Mateo-García; Luis Gómez-Chova (2022). CloudSEN12 - a global dataset for semantic understanding of cloud and cloud shadow in Sentinel-2 [Dataset]. http://doi.org/10.57760/sciencedb.06669
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 28, 2022
    Dataset provided by
    Science Data Bank
    Authors
    Cesar Luis; Luis Ysuhuaylas; Jhomira Loja; Karen Gonzales; Fernando Herrera; Lesly Bautista; Roy Yali; Angie Flores; Lissette Diaz; Nicole Cuenca; Wendy Espinoza; Fernando Prudencio; Joselyn Inga; Valeria Llactayo; David Montero; Martin Sudmanns; Dirk Tiede; Gonzalo Mateo-García; Luis Gómez-Chova
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    CloudSEN12 is a large dataset for cloud semantic understanding that consists of 9880 regions of interest (ROIs). Each ROI has five 5090x5090 meters image patches (IPs) collected on different dates; we manually choose the images to guarantee that each IP inside an ROI matches one of the following cloud cover groups:- clear (0%)- low-cloudy (1% - 25%) - almost clear (25% - 45%)- mid-cloudy (45% - 65%)- cloudy (65% >)An IP is the core unit in CloudSEN12. Each IP contains data from Sentinel-2 optical levels 1C and 2A, Sentinel-1 Synthetic Aperture Radar (SAR), digital elevation model, surface water occurrence, land cover classes, and cloud mask results from eight cutting-edge cloud detection algorithms. Besides, in order to support standard, weakly, and self-/semi-supervised learning procedures, cloudSEN12 includes three distinct forms of hand-crafted labelling data: high-quality, scribble, and no annotation. Consequently, each ROI is randomly assigned to a different annotation group:2000 ROIs with pixel-level annotation, where the average annotation time is 150 minutes (high-quality group).2000 ROIs with scribble-level annotation, where the annotation time is 15 minutes (scribble group).5880 ROIs with annotation only in the cloud-free (0\%) image (no annotation group).For high-quality labels, we use the Intelligence foR Image Segmentation\cite{iris2019} (IRIS) active learning technology, combining human photo-interpretation and machine learning. For scribble, ground truth pixels were drawn using IRIS but without ML support. Finally, the no-annotation dataset is generated automatically, with manual annotation only in the clear image patch. A backup of the dataset in STAC format is available here: https://shorturl.at/cgjtz. Check out our website https://cloudsen12.github.io/ for examples.

  4. S

    CloudSEN12+: The largest collection of expert-labeled pixels for cloud and...

    • scidb.cn
    • producciocientifica.uv.es
    Updated Apr 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cesar Aybar; Lesly Bautista; Julio Contreras; Fernando Prudencio; Daryl Ayala; David Montero; Jhomira Loja; Luis Ysuhuaylas; Fernando Herrera; Karen Gonzales; Jeanett Valladares; Lucy A. Flores; Evelin Mamani; Maria Quiñonez; Rai Fajardo; Wendy Espinoza; Antonio Limas; Roy Yali; Bram Willems; Raúl Loayza-Muro; Martín Leyva; Alejandro Alcántara; Gonzalo Mateo-García; Luis Gómez-Chova (2024). CloudSEN12+: The largest collection of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 [Dataset]. http://doi.org/10.57760/sciencedb.17702
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 25, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Cesar Aybar; Lesly Bautista; Julio Contreras; Fernando Prudencio; Daryl Ayala; David Montero; Jhomira Loja; Luis Ysuhuaylas; Fernando Herrera; Karen Gonzales; Jeanett Valladares; Lucy A. Flores; Evelin Mamani; Maria Quiñonez; Rai Fajardo; Wendy Espinoza; Antonio Limas; Roy Yali; Bram Willems; Raúl Loayza-Muro; Martín Leyva; Alejandro Alcántara; Gonzalo Mateo-García; Luis Gómez-Chova
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Detecting and screening clouds is the first step in any optical remote sensing (RS) analysis. Cloud formation is diverse, presenting many shapes, thicknesses, and altitudes. This variety poses a significant challenge to developing effective cloud detection algorithms since most datasets shortfall an unbiased representation. To address this issue, we have built CloudSEN12+, a significant expansion of the CloudSEN12 dataset. This new dataset doubles the expert-labeled pixels, making it the largest cloud detection dataset for Sentinel-2 imagery up to date. We have carefully reviewed and refined previous human labels in this new release to ensure maximum trustworthiness. We hope CloudSEN12+ will be a valuable resource for the cloud detection research community.

  5. h

    cloudsen12

    • huggingface.co
    Updated Sep 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The AI Alliance (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/aialliance/cloudsen12
    Explore at:
    Dataset updated
    Sep 20, 2025
    Dataset authored and provided by
    The AI Alliance
    Description

    aialliance/cloudsen12 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    mlstac-demo

    • huggingface.co
    Updated Jan 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jair Francisco Flores Farfan (2020). mlstac-demo [Dataset]. https://huggingface.co/datasets/jfloresf/mlstac-demo
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 1, 2020
    Authors
    Jair Francisco Flores Farfan
    Description

    cloudsen12

    A dataset about clouds from Sentinel-2 CloudSEN12 is a LARGE dataset (~1 TB) for cloud semantic understanding that consists of 49,400 image patches (IP) that are evenly spread throughout all continents except Antarctica. Each IP covers 5090 x 5090 meters and contains data from Sentinel-2 levels 1C and 2A, hand-crafted annotations of thick and thin clouds and cloud shadows, Sentinel-1 Synthetic Aperture Radar (SAR), digital elevation model, surface water occurrence, land… See the full description on the dataset page: https://huggingface.co/datasets/jfloresf/mlstac-demo.

  7. h

    Copernicus-Bench

    • huggingface.co
    Updated Mar 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yi Wang (2025). Copernicus-Bench [Dataset]. https://huggingface.co/datasets/wangyi111/Copernicus-Bench
    Explore at:
    Dataset updated
    Mar 31, 2025
    Authors
    Yi Wang
    Description

    Dataset Card for Copernicus-Bench

    A hierarchical ML benchmark for Copernicus Sentinels, with 15 datasets spread into three task levels covering all major Sentinel missions (S1,2,3,5P). (Officially named "Copernicus-Bench", initially named "SentinelBench")

      Dataset Details
    

    Level Name Modality Task

    Images

    Image Size

    Classes

    Source License

    L1 Cloud-S2 S2 TOA segmentation (cloud) 1699/567/551 512x512x13 4 CloudSEN12 CC 0 1.0

    L1 Cloud-S3 S3 OLCI… See the full description on the dataset page: https://huggingface.co/datasets/wangyi111/Copernicus-Bench.

  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
tacofoundation (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/tacofoundation/cloudsen12

cloudsen12

cloudsen12plus

tacofoundation/cloudsen12

Explore at:
84 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jan 4, 2025
Dataset authored and provided by
tacofoundation
License

https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

Description

This dataset follows the TACO specification.

  cloudsen12plus

Website: https://cloudsen12.github.io/ version: 1.1.2 The largest dataset of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 CloudSEN12+ version 1.1.0 is a significant extension of the CloudSEN12 dataset, which doubles the number of expert-reviewed labels, making it, by a large margin, the largest cloud detection dataset to date for Sentinel-2. All labels from the previous version have… See the full description on the dataset page: https://huggingface.co/datasets/tacofoundation/cloudsen12.

Search
Clear search
Close search
Google apps
Main menu