7 datasets found

h
cloudsen12
huggingface.co
Updated Jan 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
tacofoundation (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/tacofoundation/cloudsen12
Explore at:
Dataset updated
Jan 4, 2025
Dataset authored and provided by
tacofoundation
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
This dataset follows the TACO specification.

cloudsen12plus

Website: https://cloudsen12.github.io/ version: 1.1.2 The largest dataset of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 CloudSEN12+ version 1.1.0 is a significant extension of the CloudSEN12 dataset, which doubles the number of expert-reviewed labels, making it, by a large margin, the largest cloud detection dataset to date for Sentinel-2. All labels from the previous version have… See the full description on the dataset page: https://huggingface.co/datasets/tacofoundation/cloudsen12.
h
CloudSEN12-nolabel
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cesar Aybar, CloudSEN12-nolabel [Dataset]. https://huggingface.co/datasets/csaybar/CloudSEN12-nolabel
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Cesar Aybar
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
🚨 New Dataset Version Released! We are excited to announce the release of Version [1.1] of our dataset! This update includes: [L2A & L1C support]. [Temporal support]. [Check the data without downloading (Cloud-optimized properties)]. 📥 Go to: https://huggingface.co/datasets/tacofoundation/cloudsen12 and follow the instructions in colab

CloudSEN12 NOLABEL A Benchmark Dataset for Cloud Semantic Understanding

CloudSEN12 is a LARGE dataset (~1 TB) for cloud semantic… See the full description on the dataset page: https://huggingface.co/datasets/csaybar/CloudSEN12-nolabel.
S
CloudSEN12 - a global dataset for semantic understanding of cloud and cloud...
scidb.cn
produccioncientifica.usal.es
+1more
Updated Nov 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cesar Luis; Luis Ysuhuaylas; Jhomira Loja; Karen Gonzales; Fernando Herrera; Lesly Bautista; Roy Yali; Angie Flores; Lissette Diaz; Nicole Cuenca; Wendy Espinoza; Fernando Prudencio; Joselyn Inga; Valeria Llactayo; David Montero; Martin Sudmanns; Dirk Tiede; Gonzalo Mateo-García; Luis Gómez-Chova (2022). CloudSEN12 - a global dataset for semantic understanding of cloud and cloud shadow in Sentinel-2 [Dataset]. http://doi.org/10.57760/sciencedb.06669
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.06669
Dataset updated
Nov 28, 2022
Dataset provided by
Science Data Bank
Authors
Cesar Luis; Luis Ysuhuaylas; Jhomira Loja; Karen Gonzales; Fernando Herrera; Lesly Bautista; Roy Yali; Angie Flores; Lissette Diaz; Nicole Cuenca; Wendy Espinoza; Fernando Prudencio; Joselyn Inga; Valeria Llactayo; David Montero; Martin Sudmanns; Dirk Tiede; Gonzalo Mateo-García; Luis Gómez-Chova
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
CloudSEN12 is a large dataset for cloud semantic understanding that consists of 9880 regions of interest (ROIs). Each ROI has five 5090x5090 meters image patches (IPs) collected on different dates; we manually choose the images to guarantee that each IP inside an ROI matches one of the following cloud cover groups:- clear (0%)- low-cloudy (1% - 25%) - almost clear (25% - 45%)- mid-cloudy (45% - 65%)- cloudy (65% >)An IP is the core unit in CloudSEN12. Each IP contains data from Sentinel-2 optical levels 1C and 2A, Sentinel-1 Synthetic Aperture Radar (SAR), digital elevation model, surface water occurrence, land cover classes, and cloud mask results from eight cutting-edge cloud detection algorithms. Besides, in order to support standard, weakly, and self-/semi-supervised learning procedures, cloudSEN12 includes three distinct forms of hand-crafted labelling data: high-quality, scribble, and no annotation. Consequently, each ROI is randomly assigned to a different annotation group:2000 ROIs with pixel-level annotation, where the average annotation time is 150 minutes (high-quality group).2000 ROIs with scribble-level annotation, where the annotation time is 15 minutes (scribble group).5880 ROIs with annotation only in the cloud-free (0\%) image (no annotation group).For high-quality labels, we use the Intelligence foR Image Segmentation\cite{iris2019} (IRIS) active learning technology, combining human photo-interpretation and machine learning. For scribble, ground truth pixels were drawn using IRIS but without ML support. Finally, the no-annotation dataset is generated automatically, with manual annotation only in the clear image patch. A backup of the dataset in STAC format is available here: https://shorturl.at/cgjtz. Check out our website https://cloudsen12.github.io/ for examples.
S
CloudSEN12+: The largest collection of expert-labeled pixels for cloud and...
scidb.cn
producciocientifica.uv.es
Updated Apr 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cesar Aybar; Lesly Bautista; Julio Contreras; Fernando Prudencio; Daryl Ayala; David Montero; Jhomira Loja; Luis Ysuhuaylas; Fernando Herrera; Karen Gonzales; Jeanett Valladares; Lucy A. Flores; Evelin Mamani; Maria Quiñonez; Rai Fajardo; Wendy Espinoza; Antonio Limas; Roy Yali; Bram Willems; Raúl Loayza-Muro; Martín Leyva; Alejandro Alcántara; Gonzalo Mateo-García; Luis Gómez-Chova (2024). CloudSEN12+: The largest collection of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 [Dataset]. http://doi.org/10.57760/sciencedb.17702
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57760/sciencedb.17702
Dataset updated
Apr 25, 2024
Dataset provided by
Science Data Bank
Authors
Cesar Aybar; Lesly Bautista; Julio Contreras; Fernando Prudencio; Daryl Ayala; David Montero; Jhomira Loja; Luis Ysuhuaylas; Fernando Herrera; Karen Gonzales; Jeanett Valladares; Lucy A. Flores; Evelin Mamani; Maria Quiñonez; Rai Fajardo; Wendy Espinoza; Antonio Limas; Roy Yali; Bram Willems; Raúl Loayza-Muro; Martín Leyva; Alejandro Alcántara; Gonzalo Mateo-García; Luis Gómez-Chova
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Detecting and screening clouds is the first step in any optical remote sensing (RS) analysis. Cloud formation is diverse, presenting many shapes, thicknesses, and altitudes. This variety poses a significant challenge to developing effective cloud detection algorithms since most datasets shortfall an unbiased representation. To address this issue, we have built CloudSEN12+, a significant expansion of the CloudSEN12 dataset. This new dataset doubles the expert-labeled pixels, making it the largest cloud detection dataset for Sentinel-2 imagery up to date. We have carefully reviewed and refined previous human labels in this new release to ensure maximum trustworthiness. We hope CloudSEN12+ will be a valuable resource for the cloud detection research community.
h
cloudsen12
huggingface.co
Updated Sep 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The AI Alliance (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/aialliance/cloudsen12
Explore at:
Dataset updated
Sep 20, 2025
Dataset authored and provided by
The AI Alliance
Description
aialliance/cloudsen12 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
mlstac-demo
huggingface.co
Updated Jan 1, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jair Francisco Flores Farfan (2020). mlstac-demo [Dataset]. https://huggingface.co/datasets/jfloresf/mlstac-demo
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 1, 2020
Authors
Jair Francisco Flores Farfan
Description
cloudsen12

A dataset about clouds from Sentinel-2 CloudSEN12 is a LARGE dataset (~1 TB) for cloud semantic understanding that consists of 49,400 image patches (IP) that are evenly spread throughout all continents except Antarctica. Each IP covers 5090 x 5090 meters and contains data from Sentinel-2 levels 1C and 2A, hand-crafted annotations of thick and thin clouds and cloud shadows, Sentinel-1 Synthetic Aperture Radar (SAR), digital elevation model, surface water occurrence, land… See the full description on the dataset page: https://huggingface.co/datasets/jfloresf/mlstac-demo.
h
Copernicus-Bench
huggingface.co
Updated Mar 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yi Wang (2025). Copernicus-Bench [Dataset]. https://huggingface.co/datasets/wangyi111/Copernicus-Bench
Explore at:
Dataset updated
Mar 31, 2025
Authors
Yi Wang
Description
Dataset Card for Copernicus-Bench

A hierarchical ML benchmark for Copernicus Sentinels, with 15 datasets spread into three task levels covering all major Sentinel missions (S1,2,3,5P). (Officially named "Copernicus-Bench", initially named "SentinelBench")

Dataset Details

Level Name Modality Task

Images

Image Size

Classes

Source License

L1 Cloud-S2 S2 TOA segmentation (cloud) 1699/567/551 512x512x13 4 CloudSEN12 CC 0 1.0

L1 Cloud-S3 S3 OLCI… See the full description on the dataset page: https://huggingface.co/datasets/wangyi111/Copernicus-Bench.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

tacofoundation (2025). cloudsen12 [Dataset]. https://huggingface.co/datasets/tacofoundation/cloudsen12

cloudsen12

cloudsen12plus

tacofoundation/cloudsen12

Explore at:

84 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Jan 4, 2025

Dataset authored and provided by

tacofoundation

License

https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

Description

This dataset follows the TACO specification.

  cloudsen12plus

Website: https://cloudsen12.github.io/ version: 1.1.2 The largest dataset of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2 CloudSEN12+ version 1.1.0 is a significant extension of the CloudSEN12 dataset, which doubles the number of expert-reviewed labels, making it, by a large margin, the largest cloud detection dataset to date for Sentinel-2. All labels from the previous version have… See the full description on the dataset page: https://huggingface.co/datasets/tacofoundation/cloudsen12.

Clear search

Close search

Google apps

Main menu

cloudsen12

CloudSEN12-nolabel

CloudSEN12 - a global dataset for semantic understanding of cloud and cloud...

CloudSEN12+: The largest collection of expert-labeled pixels for cloud and...

cloudsen12

mlstac-demo

Copernicus-Bench

Images

Classes

cloudsen12

cloudsen12plus

tacofoundation/cloudsen12