5 datasets found
  1. h

    Core-AlphaEarth-Embeddings

    • huggingface.co
    Updated Jul 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Major TOM (2025). Core-AlphaEarth-Embeddings [Dataset]. https://huggingface.co/datasets/Major-TOM/Core-AlphaEarth-Embeddings
    Explore at:
    Dataset updated
    Jul 29, 2025
    Dataset authored and provided by
    Major TOM
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Major TOM Core AlphaEarth Embeddings Subset

    This is a prototype dataset. It only includes some of the AlphaEarth embeddings stored in Major TOM grid cells. This dataset is mostly aimed at experimentation and prototyping. It is particularly useful to use it along other datasets published within the Major TOM project.

      Content
    

    Field Type Description

    grid_cell string Major TOM cell

    year int year of the sample

    thumbnail image 3-dimensional PCA… See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-AlphaEarth-Embeddings.

  2. o

    Google Satellite Embedding V1

    • registry.opendata.aws
    Updated Nov 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Source Cooperative (2025). Google Satellite Embedding V1 [Dataset]. https://registry.opendata.aws/aef-source/
    Explore at:
    Dataset updated
    Nov 18, 2025
    Dataset provided by
    <a href="https://source.coop/">Source Cooperative</a>
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    COG (Cloud-Optimized GeoTIFF) files that together contain the AlphaEarth Foundations annual Satellite Embedding dataset. It contains the annual embeddings for the years from 2018 to 2024, inclusive.

  3. h

    Google-Alpha-Earth-MD-data

    • huggingface.co
    Updated Feb 19, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State of Maryland (2026). Google-Alpha-Earth-MD-data [Dataset]. https://huggingface.co/datasets/stateofmaryland/Google-Alpha-Earth-MD-data
    Explore at:
    Dataset updated
    Feb 19, 2026
    Dataset authored and provided by
    State of Maryland
    Description

    stateofmaryland/Google-Alpha-Earth-MD-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    Google-Alpha-Earth-MD

    • huggingface.co
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State of Maryland (2025). Google-Alpha-Earth-MD [Dataset]. https://huggingface.co/datasets/stateofmaryland/Google-Alpha-Earth-MD
    Explore at:
    Dataset updated
    Nov 24, 2025
    Dataset authored and provided by
    State of Maryland
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    stateofmaryland/Google-Alpha-Earth-MD dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. Data from: Sentinel2GlobalLULC: A dataset of Sentinel-2 georeferenced RGB...

    • zenodo.org
    • observatorio-cientifico.ua.es
    • +2more
    text/x-python, zip
    Updated Apr 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yassir Benhammou; Yassir Benhammou; Domingo Alcaraz-Segura; Domingo Alcaraz-Segura; Emilio Guirado; Emilio Guirado; Rohaifa Khaldi; Rohaifa Khaldi; Siham Tabik; Siham Tabik (2025). Sentinel2GlobalLULC: A dataset of Sentinel-2 georeferenced RGB imagery annotated for global land use/land cover mapping with deep learning (License CC BY 4.0) [Dataset]. http://doi.org/10.5281/zenodo.6941662
    Explore at:
    zip, text/x-pythonAvailable download formats
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Yassir Benhammou; Yassir Benhammou; Domingo Alcaraz-Segura; Domingo Alcaraz-Segura; Emilio Guirado; Emilio Guirado; Rohaifa Khaldi; Rohaifa Khaldi; Siham Tabik; Siham Tabik
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sentinel2GlobalLULC is a deep learning-ready dataset of RGB images from the Sentinel-2 satellites designed for global land use and land cover (LULC) mapping. Sentinel2GlobalLULC v2.1 contains 194,877 images in GeoTiff and JPEG format corresponding to 29 broad LULC classes. Each image has 224 x 224 pixels at 10 m spatial resolution and was produced by assigning the 25th percentile of all available observations in the Sentinel-2 collection between June 2015 and October 2020 in order to remove atmospheric effects (i.e., clouds, aerosols, shadows, snow, etc.). A spatial purity value was assigned to each image based on the consensus across 15 different global LULC products available in Google Earth Engine (GEE).

    Our dataset is structured into 3 main zip-compressed folders, an Excel file with a dictionary for class names and descriptive statistics per LULC class, and a python script to convert RGB GeoTiff images into JPEG format. The first folder called "Sentinel2LULC_GeoTiff.zip" contains 29 zip-compressed subfolders where each one corresponds to a specific LULC class with hundreds to thousands of GeoTiff Sentinel-2 RGB images. The second folder called "Sentinel2LULC_JPEG.zip" contains 29 zip-compressed subfolders with a JPEG formatted version of the same images provided in the first main folder. The third folder called "Sentinel2LULC_CSV.zip" includes 29 zip-compressed CSV files with as many rows as provided images and with 12 columns containing the following metadata (this same metadata is provided in the image filenames):

    • Land Cover Class ID: is the identification number of each LULC class
    • Land Cover Class Short Name: is the short name of each LULC class
    • Image ID: is the identification number of each image within its corresponding LULC class
    • Pixel purity Value: is the spatial purity of each pixel for its corresponding LULC class calculated as the spatial consensus across up to 15 land-cover products
    • GHM Value: is the spatial average of the Global Human Modification index (gHM) for each image
    • Latitude: is the latitude of the center point of each image
    • Longitude: is the longitude of the center point of each image
    • Country Code: is the Alpha-2 country code of each image as described in the ISO 3166 international standard. To understand the country codes, we recommend the user to visit the following website where they present the Alpha-2 code for each country as described in the ISO 3166 international standard:https: //www.iban.com/country-codes
    • Administrative Department Level1: is the administrative level 1 name to which each image belongs
    • Administrative Department Level2: is the administrative level 2 name to which each image belongs
    • Locality: is the name of the locality to which each image belongs
    • Number of S2 images : is the number of found instances in the corresponding Sentinel-2 image collection between June 2015 and October 2020, when compositing and exporting its corresponding image tile

    For seven LULC classes, we could not export from GEE all images that fulfilled a spatial purity of 100% since there were millions of them. In this case, we exported a stratified random sample of 14,000 images and provided an additional CSV file with the images actually contained in our dataset. That is, for these seven LULC classes, we provide these 2 CSV files:

    • A CSV file that contains all exported images for this class
    • A CSV file that contains all images available for this class at spatial purity of 100%, both the ones exported and the ones not exported, in case the user wants to export them. These CSV filenames end with "including_non_downloaded_images".

    To clearly state the geographical coverage of images available in this dataset, we included in the version v2.1, a compressed folder called "Geographic_Representativeness.zip". This zip-compressed folder contains a csv file for each LULC class that provides the complete list of countries represented in that class. Each csv file has two columns, the first one gives the country code and the second one gives the number of images provided in that country for that LULC class. In addition to these 29 csv files, we provided another csv file that maps each ISO Alpha-2 country code to its original full country name.

    © Sentinel2GlobalLULC Dataset by Yassir Benhammou, Domingo Alcaraz-Segura, Emilio Guirado, Rohaifa Khaldi, Boujemâa Achchab, Francisco Herrera & Siham Tabik is marked with Attribution 4.0 International (CC-BY 4.0)

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Major TOM (2025). Core-AlphaEarth-Embeddings [Dataset]. https://huggingface.co/datasets/Major-TOM/Core-AlphaEarth-Embeddings

Core-AlphaEarth-Embeddings

Major-TOM/Core-AlphaEarth-Embeddings

Explore at:
Dataset updated
Jul 29, 2025
Dataset authored and provided by
Major TOM
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Major TOM Core AlphaEarth Embeddings Subset

This is a prototype dataset. It only includes some of the AlphaEarth embeddings stored in Major TOM grid cells. This dataset is mostly aimed at experimentation and prototyping. It is particularly useful to use it along other datasets published within the Major TOM project.

  Content

Field Type Description

grid_cell string Major TOM cell

year int year of the sample

thumbnail image 3-dimensional PCA… See the full description on the dataset page: https://huggingface.co/datasets/Major-TOM/Core-AlphaEarth-Embeddings.

Search
Clear search
Close search
Google apps
Main menu