100+ datasets found
  1. Comprehensive Soil Classification Datasets

    • kaggle.com
    zip
    Updated Jun 12, 2025
    Cite
    AI4A Lab (2025). Comprehensive Soil Classification Datasets [Dataset]. https://www.kaggle.com/datasets/ai4a-lab/comprehensive-soil-classification-datasets
    Available download formats: zip (514189522 bytes)
    Dataset updated
    Jun 12, 2025
    Dataset authored and provided by
    AI4A Lab
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Soil Classification Datasets

    Please cite the paper when using the dataset in a research study. See the paper link or the BibTeX entry provided below.

    This repository contains comprehensive datasets for soil classification and recognition research. The Original Dataset comprises soil images sourced from various online repositories, which have been meticulously cleaned and preprocessed to ensure data quality and consistency. To enhance the dataset's size and diversity, we employed Generative Adversarial Networks (GANs), specifically the CycleGAN architecture, to generate synthetic soil images. This augmented collection is referred to as the CyAUG Dataset. Both datasets are specifically designed to advance research in soil classification and recognition using state-of-the-art deep learning methodologies.

    This dataset was curated as part of the research study titled "An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations" by Farhan Sheth, Priya Mathur, Amit Kumar Gupta, and Sandeep Chaurasia, published in Engineering Applications of Artificial Intelligence.

    Links

    Application produced by this research is available at:

    Note: If you use any part of this project (dataset, code, or application), please cite the work as described in the Citation section below.

    Dataset

    Both datasets consist of images of 7 different soil types.

    The Soil Classification Dataset is structured to facilitate the classification of various soil types based on images. The dataset includes images of the following soil types:

    • Alluvial Soil
    • Black Soil
    • Laterite Soil
    • Red Soil
    • Yellow Soil
    • Arid Soil
    • Mountain Soil

    The dataset is organized into folders, each named after a specific soil type, containing images of that soil type. The images vary in resolution and quality, providing a diverse set of examples for training and testing classification models.
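
    A folder-per-class layout like this can be indexed with the standard library alone. The sketch below is illustrative, not part of the dataset's own tooling: the helper names `index_images` and `class_counts` are hypothetical, and it assumes each soil-type folder sits directly under one root directory.

```python
from collections import Counter
from pathlib import Path

# Assumption: images use the JPG/JPEG extensions stated in the description.
IMAGE_SUFFIXES = {".jpg", ".jpeg"}


def index_images(root):
    """Return (path, label) pairs, using each image's parent folder as its label."""
    samples = []
    for class_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        for img in sorted(class_dir.iterdir()):
            if img.suffix.lower() in IMAGE_SUFFIXES:
                samples.append((img, class_dir.name))
    return samples


def class_counts(samples):
    """Count how many images each soil-type folder contributes."""
    return Counter(label for _, label in samples)
```

    Checking `class_counts` before training is a quick way to spot class imbalance between the soil types.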

    Original Dataset Details

    • Total Images: 1189 images
    • Image Format: JPG/JPEG
    • Image Size: Varies
    • Source: Collected from various online repositories and cleaned for consistency.

    CyAUG Dataset Details

    • Total Images: 5097 images
    • Image Format: JPG/JPEG
    • Image Size: Varies
    • Source: Generated using CycleGAN to augment the original dataset, enhancing its size and diversity.

    Input and Output Parameters

    • Input Parameters:
      • Image: The images of the soils (JPG/JPEG format).
      • Label: The labels are in the format 'soil types' (folder names).
    • Output Parameter:
      • Classification: The predicted class (soil type) based on the input image.

    Citation

    If you use any of the derived datasets, please cite the following paper:

    @article{SHETH2025111425,
      title = {An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations},
      journal = {Engineering Applications of Artificial Intelligence},
      volume = {158},
      pages = {111425},
      year = {2025},
      issn = {0952-1976},
      doi = {https://doi.org/10.1016/j.engappai.2025.111425},
      url = {https://www.sciencedirect.com/science/article/pii/S0952197625014277},
      author = {Farhan Sheth and Priya Mathur and Amit Kumar Gupta and Sandeep Chaurasia},
      keywords = {Soil classification, Crop recommendation, Vision transformers, Convolutional neural network, Transfer learning, Fuzzy logic}
    }
    
  2. National Soils Database - Dataset - data.gov.ie

    • data.gov.ie
    Updated Jul 23, 2021
    + more versions
    Cite
    data.gov.ie (2021). National Soils Database - Dataset - data.gov.ie [Dataset]. https://data.gov.ie/dataset/national-soils-database
    Dataset updated
    Jul 23, 2021
    Dataset provided by
    data.gov.ie
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The National Soil Database has produced a national database of soil geochemistry including point and spatial distribution maps of major nutrients, major elements, essential trace elements, trace elements of special interest and minor elements. In addition, this study has generated a National Soil Archive, comprising bulk soil samples and a nucleic acids archive, each of which represents a valuable resource for future soils research in Ireland. The geographical coherence of the geochemical results was considered to be predominantly underpinned by underlying parent material and glacial geology. Other factors such as soil type, land use, anthropogenic effects and climatic effects were also evident. The coherence between elements, as displayed by multivariate analyses, was evident in this study. Examples included strong relationships between Co, Fe, As, Mn and Cu. This study applied large-scale microbiological analysis of soils for the first time in Ireland and in doing so also investigated microbial community structure in a range of soil types in order to determine the relationship between soil microbiology and chemistry. The results of the microbiological analyses were consistent with geochemical analyses and demonstrated that bacterial community populations appeared to be predominantly determined by soil parent material and soil type.

  3. Data from: A Global Soil Dataset for Earth System Modeling

    • cmr.earthdata.nasa.gov
    Updated Apr 21, 2017
    Cite
    (2017). A Global Soil Dataset for Earth System Modeling [Dataset]. https://cmr.earthdata.nasa.gov/search/concepts/C1214604044-SCIOPS.html
    Dataset updated
    Apr 21, 2017
    Time period covered
    Jan 1, 1970 - Present
    Area covered
    Earth
    Description

    We developed a comprehensive, gridded Global Soil Dataset for use in Earth System Models (GSDE) and other applications. GSDE provides soil information, including soil particle-size distribution, organic carbon, and nutrients, along with quality control information in terms of confidence level. GSDE is based on the Soil Map of the World and various regional and national soil databases, including soil attribute data and soil maps. We used a standardized data structure and data processing procedures to harmonize the data collected from various sources. We then used a soil type linkage method (i.e. taxotransfer rules) and the polygon linkage method to derive the spatial distribution of soil properties. To aggregate the attributes of different compositions of a mapping unit, we used three mapping approaches: the area-weighting method, the dominant soil type method and the dominant binned soil attribute method. In the released gridded dataset, we used the area-weighting method as it will meet the demands of most applications. The dataset can also be aggregated to a lower resolution. The resolution is 30 arc-seconds (about 1 km at the equator). The vertical variation of soil properties was captured by eight layers to a depth of 2.3 m (i.e. 0-0.045, 0.045-0.091, 0.091-0.166, 0.166-0.289, 0.289-0.493, 0.493-0.829, 0.829-1.383 and 1.383-2.296 m).
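
    To make the aggregation step concrete, here is a minimal sketch of area weighting next to the dominant-type alternative, plus the layer thicknesses implied by the depth bounds listed above. The function names and the `(area_fraction, value)` component format are assumptions for illustration; they are not taken from the GSDE processing code.

```python
# Layer boundaries in metres, copied from the description above.
LAYER_BOUNDS_M = [0.0, 0.045, 0.091, 0.166, 0.289, 0.493, 0.829, 1.383, 2.296]


def layer_thicknesses(bounds=LAYER_BOUNDS_M):
    """Thickness of each of the eight layers, in metres."""
    return [round(lower - upper, 3) for upper, lower in zip(bounds, bounds[1:])]


def area_weighted(components):
    """Area-weighted mean of an attribute.

    components: list of (area_fraction, value) pairs for one mapping unit
    (hypothetical input format).
    """
    total = sum(fraction for fraction, _ in components)
    return sum(fraction * value for fraction, value in components) / total


def dominant(components):
    """Value of the component that covers the largest area."""
    return max(components, key=lambda c: c[0])[1]
```

    For a mapping unit split 60/40 between components with values 10 and 20, area weighting gives a blended value while the dominant method simply returns the value of the larger component.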

  4. Soil Type

    • catalog.data.gov
    • datasets.ai
    • +4more
    Updated Feb 5, 2025
    + more versions
    Cite
    U.S. Department of Agriculture, Natural Resources Conservation Service (2025). Soil Type [Dataset]. https://catalog.data.gov/dataset/soil-type
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    United States Department of Agriculture (http://usda.gov/)
    Natural Resources Conservation Service (http://www.nrcs.usda.gov/)
    Description

    This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.

  5. Soil Data Grevena

    • kaggle.com
    • data.mendeley.com
    zip
    Updated Sep 4, 2023
    Cite
    Jocelyn Dumlao (2023). Soil Data Grevena [Dataset]. https://www.kaggle.com/datasets/jocelyndumlao/soil-data-grevena
    Available download formats: zip (108258 bytes)
    Dataset updated
    Sep 4, 2023
    Authors
    Jocelyn Dumlao
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Grevena
    Description

    In this dataset, there are soil data analyses with properties such as pH, organic matter (OM), salinity (EC), etc., major elements (N, P, K, Mg) as well as some microelements (Fe, Zn, Mn, Cu, B) with significant impact on plant nutrition.

    Categories

    Agricultural Soil

    Acknowledgements & Source

    Panagiotis Tziachris


  6. Soil Type Dataset

    • universe.roboflow.com
    zip
    Updated Mar 2, 2023
    Cite
    Traffulent (2023). Soil Type Dataset [Dataset]. https://universe.roboflow.com/traffulent/soil-type
    Available download formats: zip
    Dataset updated
    Mar 2, 2023
    Dataset authored and provided by
    Traffulent
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Soil
    Description

    Soil Type

    ## Overview
    
    Soil Type is a dataset for classification tasks - it contains Soil annotations for 158 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
    ## License
    
    This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
    
  7. NCSS Soil Characterization Database

    • catalog.data.gov
    • ngda-soils-geoplatform.hub.arcgis.com
    Updated Feb 15, 2026
    + more versions
    Cite
    Natural Resources Conservation Service (2026). NCSS Soil Characterization Database [Dataset]. https://catalog.data.gov/dataset/ncss-soil-characterization-database-d2772
    Dataset updated
    Feb 15, 2026
    Dataset provided by
    Natural Resources Conservation Service (http://www.nrcs.usda.gov/)
    Description

    The National Cooperative Soil Survey - Soil Characterization Database (NCSS-SCD) contains laboratory data for more than 65,000 locations (i.e. xy coordinates) throughout the United States and its Territories, and about 2,100 locations from other countries. It is a compilation of data from the Kellogg Soil Survey Laboratory (KSSL) and several cooperating laboratories. The data steward and distributor is the National Soil Survey Center (NSSC). Information contained within the database includes physical, chemical, biological, mineralogical, morphological, and mid infrared reflectance (MIR) soil measurements, as well as a collection of calculated values. The intended use of the data is to support interpretations related to soil use and management.

    Data Usage

    Access to the data is provided via the following user interfaces:

      1. Interactive Web Map
      2. Lab Data Mart (LDM) for querying data and generating reports
      3. Soil Data Access (SDA) web services for querying data
      4. Direct download of the entire database in several formats

    Data at each location include measurements at multiple depths (e.g. soil horizons). However, not all analyses have been conducted for each location and depth. Typically, a suite of measurements was collected based upon assumed or known conditions regarding the soil being analyzed. For example, soils of arid environments are routinely analyzed for salts and carbonates as part of the standard analysis suite. Standard morphological soil descriptions are available for about 60,000 of these locations. Mid-infrared (MIR) spectroscopy is available for about 7,000 locations. Soil fertility measurements, such as those made by Agricultural Experiment Stations, were not made. Most of the data were obtained over the last 40 years, with about 4,000 locations before 1960, 25,000 from 1960-1990, 27,000 from 1990-2010, and 13,000 from 2010 to 2021. Generally, the number of measurements recorded per location has increased over time.

    Typically, the data were collected to represent a soil series or map unit component concept. They may also have been sampled to determine the range of variation within a given landscape. Although strict quality-control measures are applied, the NSSC does not warrant that the data are error free. Also, in some cases the measurements are not within the applicability range of the laboratory methods. For example, dispersion of clay is incomplete in some soils by the standard method used for determining particle-size distribution. Soils producing incomplete dispersion include those that are derived from volcanic materials or that have a high content of iron oxides, gypsum, carbonates, or other cementing materials. Also note that determination of clay minerals by x-ray diffraction is relative. Measurements of very high or very low quantities by any method are not very precise. Other measurements have other limitations in some kinds of soils. Such data are retained in the database for research purposes. Also, some of the data were obtained from cooperating laboratories within the NCSS.

    The accuracy of the location coordinates has not been quantified but can be inferred from the precision of their decimal degrees and the presence of a map datum. Some older records may correspond to a county centroid. When the map datum is missing, it can be assumed that data prior to 1990 were recorded using NAD27 and data after 1995 using WGS84. For detailed information about methods used in the KSSL and other laboratories, refer to "Soil Survey Investigation Report No. 42". For information on the application of laboratory data, refer to "Soil Survey Investigation Report No. 45". If you are unfamiliar with any terms or methods, feel free to consult your NRCS State Soil Scientist.

    Terms of Use

    This dataset is not designed for use as a primary regulatory tool in permitting or siting decisions but may be used as a reference source. This is public information and may be interpreted by organizations, agencies, units of government, or others based on needs; however, they are responsible for the appropriate application. Federal, State, or local regulatory bodies are not to reassign to the Natural Resources Conservation Service or the National Cooperative Soil Survey any authority for the decisions that they make. The Natural Resources Conservation Service will not perform any evaluations of these data for purposes related solely to State or local regulatory programs.
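
    The fallback rule for records with a missing map datum (NAD27 before 1990, WGS84 after 1995) can be written as a small helper. This is only a sketch of that rule: the function name and arguments are hypothetical, and the description gives no guidance for 1990 through 1995, so the helper returns `None` for those years rather than guessing.

```python
def assumed_datum(year, recorded_datum=None):
    """Return the datum to assume for a record's coordinates.

    year: sampling year of the record (int) or None if unknown.
    recorded_datum: datum stored with the record, if any.
    """
    if recorded_datum:
        return recorded_datum  # a recorded datum always takes precedence
    if year is None:
        return None
    if year < 1990:
        return "NAD27"
    if year > 1995:
        return "WGS84"
    return None  # 1990-1995: the description states no rule
```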

  8. Soil properties dataset in the United States, Derived from 2020 gNATSGO...

    • data.usgs.gov
    • catalog.data.gov
    Updated Jun 30, 2024
    Cite
    Olena Boiko; Stefanie Kagone; Gabriel Senay (2024). Soil properties dataset in the United States, Derived from 2020 gNATSGO database [Dataset]. http://doi.org/10.5066/P9TI3IS8
    Dataset updated
    Jun 30, 2024
    Dataset provided by
    United States Geological Survey (http://www.usgs.gov/)
    Authors
    Olena Boiko; Stefanie Kagone; Gabriel Senay
    License

    U.S. Government Works: https://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    2020
    Area covered
    United States
    Description

    The dataset consists of three raster GeoTIFF files describing the following soil properties in the US: available water capacity, field capacity, and soil porosity. The input data were obtained from the gridded National Soil Survey Geographic (gNATSGO) Database and the Gridded Soil Survey Geographic (gSSURGO) Database with Soil Data Development tools provided by the Natural Resources Conservation Service. The soil characteristics derived from the databases were Available Water Capacity (AWC), Water Content (one-third bar) (WC), and Bulk Density (one-third bar) (BD) aggregated as weighted average values in the upper 1 m of soil. AWC and WC layers were converted to mm/m to express respectively available water capacity and field capacity in 1 m of soil, and BD layer was used to produce soil porosity raster assuming that the average particle density of soils is equal to 2.65 g/cm3. For each soil property, soil maps with CONUS, Alaska, and Hawaii geographic coverages were derived from s ...
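
    The porosity step follows the standard relation porosity = 1 - bulk density / particle density, with the particle density of 2.65 g/cm3 stated above. The sketch below applies it to a single value; the function names are hypothetical, and the unit conversion assumes the input water capacity is a volume fraction (cm of water per cm of soil), which the description does not confirm.

```python
PARTICLE_DENSITY_G_CM3 = 2.65  # average particle density assumed in the description


def porosity_from_bulk_density(bulk_density, particle_density=PARTICLE_DENSITY_G_CM3):
    """Soil porosity as a fraction (0-1) from bulk density in g/cm3."""
    if not 0 <= bulk_density <= particle_density:
        raise ValueError("bulk density must lie between 0 and the particle density")
    return 1.0 - bulk_density / particle_density


def fraction_to_mm_per_m(volume_fraction):
    """Convert a volumetric water fraction to mm of water per metre of soil.

    Assumption: the input is a fraction (cm/cm), so 1 m of soil holds
    fraction * 1000 mm of water.
    """
    return volume_fraction * 1000.0
```

    A bulk density of 1.325 g/cm3, for example, corresponds to a porosity of about 0.5 under this assumption.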

  9. Soil Survey Geographic (SSURGO) database for Santa Fe County, Area New...

    • catalog.data.gov
    • datasets.ai
    • +2more
    Updated Dec 2, 2020
    + more versions
    Cite
    U.S. Department of Agriculture, Natural Resources Conservation Service (Point of Contact) (2020). Soil Survey Geographic (SSURGO) database for Santa Fe County, Area New Mexico [Dataset]. https://catalog.data.gov/dataset/soil-survey-geographic-ssurgo-database-for-santa-fe-county-area-new-mexico
    Dataset updated
    Dec 2, 2020
    Dataset provided by
    Natural Resources Conservation Service (http://www.nrcs.usda.gov/)
    Area covered
    Santa Fe County, New Mexico
    Description

    This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.

  10. Global Soil Characteristics Dataset (1 Million)

    • kaggle.com
    zip
    Updated Apr 2, 2024
    Cite
    Hossam Hamouda (2024). Global Soil Characteristics Dataset (1 Million) [Dataset]. https://www.kaggle.com/datasets/hossam82/global-soil-characteristics-dataset-1-million
    Available download formats: zip (132222591 bytes)
    Dataset updated
    Apr 2, 2024
    Authors
    Hossam Hamouda
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Brief Description: This dataset contains 1 million simulated soil samples from various locations around the globe. Each sample includes data on soil texture, pH, organic matter content, moisture content, bulk density, nutrient levels (N, P, K), cation exchange capacity, electrical conductivity, color, porosity, and water holding capacity. Designed for environmental scientists, agronomists, and data scientists, this dataset is ideal for research, machine learning models, and educational purposes.

    Purpose: To provide a comprehensive soil dataset for environmental and agricultural research, including machine learning and data analysis applications.

    Data Collection Method: Simulated data generated using Python with realistic ranges and distributions based on common soil characteristics.

    Usage Examples

    Predictive modeling of soil properties.
    Classification of soil types based on texture and nutrient content.
    Analysis of soil health and fertility across different geographic locations.
    

    File Descriptions

    soil_data.csv - The main dataset file containing 1 million rows of soil data across 17 features.
    

    Data Fields

    Soil_ID: Unique identifier for each soil sample.
    Location_Latitude and Location_Longitude: Geographic coordinates of the soil sample.
    Depth_cm: Depth at which the soil sample was collected (cm).
    Texture: Soil texture classification (sandy, loamy, clayey).
    pH: Soil pH level.
    Organic_Matter_%: Percentage of organic matter in the soil.
    Moisture_Content_%: Soil moisture content percentage.
    Bulk_Density_g/cm³: Soil bulk density (g/cm³).
    Nitrogen_N_ppm, Phosphorus_P_ppm, Potassium_K_ppm: Nutrient levels in parts per million (ppm).
    Cation_Exchange_Capacity_meq/100g: Soil's ability to hold positively charged ions (meq/100g).
    Electrical_Conductivity_dS/m: Soil electrical conductivity (dS/m).
    Soil_Color: Color of the soil (brown, red, black, yellow).
    Porosity_%: Percentage of pore space in the soil.
    Water_Holding_Capacity_%: Soil's water holding capacity percentage.
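
    As a rough illustration of working with these fields, the sketch below groups pH by texture class using only the standard library. The column names `Soil_ID`, `Texture`, and `pH` come from the field list above; the three sample rows and the function name are made up for the example and are not taken from `soil_data.csv`.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical rows for illustration only; the real file has 17 columns.
SAMPLE = """Soil_ID,Texture,pH
1,sandy,6.0
2,loamy,7.0
3,sandy,7.0
"""


def mean_ph_by_texture(file_obj):
    """Average the pH column within each Texture class."""
    groups = defaultdict(list)
    for row in csv.DictReader(file_obj):
        groups[row["Texture"]].append(float(row["pH"]))
    return {texture: mean(values) for texture, values in groups.items()}


result = mean_ph_by_texture(io.StringIO(SAMPLE))
```

    For the real file, pass `open("soil_data.csv", newline="", encoding="utf-8")` instead of the in-memory sample; `csv.DictReader` streams rows, so the 1 million records do not need to fit in memory at once.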
    

    
  11. WISE - Global Soil Profile Data, version 3.1

    • data.moa.gov.et
    • search.dataone.org
    • +2more
    pdf, zip
    Updated Oct 25, 2023
    Cite
    FDRE - Ministry of Agriculture (MoA) (2023). WISE - Global Soil Profile Data, version 3.1 [Dataset]. https://data.moa.gov.et/dataset/wise-global-soil-profile-data-version-3-1
    Available download formats: pdf, zip
    Dataset updated
    Oct 25, 2023
    Dataset provided by
    FDRE - Ministry of Agriculture (MoA)
    Description

    Version 3.1 of the ISRIC-WISE database (WISE3) was compiled from a wide range of soil profile data collected by many soil professionals worldwide. All profiles have been harmonized with respect to the original Legend (1974) and Revised Legend (1988) of FAO-Unesco. Thereby, the primary soil data (and any secondary data derived from them) can be linked using GIS to the spatial units of the digitized Soil Map of the World as well as more recent digital Soil and Terrain (SOTER) databases through the soil legend code.

    WISE3 holds selected attribute data for some 10,250 soil profiles, with some 47,800 horizons, from 149 countries. Individual profiles have been sampled, described, and analyzed according to methods and standards in use in the originating countries. There is no uniform set of properties for which all profiles have analytical data, generally because only selected measurements were planned during the original surveys. Methods used for laboratory determinations of specific soil properties vary between laboratories and over time; sometimes, results for the same property cannot be compared directly. WISE3 will inevitably include gaps, being a compilation of legacy soil data derived from traditional soil survey, which can be of a taxonomic, geographic, and soil analytical nature. As a result, the amount of data available for modelling is sometimes much less than expected. Adroit use of the data, however, will permit a wide range of agricultural and environmental applications at a global and continental scale (1:500 000 and broader).

    Preferred citation: Batjes NH 2009. Harmonized soil profile data for applications at global and continental scales: updates to the WISE database. Soil Use and Management 25:124–127, http://dx.doi.org/10.1111/j.1475-2743.2009.00202.x

  12. ISLSCP II Global Gridded Soil Characteristics - Dataset - NASA Open Data...

    • data.nasa.gov
    Updated Apr 1, 2025
    Cite
    nasa.gov (2025). ISLSCP II Global Gridded Soil Characteristics - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/islscp-ii-global-gridded-soil-characteristics-64a5d
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASA (http://nasa.gov/)
    Description

    This data set provides gridded data for selected soil parameters derived from data and methods developed by the Global Soil Data Task, an international collaborative project with the objective of making accurate and appropriate data relating to soil properties accessible to the global change research community. The task was coordinated by the International Geosphere-Biosphere Programme (IGBP-DIS). The data in this data set were produced by the International Satellite Land-Surface Climatology Project, Initiative II (ISLSCP II) staff from data obtained from the Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC, http://daac.ornl.gov/). See the related data sets section below. Two-dimensional gridded maps of selected soil parameters, including soil texture, at a 1.0 by 1.0 degree spatial resolution and for two soil depths are provided. All data layers have been adjusted to match the ISLSCP II land/water mask. There are 36 data files with this data set.

  13. VT Data - NRCS Soil Survey Units

    • geodata.vermont.gov
    • geodata1-59998-vcgi.opendata.arcgis.com
    • +3more
    Updated Oct 1, 2022
    + more versions
    Cite
    VT Center for Geographic Information (2022). VT Data - NRCS Soil Survey Units [Dataset]. https://geodata.vermont.gov/datasets/vt-data-nrcs-soil-survey-units
    Dataset updated
    Oct 1, 2022
    Dataset authored and provided by
    VT Center for Geographic Information
    Description

    This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties. Survey Dates - https://www.nrcs.usda.gov/wps/portal/nrcs/surveylist/soils/survey/state/?stateId=VT

  14. Global Soil Profile Data (ISRIC-WISE)

    • earthdata.nasa.gov
    • search.dataone.org
    • +2more
    Updated Sep 5, 2000
    + more versions
    Cite
    ORNL_CLOUD (2000). Global Soil Profile Data (ISRIC-WISE) [Dataset]. http://doi.org/10.3334/ORNLDAAC/547
    Dataset updated
    Sep 5, 2000
    Dataset authored and provided by
    ORNL_CLOUD
    Description

    The International Soil Reference and Information Centre-World Inventory of Soil Emission Potentials (ISRIC-WISE) international soil profile data set consists of a homogenized, global set of 1,125 soil profiles for use by global modelers. These profiles provided the basis for the Global Pedon Database (GPDB) of the International Geosphere-Biosphere Programme (IGBP) - Data and Information System (DIS). The data set consists of a selection of 665 profiles originating from the Natural Resources Conservation Service (NRCS, Lincoln), 250 profiles obtained from the Food and Agriculture Organization (FAO, Rome), and 210 profiles from the reference collection of the International Soil Reference and Information Centre (ISRIC, Wageningen). All profiles are georeferenced and classified according to the 1974 Legend of the FAO-UNESCO Soil Map of the World (FAO-UNESCO, 1974), as well as the 1988 Revised Legend of FAO-UNESCO (FAO, 1990). The data set includes information on soil classification, site data, soil horizon data, source of data, and methods used for determining analytical data. The data files are in a comma-delimited format.

    Data Citation: The data set should be cited as follows: Batjes, N. H. (ed). 2000. Global Soil Profile Data (ISRIC-WISE). Available on-line from the ORNL Distributed Active Archive Center, Oak Ridge National Laboratory, Oak Ridge, Tennessee, U.S.A.

  15. Indonesia Soil Type

    • data.globalforestwatch.org
    • data.amerigeoss.org
    Updated Jun 1, 2018
    Cite
    Global Forest Watch (2018). Indonesia Soil Type [Dataset]. https://data.globalforestwatch.org/documents/7945178fad3f4deeb51785d1e2df67bf
    Explore at:
    Dataset updated
    Jun 1, 2018
    Dataset authored and provided by
    Global Forest Watch (http://www.globalforestwatch.org/)
    Area covered
    Indonesia
    Description

    This layer shows soil type, based on a classification derived from the 'SL_ORDER' field of Kalimantan RePPProT data (1990, 1:250,000 scale). The data was provided and processed by Daemeter Consulting. Soil categories from RePPProT were then reclassified by the World Resources Institute according to the FAO Digital Soil Map of the World, for use in the Suitability Mapper (2012). The FAO data is available at http://www.fao.org/geonetwork/srv/en/metadata.show?id=14116. The data is separated into categories: Inceptisol; Oxisol; Alfisol; Ultisol; Spodosol; Entisol; Histosol.

  16. Soils All Soils

    • ct-deep-gis-open-data-website-ctdeep.hub.arcgis.com
    • data.ct.gov
    • +4 more
    Updated Dec 14, 2023
    + more versions
    Cite
    Department of Energy & Environmental Protection (2023). Soils All Soils [Dataset]. https://ct-deep-gis-open-data-website-ctdeep.hub.arcgis.com/datasets/CTDEEP::soils-all-soils/about
    Explore at:
    Dataset updated
    Dec 14, 2023
    Dataset authored and provided by
    Department of Energy & Environmental Protection
    Area covered
    Description

    This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.

  17. Global Soil Types, 0.5-Degree Grid (Modified Zobler)

    • earthdata.nasa.gov
    Updated May 19, 2000
    + more versions
    Cite
    ORNL_CLOUD (2000). Global Soil Types, 0.5-Degree Grid (Modified Zobler) [Dataset]. http://doi.org/10.3334/ORNLDAAC/540
    Explore at:
    Dataset updated
    May 19, 2000
    Dataset authored and provided by
    ORNL_CLOUD
    Description

    A global data set of soil types is available at 0.5-degree latitude by 0.5-degree longitude resolution. There are 106 soil units, based on Zobler's (1986) assessment of the FAO/UNESCO Soil Map of the World. This data set is a conversion of the Zobler 1-degree resolution version to a 0.5-degree resolution. The resolution of the data set was not actually increased. Rather, the 1-degree squares were divided into four 0.5-degree squares with the necessary adjustment of continental boundaries and islands. The computer code used to convert the original 1-degree data to 0.5-degree is provided as a companion file. A JPG image of the data is provided in this document. The Zobler data (1-degree resolution) as distributed by Webb et al. (1993) http://www.ngdc.noaa.gov/seg/eco/cdroms/gedii_a/datasets/a12/wr.htm#top contains two columns, one column for continent and one column for soil type. The Soil Map of the World consists of 9 maps that represent parts of the world. The texture data that Webb et al. (1993) provided allowed for the fact that a soil type in one part of the world may have different properties than the same soil in a different part of the world. This continent-specific information is retained in this 0.5-degree resolution data set, as well as the soil type information which is the second column. A code was written (one2half.c) to take the file CONTIZOB.LER distributed by Webb et al. (1993) http://www.ngdc.noaa.gov/seg/eco/cdroms/gedii_a/datasets/a12/wr.htm#top and simply divide the 1-degree cells into quarters. This code also reads in a land/water file (land.wave) that specifies the cells that are land at 0.5 degrees. The code checks for consistency between the newly quartered map and the land/w...
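The quartering step described above can be illustrated with a minimal Python sketch. It is not the companion one2half.c program: the function name quarter_grid and the nested-list grid layout are assumptions made for illustration, and the adjustment against the land/water file is not reproduced here.

```python
def quarter_grid(grid):
    """Split every coarse cell into a 2x2 block of finer cells.

    `grid` is a list of rows (hypothetical layout); each value is a
    soil-type code. The values are copied, not interpolated, so the
    information content of the data does not increase.
    """
    fine = []
    for row in grid:
        # Repeat each value twice along the longitude axis.
        doubled = [code for code in row for _ in range(2)]
        # Emit the doubled row twice to cover both latitude halves.
        fine.append(doubled)
        fine.append(list(doubled))
    return fine


# Tiny made-up 2x2 grid of soil codes, for illustration only.
coarse = [[1, 2],
          [3, 4]]
for fine_row in quarter_grid(coarse):
    print(fine_row)
```

Because each value is only repeated, the finer grid has four times as many cells but carries the same information, which matches the statement that the resolution was not actually increased.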

  18. Soils (soil type) - Dataset - data.sa.gov.au

    • data.sa.gov.au
    Updated Jun 28, 2016
    + more versions
    Cite
    (2016). Soils (soil type) - Dataset - data.sa.gov.au [Dataset]. https://data.sa.gov.au/data/dataset/soil-type
    Explore at:
    Dataset updated
    Jun 28, 2016
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    South Australia
    Description

    Sixty-one soils (soil types) represent the range of soils found across South Australia’s agricultural lands. Mapping shows the most common soil within each map unit, while more detailed proportion data are supplied for calculating the respective areas of each soil type (spatial data statistics).

  19. Soil Survey Geographic (SSURGO) database for Ute Mountain Area, Colorado and...

    • catalog.data.gov
    • gimi9.com
    Updated Dec 2, 2020
    + more versions
    Cite
    U.S. Department of Agriculture, Natural Resources Conservation Service (Point of Contact) (2020). Soil Survey Geographic (SSURGO) database for Ute Mountain Area, Colorado and New Mexico [Dataset]. https://catalog.data.gov/dataset/soil-survey-geographic-ssurgo-database-for-ute-mountain-area-colorado-and-new-mexico
    Explore at:
    Dataset updated
    Dec 2, 2020
    Dataset provided by
    United States Department of Agriculture (http://usda.gov/)
    Natural Resources Conservation Service (http://www.nrcs.usda.gov/)
    Area covered
    Ute Mountain, Colorado, New Mexico
    Description

    This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.

  20. Soil Use - Hydric Soils database

    • agdatacommons.nal.usda.gov
    bin
    Updated Nov 21, 2025
    Cite
    USDA Natural Resources Conservation Service (2025). Soil Use - Hydric Soils database [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/Soil_Use_-_Hydric_Soils_database/25212176
    Explore at:
    Available download formats: bin
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    United States Department of Agriculture (http://usda.gov/)
    Authors
    USDA Natural Resources Conservation Service
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Hydric soils are defined as those soils that are sufficiently wet in the upper part to develop anaerobic conditions during the growing season. The Hydric Soils section presents the most current information about hydric soils. The lists of hydric soils were created by using National Soil Information System (NASIS) database selection criteria that were developed by the National Technical Committee for Hydric Soils. These criteria are selected soil properties that are documented in Soil Taxonomy (Soil Survey Staff, 1999) and were designed primarily to generate a list of potentially hydric soils from the National Soil Information System (NASIS) database. It updates information that was previously published in Hydric Soils of the United States and coordinates it with information that has been published in the Federal Register. It also includes the most recent set of field indicators of hydric soils. The database selection criteria are selected soil properties that are documented in Soil Taxonomy and were designed primarily to generate a list of potentially hydric soils from soil survey databases. Only criteria 1, 3, and 4 can be used in the field to determine hydric soils; however, proof of anaerobic conditions must also be obtained for criteria 1, 3, and 4 either through data or best professional judgment (from Tech Note 1). The primary purpose of these selection criteria is to generate a list of soil map unit components that are likely to meet the hydric soil definition. Caution must be used when comparing the list of hydric components to soil survey maps. Many of the soils on the list have ranges in water table depths that allow the soil component to range from hydric to nonhydric depending on the location of the soil within the landscape as described in the map unit. 
    Lists of hydric soils along with soil survey maps are good off-site ancillary tools to assist in wetland determinations, but they are not a substitute for observations made during on-site investigations. The list of field indicators of hydric soils: The field indicators are morphological properties known to be associated with soils that meet the definition of a hydric soil. Presence of one or more field indicators suggests that the processes associated with hydric soil formation have taken place on the site being observed. The field indicators are essential for hydric soil identification because once formed, they persist in the soil during both wet and dry seasonal periods. The Hydric Soil Technical Notes: Contain National Technical Committee for Hydric Soils (NTCHS) updates, insights, standards, and clarifications. Users can query the database by State or by Soil Survey Area. Resources in this dataset: Resource Title: Website Pointer to Hydric Soils. File Name: Web Page, url: https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/use/hydric/ Includes description of Criteria, Query by State or Soil Survey Area, National Technical Committee for Hydric Soils, Technical Notes, and Related Links. Report Metadata:

    • Area_Symbol: A symbol that uniquely identifies a single occurrence of a particular type of area (e.g. Dane Co., Wisconsin is WI025).
    • Area_Name: The name given to the specified geographic area.
    • mukey: A non-connotative string of characters used to uniquely identify a record in the Mapunit table.
    • Mapunit_SYM: The symbol used to uniquely identify the soil mapunit in the soil survey.
    • Mapunit_Name: Correlated name of the mapunit (recommended name or field name for surveys in progress).
    • Comp_Name_phase: Component name - Name assigned to a component based on its range of properties. Local Phase - Phase criterion to be used at a local level, in conjunction with "component name" to help identify a soil component.
    • muacres: The number of acres of a particular mapunit.
    • Comp_RV_Pct: The percentage of the component of the mapunit.
    • majcompflag: Indicates whether or not a component is a major component in the mapunit.
    • Comp_Acres: The number of acres of a particular component in a mapunit. ((muacres*comppct_r)/100)
    • Comp_Landform: A word or group of words used to name a feature on the earth's surface, expressed in the plural form.
    • Hydric_Rating: A yes/no field that indicates whether or not a map unit component is classified as a "hydric soil". If rated as hydric, the specific criteria met are listed in the Component Hydric Criteria table.
    • Hydric_criteria: Criterion code for the soil characteristic(s) and/or feature(s) that cause the map unit component to be classified as a "hydric soil." These codes are the paragraph numbers in the hydric soil criteria publication.
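The Comp_Acres formula listed above, (muacres*comppct_r)/100, can be worked through in a short Python sketch. Several things here are assumptions for illustration: that the Comp_RV_Pct column holds the comppct_r value, that Hydric_Rating is stored as the strings "Yes" and "No", and that rows arrive as dictionaries. The function names and the sample rows are made up and are not taken from the database.

```python
def component_acres(muacres, comppct_r):
    """Acres of one component, following (muacres * comppct_r) / 100."""
    return (muacres * comppct_r) / 100


def hydric_acres(rows):
    """Sum component acres over rows rated as hydric.

    Assumes each row is a dict keyed by the report column names and that
    Hydric_Rating holds "Yes" or "No" (an assumption, not confirmed by
    the source).
    """
    return sum(
        component_acres(row["muacres"], row["Comp_RV_Pct"])
        for row in rows
        if row["Hydric_Rating"].strip().lower() == "yes"
    )


# Made-up rows for illustration only.
rows = [
    {"mukey": "100001", "muacres": 200.0, "Comp_RV_Pct": 25, "Hydric_Rating": "Yes"},
    {"mukey": "100001", "muacres": 200.0, "Comp_RV_Pct": 75, "Hydric_Rating": "No"},
    {"mukey": "100002", "muacres": 80.0, "Comp_RV_Pct": 50, "Hydric_Rating": "Yes"},
]

print(hydric_acres(rows))
```

As the description cautions, a component on the list can range from hydric to nonhydric depending on landscape position, so a total computed this way is at best a rough off-site estimate.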

    Criteria:

    1. All Histels except Folistels and Histosols except Folists; or
    2. Map unit components in Aquic suborders, great groups, or subgroups, Albolls suborder, Historthels great group, Histoturbels great group, or Andic, Cumulic, Pachic, or Vitrandic subgroups that: a. Based on the range of characteristics for the soil series, will at least in part meet one or more Field Indicators of Hydric Soils in the United States, or b. Show evidence that the soil meets the definition of a hydric soil;
    3. Map unit components that are frequently ponded for long duration or very long duration during the growing season that: a. Based on the range of characteristics for the soil series, will at least in part meet one or more Field Indicators of Hydric Soils in the United States, or b. Show evidence that the soil meets the definition of a hydric soil; or
    4. Map unit components that are frequently flooded for long duration or very long duration during the growing season that: a. Based on the range of characteristics for the soil series, will at least in part meet one or more Field Indicators of Hydric Soils in the United States, or b. Show evidence that the soils meet the definition of a hydric soil.
Cite
AI4A Lab (2025). Comprehensive Soil Classification Datasets [Dataset]. https://www.kaggle.com/datasets/ai4a-lab/comprehensive-soil-classification-datasets

Comprehensive Soil Classification Datasets

🌾Advanced Soil Classification: Original (1K+) & GAN-Augmented (5K+) Datasets

Explore at:
Available download formats: zip (514189522 bytes)
Dataset updated
Jun 12, 2025
Dataset authored and provided by
AI4A Lab
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

Soil Classification Datasets

Please cite the paper when using the dataset in a research study. Refer to the paper link or BibTeX provided below.

This repository contains comprehensive datasets for soil classification and recognition research. The Original Dataset comprises soil images sourced from various online repositories, which have been meticulously cleaned and preprocessed to ensure data quality and consistency. To enhance the dataset's size and diversity, we employed Generative Adversarial Networks (GANs), specifically the CycleGAN architecture, to generate synthetic soil images. This augmented collection is referred to as the CyAUG Dataset. Both datasets are specifically designed to advance research in soil classification and recognition using state-of-the-art deep learning methodologies.

This dataset was curated as part of the research study titled "An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations" by Farhan Sheth, Priya Mathur, Amit Kumar Gupta, and Sandeep Chaurasia, published in Engineering Applications of Artificial Intelligence.

Links

Application produced by this research is available at:

Note: If you use any part of this project (dataset, code, or application), please cite the work as described in the Citation section below.

Dataset

Both datasets consist of images of 7 different soil types.

The Soil Classification Dataset is structured to facilitate the classification of various soil types based on images. The dataset includes images of the following soil types:

  • Alluvial Soil
  • Black Soil
  • Laterite Soil
  • Red Soil
  • Yellow Soil
  • Arid Soil
  • Mountain Soil

The dataset is organized into folders, each named after a specific soil type, containing images of that soil type. The images vary in resolution and quality, providing a diverse set of examples for training and testing classification models.
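A folder-per-class layout like this can be indexed with a few lines of standard-library Python. This is only a sketch: the function name list_labeled_images and the root path are hypothetical, and the exact folder names inside the archive are not confirmed by the description.

```python
from pathlib import Path

# The description lists JPG/JPEG as the image format.
IMAGE_SUFFIXES = {".jpg", ".jpeg"}


def list_labeled_images(root):
    """Return (image_path, label) pairs, using each folder name as the label."""
    samples = []
    class_dirs = sorted(p for p in Path(root).iterdir() if p.is_dir())
    for class_dir in class_dirs:
        for image_path in sorted(class_dir.iterdir()):
            # Skip anything that is not a JPG/JPEG file.
            if image_path.is_file() and image_path.suffix.lower() in IMAGE_SUFFIXES:
                samples.append((image_path, class_dir.name))
    return samples


if __name__ == "__main__":
    # Hypothetical extraction directory; adjust to where the zip was unpacked.
    root = Path("soil_dataset")
    if root.is_dir():
        samples = list_labeled_images(root)
        print(len(samples), "images found")
```

Since the images vary in resolution, a training pipeline would typically resize them to a common size after loading; that step is left out here.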

Original Dataset Details

  • Total Images: 1189 images
  • Image Format: JPG/JPEG
  • Image Size: Varies
  • Source: Collected from various online repositories and cleaned for consistency.

CyAUG Dataset Details

  • Total Images: 5097 images
  • Image Format: JPG/JPEG
  • Image Size: Varies
  • Source: Generated using CycleGAN to augment the original dataset, enhancing its size and diversity.

Input and Output Parameters

  • Input Parameters:
    • Image: The images of the soils (JPG/JPEG format).
    • Label: The labels are in the format 'soil types' (folder names).
  • Output Parameter:
    • Classification: The predicted class (soil type) based on the input image.

Citation

If you use any of the derived datasets, please cite the following paper:

@article{SHETH2025111425,
  title = {An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations},
  journal = {Engineering Applications of Artificial Intelligence},
  volume = {158},
  pages = {111425},
  year = {2025},
  issn = {0952-1976},
  doi = {https://doi.org/10.1016/j.engappai.2025.111425},
  url = {https://www.sciencedirect.com/science/article/pii/S0952197625014277},
  author = {Farhan Sheth and Priya Mathur and Amit Kumar Gupta and Sandeep Chaurasia},
  keywords = {Soil classification, Crop recommendation, Vision transformers, Convolutional neural network, Transfer learning, Fuzzy logic}
}