Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Please ensure to cite the paper when utilizing the dataset in a research study. Refer to the paper link or BibTeX provided below.
This repository contains comprehensive datasets for soil classification and recognition research. The Original Dataset comprises soil images sourced from various online repositories, which have been meticulously cleaned and preprocessed to ensure data quality and consistency. To enhance the dataset's size and diversity, we employed Generative Adversarial Networks (GANs), specifically the CycleGAN architecture, to generate synthetic soil images. This augmented collection is referred to as the CyAUG Dataset. Both datasets are specifically designed to advance research in soil classification and recognition using state-of-the-art deep learning methodologies.
This dataset was curated as part of the research study titled "An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations" by Farhan Sheth, Priya Mathur, Amit Kumar Gupta, and Sandeep Chaurasia, published in Engineering Applications of Artificial Intelligence.
Application produced by this research is available at:
Note: If you are using any part of this project; dataset, code, application, then please cite the work as mentioned in the Citation section below.
Both dataset consists of images of 7 different soil types.
The Soil Classification Dataset is structured to facilitate the classification of various soil types based on images. The dataset includes images of the following soil types:
The dataset is organized into folders, each named after a specific soil type, containing images of that soil type. The images vary in resolution and quality, providing a diverse set of examples for training and testing classification models.
If you are using any of the derived dataset, please cite the following paper:
@article{SHETH2025111425,
title = {An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations},
journal = {Engineering Applications of Artificial Intelligence},
volume = {158},
pages = {111425},
year = {2025},
issn = {0952-1976},
doi = {https://doi.org/10.1016/j.engappai.2025.111425},
url = {https://www.sciencedirect.com/science/article/pii/S0952197625014277},
author = {Farhan Sheth and Priya Mathur and Amit Kumar Gupta and Sandeep Chaurasia},
keywords = {Soil classification, Crop recommendation, Vision transformers, Convolutional neural network, Transfer learning, Fuzzy logic}
}
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The National Soil Database has produced a national database of soil geochemistry including point and spatial distribution maps of major nutrients, major elements, essential trace elements, trace elements of special interest and minor elements. In addition, this study has generated a National Soil Archive, comprising bulk soil samples and a nucleic acids archive each of which represent a valuable resource for future soils research in Ireland. The geographical coherence of the geochemical results was considered to be predominantly underpinned by underlying parent material and glacial geology. Other factors such as soil type, land use, anthropogenic effects and climatic effects were also evident. The coherence between elements, as displayed by multivariate analyses, was evident in this study. Examples included strong relationships between Co, Fe, As, Mn and Cu. This study applied large-scale microbiological analysis of soils for the first time in Ireland and in doing so also investigated microbial community structure in a range of soil types in order to determine the relationship between soil microbiology and chemistry. The results of the microbiological analyses were consistent with geochemical analyses and demonstrated that bacterial community populations appeared to be predominantly determined by soil parent material and soil type. .hidden { display: none }
Facebook
TwitterWe developed a comprehensive, gridded Global Soil Dataset for use in Earth System Models (GSDE) and other applications as well. GSDE provides soil information including soil particle-size distribution, organic carbon, and nutrients, etc. and quality control information in terms of confidence level. GSDE is based on the Soil Map of the World and various regional and national soil databases, including soil attribute data and soil maps. We used a standardized data structure and data processing procedures to harmonize the data collected from various sources. We then used a soil type linkage method (i.e. taxotransfer rules) and the polygon linkage method to derive the spatial distribution of soil properties. To aggregate the attributes of different compositions of a mapping unit, we used three mapping approaches: area-weighting method, the dominant soil type method and the dominant binned soil attribute method. In the released gridded dataset, we used the area-weighting method as it will meet the demands of most applications. The dataset can be also aggregate to a lower resolution. The resolution is 30 arc-seconds (about 1 km at the equator). The vertical variation of soil property was captured by eight layers to the depth of 2.3 m (i.e. 0- 0.045, 0.045- 0.091, 0.091- 0.166, 0.166- 0.289, 0.289- 0.493, 0.493- 0.829, 0.829- 1.383 and 1.383- 2.296 m).
Facebook
TwitterThis data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
In this dataset, there are soil data analyses with properties such as pH, organic matter (OM), salinity (EC), etc., major elements (N, P, K, Mg) as well as some microelements (Fe, Zn, Mn, Cu, B) with significant impact on plant nutrition.
Agricultural Soil
Panagiotis Tziachris
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Soil Type is a dataset for classification tasks - it contains Soil annotations for 158 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Facebook
TwitterThe National Cooperative Soil Survey - Soil Characterization Database (NCSS-SCD) contains laboratory data for more than 65,000 locations (i.e. xy coordinates) throughout the United States and its Territories, and about 2,100 locations from other countries. It is a compilation of data from the Kellogg Soil Survey Laboratory (KSSL) and several cooperating laboratories. The data steward and distributor is the National Soil Survey Center (NSSC). Information contained within the database includes physical, chemical, biological, mineralogical, morphological, and mid infrared reflectance (MIR) soil measurements, as well a collection of calculated values. The intended use of the data is to support interpretations related to soil use and management. Data Usage Access to the data is provided via the following user interfaces: 1. Interactive Web Map 2. Lab Data Mart (LDM) for querying data and generating reports 3. Soil Data Access (SDA) web services for querying data 5. Direct download of the entire database in several formats Data at each location includes measurements at multiple depths (e.g. soil horizons). However, not all analyses have been conducted for each location and depth. Typically, a suite of measurements was collected based upon assumed or known conditions regarding the soil being analyzed. For example, soils of arid environments are routinely analyzed for salts and carbonates as part of the standard analysis suite. Standard morphological soil descriptions are available for about 60,000 of these locations. Mid-infrared (MIR) spectroscopy is available for about 7,000 locations. Soil fertility measurements, such as those made by Agricultural Experiment Stations, were not made. Most of the data were obtained over the last 40 years, with about 4,000 locations before 1960, 25,000 from 1960-1990, 27,000 from 1990-2010, and 13,000 from 2010 to 2021. Generally, the number of measurements recorded per location has increased over time. Typically, the data were collected to represent a soil series or map unit component concept. They may also have been sampled to determine the range of variation within a given landscape. Although strict quality-control measures are applied, the NSSC does not warrant that the data are error free. Also, in some cases the measurements are not within the applicability range of the laboratory methods. For example, dispersion of clay is incomplete in some soils by the standard method used for determining particle-size distribution. Soils producing incomplete dispersion include those that are derived from volcanic materials or that have a high content of iron oxides, gypsum, carbonates, or other cementing materials. Also note that determination of clay minerals by x-ray diffraction is relative. Measurements of very high or very low quantities by any method are not very precise. Other measurements have other limitations in some kinds of soils. Such data are retained in the database for research purposes. Also, some of the data for were obtained from cooperating laboratories within the NCSS. The accuracy of the location coordinates has not been quantified but can be inferred from the precision of their decimal degrees and the presence of a map datum. Some older records may correspond to a county centroid. When the map datum is missing it can be assumed that data prior to 1990 was recorded using NAD27 and with WGS84 after 1995. For detailed information about methods used in the KSSL and other laboratories refer to "Soil Survey Investigation Report No. 42". For information on the application of laboratory data, refer to "Soil Survey Investigation Report No. 45". If you are unfamiliar with any terms or methods feel free to consult your NRCS State Soil Scientist. Terms of Use This dataset is not designed for use as a primary regulatory tool in permitting or citing decisions but may be used as a reference source. This is public information and may be interpreted by organizations, agencies, units of government, or others based on needs; however, they are responsible for the appropriate application. Federal, State, or local regulatory bodies are not to reassign to the Natural Resources Conservation Service or the National Cooperative Soil Survey any authority for the decisions that they make. The Natural Resources Conservation Service will not perform any evaluations of these data for purposes related solely to State or local regulatory programs.
Facebook
TwitterU.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
The dataset consists of three raster GeoTIFF files describing the following soil properties in the US: available water capacity, field capacity, and soil porosity. The input data were obtained from the gridded National Soil Survey Geographic (gNATSGO) Database and the Gridded Soil Survey Geographic (gSSURGO) Database with Soil Data Development tools provided by the Natural Resources Conservation Service. The soil characteristics derived from the databases were Available Water Capacity (AWC), Water Content (one-third bar) (WC), and Bulk Density (one-third bar) (BD) aggregated as weighted average values in the upper 1 m of soil. AWC and WC layers were converted to mm/m to express respectively available water capacity and field capacity in 1 m of soil, and BD layer was used to produce soil porosity raster assuming that the average particle density of soils is equal to 2.65 g/cm3. For each soil property, soil maps with CONUS, Alaska, and Hawaii geographic coverages were derived from s ...
Facebook
TwitterThis data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Brief Description: This dataset contains 1 million simulated soil samples from various locations around the globe. Each sample includes data on soil texture, pH, organic matter content, moisture content, bulk density, nutrient levels (N, P, K), cation exchange capacity, electrical conductivity, color, porosity, and water holding capacity. Designed for environmental scientists, agronomists, and data scientists, this dataset is ideal for research, machine learning models, and educational purposes. Purpose: To provide a comprehensive soil dataset for environmental and agricultural research, including machine learning and data analysis applications. Data Collection Method: Simulated data generated using Python with realistic ranges and distributions based on common soil characteristics.
Usage Examples
Predictive modeling of soil properties.
Classification of soil types based on texture and nutrient content.
Analysis of soil health and fertility across different geographic locations.
File Descriptions
soil_data.csv - The main dataset file containing 1 million rows of soil data across 17 features.
Data Fields
Soil_ID: Unique identifier for each soil sample.
Location_Latitude and Location_Longitude: Geographic coordinates of the soil sample.
Depth_cm: Depth at which the soil sample was collected (cm).
Texture: Soil texture classification (sandy, loamy, clayey).
pH: Soil pH level.
Organic_Matter_%: Percentage of organic matter in the soil.
Moisture_Content_%: Soil moisture content percentage.
Bulk_Density_g/cm³: Soil bulk density (g/cm³).
Nitrogen_N_ppm, Phosphorus_P_ppm, Potassium_K_ppm: Nutrient levels in parts per million (ppm).
Cation_Exchange_Capacity_meq/100g: Soil's ability to hold positively charged ions (meq/100g).
Electrical_Conductivity_dS/m: Soil electrical conductivity (dS/m).
Soil_Color: Color of the soil (brown, red, black, yellow).
Porosity_%: Percentage of pore space in the soil.
Water_Holding_Capacity_%: Soil's water holding capacity percentage.
Acknowledgments
If your dataset generation was inspired by specific studies, data sources, or methodologies, acknowledge them here.
Facebook
TwitterVersion 3.1 of the ISRIC-WISE database (WISE3) was compiled from a wide range of soil profile data collected by many soil professionals worldwide. All profiles have been harmonized with respect to the original Legend (1974) and Revised Legend (1988) of FAO-Unesco. Thereby, the primary soil data ─ and any secondary data derived from them ─ can be linked using GIS to the spatial units of the digitized Soil Map of the World as well as more recent digital Soil and Terrain (SOTER) databases through the soil legend code.
WISE3 holds selected attribute data for some 10,250 soil profiles, with some 47,800 horizons, from 149 countries. Individual profiles have been sampled, described, and analyzed according to methods and standards in use in the originating countries. There is no uniform set of properties for which all profiles have analytical data, generally because only selected measurements were planned during the original surveys. Methods used for laboratory determinations of specific soil properties vary between laboratories and over time; sometimes, results for the same property cannot be compared directly. WISE3 will inevitably include gaps, being a compilation of legacy soil data derived from traditional soil survey, which can be of a taxonomic, geographic, and soil analytical nature. As a result, the amount of data available for modelling is sometimes much less than expected. Adroit use of the data, however, will permit a wide range of agricultural and environmental applications at a global and continental scale (1:500 000 and broader).
Preferred citation: Batjes NH 2009. Harmonized soil profile data for applications at global and continental scales: updates to the WISE database. Soil Use and Management 5:124–127, http://dx.doi.org/10.1111/j.1475-2743.2009.00202.x
Facebook
TwitterThis data set provides gridded data for selected soil parameters derived from data and methods developed by the Global Soil Data Task, an international collaborative project with the objective of making accurate and appropriate data relating to soil properties accessible to the global change research community. The task was coordinated by the International Geosphere-Biosphere Programme (IGBP-DIS). The data in this data set were produced by the International Satellite Land-Surface Climatology Project, Initiative II (ISLSCP II) staff from data obtained from the Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC, http://daac.ornl.gov/). See the related data sets section below. Two-dimensional gridded maps of selected soil parameters, including soil texture, at a 1.0 by 1.0 degree spatial resolution and for two soil depths are provided. All data layers have been adjusted to match the ISLSCP II land/water mask. There are 36 data files with this data set.
Facebook
Twitter(Link to Metadata) This data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties. Survey Dates - https://www.nrcs.usda.gov/wps/portal/nrcs/surveylist/soils/survey/state/?stateId=VT
Facebook
TwitterThe International Soil Reference and Information Centre-World Inventory of Soil Emission Potentials (ISRIC-WISE) international soil profile data set consists of a homogenized, global set of 1,125 soil profiles for use by global modelers. These profiles provided the basis for the Global Pedon Database (GPDB) of the International Geosphere-Biosphere Programme (IGBP) - Data and Information System (DIS). The data set consists of a selection of 665 profiles originating from the Natural Resources Conservation Service (NRCS, Lincoln), 250 profiles obtained from the Food and Agriculture Organization (FAO, Rome), and 210 profiles from the reference collection of the International Soil Reference and Information Centre (ISRIC, Wageningen). All profiles are georeferenced and classified according to the 1974 Legend of the FAO-UNESCO Soil Map (FAC-UNESCO, 1974) of the World, as well as the 1988 Revised Legend of FAO-UNESCO (FAO, 1990). The data set includes information on soil classification, site data, soil horizon data, source of data, and methods used for determining analytical data. The data files are in a comma-delimited format. Data Citation: The data set should be cited as follows: Batjes, N. H. (ed). 2000. Global Soil Profile Data (ISRIC-WISE). Available on-line from the ORNL Distributed Active Archive Center, Oak Ridge National Laboratory, Oak Ridge, Tennessee, U.S.A.
Facebook
TwitterThis layer shows soil type, based on the result of a classification established from Kalimantan RePPProT data on 'SL_ORDER' field (1990, 1:250,000 scale) . This data was provided and processed by Daemeter Consulting. Soil categories from RePPProT were then re-classified by the World Resources Institute according to the FAO Digital Soil Map of the World, for use in the Suitability Mapper (2012). The FAO data is available at http://www.fao.org/geonetwork/srv/en/metadata.show?id=14116 . Data separated into categories: Inceptisol; Oxisol; Alfisol; Ultisol; Spodosol; Entisol; Histosol.
Facebook
TwitterThis data set is a digital soil survey and generally is the mostdetailed level of soil geographic data developed by the NationalCooperative Soil Survey. The information was prepared by digitizingmaps, by compiling information onto a planimetric correct baseand digitizing, or by revising digitized maps using remotelysensed and other information.This data set consists of georeferenced digital map data andcomputerized attribute data. The map data are in a soil survey areaextent format and include a detailed, field verified inventoryof soils and miscellaneous areas that normally occur in a repeatablepattern on the landscape and that can be cartographically shown atthe scale mapped. The soil map units are linked to attributes in theNational Soil Information System relational database, which givesthe proportionate extent of the component soils and their properties.
Facebook
TwitterA global data set of soil types is available at 0.5-degree latitude by 0.5-degree longitude resolution. There are 106 soil units, based on Zobler?s (1986) assessment of the FAO/UNESCO Soil Map of the World. This data set is a conversion of the Zobler 1-degree resolution version to a 0.5-degree resolution. The resolution of the data set was not actually increased. Rather, the 1-degree squares were divided into four 0.5-degree squares with the necessary adjustment of continental boundaries and islands. The computer code used to convert the original 1-degree data to 0.5-degree is provided as a companion file. A JPG image of the data is provided in this document. The Zobler data (1-degree resolution) as distributed by Webb et al. (1993) http://www.ngdc.noaa.gov/seg/eco/cdroms/gedii_a/datasets/a12/wr.htm#top contains two columns, one column for continent and one column for soil type. The Soil Map of the World consists of 9 maps that represent parts of the world. The texture data that Webb et al.(1993) provided allowed for the fact that a soil type in one part of the world may have different properties than the same soil in a different part of the world. This continent-specific information is retained in this 0.5-degree resolution data set, as well as the soil type information which is the second column. A code was written (one2half.c) to take the file CONTIZOB.LER distributed by Webb et al. (1993) http://www.ngdc.noaa.gov/seg/eco/cdroms/gedii_a/datasets/a12/wr.htm#top and simply divide the 1-degree cells into quarters. This code also reads in a land/water file (land.wave) that specifies the cells that are land at 0.5 degrees. The code checks for consistency between the newly quartered map and the land/w...
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Sixty one soils (soil types) represent the range of soils found across South Australia’s agricultural lands. Mapping shows the most common soil within each map unit, while more detailed proportion data are supplied for calculating respective areas of each soil type (spatial data statistics).
Facebook
TwitterThis data set is a digital soil survey and generally is the most detailed level of soil geographic data developed by the National Cooperative Soil Survey. The information was prepared by digitizing maps, by compiling information onto a planimetric correct base and digitizing, or by revising digitized maps using remotely sensed and other information. This data set consists of georeferenced digital map data and computerized attribute data. The map data are in a soil survey area extent format and include a detailed, field verified inventory of soils and miscellaneous areas that normally occur in a repeatable pattern on the landscape and that can be cartographically shown at the scale mapped. A special soil features layer (point and line features) is optional. This layer displays the location of features too small to delineate at the mapping scale, but they are large enough and contrasting enough to significantly influence use and management. The soil map units are linked to attributes in the National Soil Information System relational database, which gives the proportionate extent of the component soils and their properties.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Hydric soils are defined as those soils that are sufficiently wet in the upper part to develop anaerobic conditions during the growing season. The Hydric Soils section presents the most current information about hydric soils. The lists of hydric soils were created by using National Soil Information System (NASIS) database selection criteria that were developed by the National Technical Committee for Hydric Soils. These criteria are selected soil properties that are documented in Soil Taxonomy (Soil Survey Staff, 1999) and were designed primarily to generate a list of potentially hydric soils from the National Soil Information System (NASIS) database. It updates information that was previously published in Hydric Soils of the United States and coordinates it with information that has been published in the Federal Register. It also includes the most recent set of field indicators of hydric soils. The database selection criteria are selected soil properties that are documented in Soil Taxonomy and were designed primarily to generate a list of potentially hydric soils from soil survey databases. Only criteria 1, 3, and 4 can be used in the field to determine hydric soils; however, proof of anaerobic conditions must also be obtained for criteria 1, 3, and 4 either through data or best professional judgment (from Tech Note 1). The primary purpose of these selection criteria is to generate a list of soil map unit components that are likely to meet the hydric soil definition. Caution must be used when comparing the list of hydric components to soil survey maps. Many of the soils on the list have ranges in water table depths that allow the soil component to range from hydric to nonhydric depending on the location of the soil within the landscape as described in the map unit. Lists of hydric soils along with soil survey maps are good off-site ancillary tools to assist in wetland determinations, but they are not a substitute for observations made during on-site investigations. The list of field indicators of hydric soils — The field indicators are morphological properties known to be associated with soils that meet the definition of a hydric soil. Presence of one or more field indicators suggests that the processes associated with hydric soil formation have taken place on the site being observed. The field indicators are essential for hydric soil identification because once formed, they persist in the soil during both wet and dry seasonal periods. The Hydric Soil Technical Notes — Contain National Technical Committee for Hydric Soils (NTCHS) updates, insights, standards, and clarifications. Users can query the database by State or by Soil Survey Area. Resources in this dataset:Resource Title: Website Pointer to Hydric Soils . File Name: Web Page, url: https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/use/hydric/ Includes description of Criteria, Query by State or Soil Survey Area, national Technical Committee for Hydric Soils. Technical Notes, and Related Links. Report Metadata:
Criteria:
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Please ensure to cite the paper when utilizing the dataset in a research study. Refer to the paper link or BibTeX provided below.
This repository contains comprehensive datasets for soil classification and recognition research. The Original Dataset comprises soil images sourced from various online repositories, which have been meticulously cleaned and preprocessed to ensure data quality and consistency. To enhance the dataset's size and diversity, we employed Generative Adversarial Networks (GANs), specifically the CycleGAN architecture, to generate synthetic soil images. This augmented collection is referred to as the CyAUG Dataset. Both datasets are specifically designed to advance research in soil classification and recognition using state-of-the-art deep learning methodologies.
This dataset was curated as part of the research study titled "An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations" by Farhan Sheth, Priya Mathur, Amit Kumar Gupta, and Sandeep Chaurasia, published in Engineering Applications of Artificial Intelligence.
Application produced by this research is available at:
Note: If you are using any part of this project; dataset, code, application, then please cite the work as mentioned in the Citation section below.
Both dataset consists of images of 7 different soil types.
The Soil Classification Dataset is structured to facilitate the classification of various soil types based on images. The dataset includes images of the following soil types:
The dataset is organized into folders, each named after a specific soil type, containing images of that soil type. The images vary in resolution and quality, providing a diverse set of examples for training and testing classification models.
If you are using any of the derived dataset, please cite the following paper:
@article{SHETH2025111425,
title = {An advanced artificial intelligence framework integrating ensembled convolutional neural networks and Vision Transformers for precise soil classification with adaptive fuzzy logic-based crop recommendations},
journal = {Engineering Applications of Artificial Intelligence},
volume = {158},
pages = {111425},
year = {2025},
issn = {0952-1976},
doi = {https://doi.org/10.1016/j.engappai.2025.111425},
url = {https://www.sciencedirect.com/science/article/pii/S0952197625014277},
author = {Farhan Sheth and Priya Mathur and Amit Kumar Gupta and Sandeep Chaurasia},
keywords = {Soil classification, Crop recommendation, Vision transformers, Convolutional neural network, Transfer learning, Fuzzy logic}
}