100+ datasets found

Data from: Estimated spring crop yields using Flex Cropping Tool
catalog.data.gov
geodata.nal.usda.gov
+2more
Updated Apr 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). Estimated spring crop yields using Flex Cropping Tool [Dataset]. https://catalog.data.gov/dataset/estimated-spring-crop-yields-using-flex-cropping-tool-fdbfd
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Agricultural Research Servicehttps://www.ars.usda.gov/
Description
Average estimated yields and associated CV values for current (2018) model runs. Based on work done by Harsimran Kaur et al in 2017. The following is from her thesis: Agro-ecological classes (AECs) of dryland cropping systems in the inland Pacific Northwest have been predicted to become more dynamic with greater use of annual fallow under projected climate change. At the same time, initiatives are being taken by growers either to intensify or diversify their cropping systems using oilseed and grain legume crops. The main objective of this study was to use a mechanistic model (CropSyst) to provide yield and soil water forecasts at regional scales which could compare fallow versus spring crop choices (flex/opportunity crop). Model simulations were based on historic weather data (1981-2010) as well as combined with actual year weather data for simulations at pre-planting dates starting in Dec. for representative years. Yield forecasts of spring pea, canola and wheat were compared to yield simulations using only weather of the representative year via linear regression analysis to assess pre-plant forecasts. Crop yield projections on pre-plant forecast date of Feb 1st had higher R2 with yield simulated using actual years weather data and lower CVs across the region as compared to forecasts based on historic weather data and other pre-season forecast dates (Dec. 1st and Jan. 1st). Therefore, Feb. 1st was considered the most reliable time to predict yield and other relevant outputs such as available water forecasts on a regional scale. Regional forecast maps of predicted spring crop yields and CVs showed ranges of 1 to 4367 kg/ha and 11 to 293% for spring canola, 72 to 2646 kg/ha and 11 to 143% for spring pea and 39 to 5330 kg/ha and 11 to 158% for spring wheat across study region for a representative year. These data combined with predicted available water after fallow and following spring crop yield as well as estimates of winter wheat yield reduction would collectively serve as information contributing to decisions related to crop intensification and diversification. Resources in this dataset:Resource Title: GeoData catalog record. File Name: Web Page, url: https://geodata.nal.usda.gov/geonetwork/srv/eng/catalog.search#/metadata/459d2dba-a346-4e54-9750-ef3178c18f38
A
‘Crop Yield Prediction Dataset’ analyzed by Analyst-2
analyst-2.ai
Updated Jan 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Crop Yield Prediction Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-crop-yield-prediction-dataset-033a/latest
Explore at:
Dataset updated
Jan 28, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Crop Yield Prediction Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/patelris/crop-yield-prediction-dataset on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Context

The science of training machines to learn and produce models for future predictions is widely used, and not for nothing. Agriculture plays a critical role in the global economy. With the continuing expansion of the human population understanding worldwide crop yield is central to addressing food security challenges and reducing the impacts of climate change.

Crop yield prediction is an important agricultural problem. The Agricultural yield primarily depends on weather conditions (rain, temperature, etc), pesticides and accurate information about history of crop yield is an important thing for making decisions related to agricultural risk management and future predictions.

Acknowledgements

All dataset(publicly available dataset) here are taken form FAO (Food and Agriculture Organization) and World Data Bank. http://www.fao.org/home/en/ https://data.worldbank.org/

--- Original source retains full ownership of the source dataset ---
Sample data for "Machine learning for large-scale forecasting"
zenodo.org
csv
Updated Oct 11, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dilli Paudel; Dilli Paudel; Hendrik Boogaard; Allard de Wit; Sander Janssen; Sjoukje Osinga; Christos Pylianidis; Ioannis Athanasiadis; Hendrik Boogaard; Allard de Wit; Sander Janssen; Sjoukje Osinga; Christos Pylianidis; Ioannis Athanasiadis (2021). Sample data for "Machine learning for large-scale forecasting" [Dataset]. http://doi.org/10.5281/zenodo.4312941
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4312941
Dataset updated
Oct 11, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Dilli Paudel; Dilli Paudel; Hendrik Boogaard; Allard de Wit; Sander Janssen; Sjoukje Osinga; Christos Pylianidis; Ioannis Athanasiadis; Hendrik Boogaard; Allard de Wit; Sander Janssen; Sjoukje Osinga; Christos Pylianidis; Ioannis Athanasiadis
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset includes sample data for the Netherlands to run the machine learning baseline as described in the paper titled Machine learning for large-scale crop yield forecasting, accessible at https://doi.org/10.1016/j.agsy.2020.103016. The software implementation of the machine learning baseline is available at: https://github.com/BigDataWUR/MLforCropYieldForecasting.

Notes:

The NUTS classification (Nomenclature of territorial units for statistics) is a hierarchical system for dividing up the economic territory of the EU and the UK (see Eurostat, 2016) for more details).

Data

The dataset consists of 11 CSV files. They are formatted to work as sample inputs to the machine learning baseline.

Crop Area Fractions (NUTS2, NUTS1): We aggregated the predictions of the machine learning baseline from NUTS2 to national (NUTS0) level by weighting them on the modeled crop area. Cerrani and López Lozano (2017) have described in detail the algorithm used to model crop areas for different NUTS levels. The data comes from the MARS Crop Yield Forecasting System (MCYFS) of European Commission's Joint Research Centre (JRC) (see Lecerf et al., 2019).

Centroids (NUTS2): Data includes latitude, longitude and distance to coast of the centroids of NUTS2 regions.

Meteo Daily Data and Meteo Dekadal Data (NUTS2): The data comes from MCYFS (see EC-JRC, 2020). By default, the implementation uses daily data.

Remote Sensing Data (NUTS2, see Copernicus Global Land Service, 2020): Data includes fraction of absorbed photosynthetically active radiation (FAPAR) aggregated to NUTS2.

Soil Data: Data includes soil moisture information that can be used to calculate soil water holding capacity. The data comes from MCYFS (see Lecerf et al., 2019).

WOFOST data (NUTS2): The World Food Studies (WOFOST) crop model (van Diepen et al., 1989; Supit et al., 1994; de Wit et al. 2019) is a simulation model for the quantitative analysis of the growth and production of annual field crops. It is a mechanistic, dynamic model that explains daily crop growth on the basis of the underlying processes, such as photosynthesis, respiration and how these processes are influenced by environmental conditions. The crop simulation is fed by weather, soil and crop data. Observed meteorological data is interpolated on a regular 25 km grid using a method based on the distance, altitude and climatic region similarity between the center of grid cells and weather stations (see Van der Goot, 1998). WOFOST runs on the intersection between the 25 km meteorological grid and soil units based on the European soil map (http://esdac.jrc.ec.europa.eu/). In order to have the output data aggregated to administrative regions such as countries or provinces, simulation units are further intersected with the boundaries of these regions. The outputs at soil unit (STU) level are aggregated to grid level in an area weighted manner. Gridded simulations are aggregated to lowest NUTS level 3 considering the arable land area of each grid, derived from GLOBCOVER and CORINE Land Cover (Cerrani and Lopez Lozano, 2017). From NUTS3 to higher levels, crop area fractions for the current year, retrieved from Eurostat, are used to weight and aggregate the output (Cerrani and Lopez Lozano, 2017).

National yield statistics (NUTS0): These are the official Eurostat national yield statistics (Eurostat, 2020a). We used these yield statistics as reference to compare the machine learning predictions aggregated to NUTS0 and the actual MCYFS forecasts (see van der Velde and Nisini, 2019).

Regional yield statistics (NUTS2): We used NUTS2 yield statistics as labels to train and evaluate machine learning algorithms. We got NUTS2 yield statistics from The Central Bureau of Statistics (CBS) of the Netherlands (NL-CBS, 2020).

Past MCYFS Yield Forecasts (NUTS0): These are actual forecasts made by MCYFS in the past (see van der Velde and Nisini, 2019). We used the official Eurostat national yield statistics (see point 7 above) as the reference to compare the machine learning predictions aggregated to NUTS0 and MCYFS forecasts.

Crop ID and name mapping

2 : grain maize

6 : sugar beets

7 : potatoes

90 : soft wheat

93 : sunflower

95 : spring barley

Acknowledgements

We would like to thank S. Niemeyer from the European Commission’s Joint Research Centre (JRC) for the permission to provide open access to the Netherlands data. Similarly, we would like to thank M. van der Velde, L. Nisini and I. Cerrani from JRC for sharing with us past MCYFS forecasts and Eurostat national yield statistics.
h
Drone-based-Agricultural-Dataset-for-Crop-Yield-Estimation
huggingface.co
Updated Aug 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KaraAgro AI Foundation (2023). Drone-based-Agricultural-Dataset-for-Crop-Yield-Estimation [Dataset]. http://doi.org/10.57967/hf/0959
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57967/hf/0959
Dataset updated
Aug 20, 2023
Authors
KaraAgro AI Foundation
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Drone-based Agricultural Dataset for Crop Yield Estimation

This repository contains a comprehensive dataset of cashew, cocoa and coffee images captured by drones, accompanied by meticulously annotated labels. To facilitate object detection, each image is paired with a corresponding text file in YOLO format. The YOLO format file contains annotations, including class labels and bounding box coordinates. The dataset was collected by teams from Ghana (KaraAgro AI) and Uganda (Makerere… See the full description on the dataset page: https://huggingface.co/datasets/KaraAgroAI/Drone-based-Agricultural-Dataset-for-Crop-Yield-Estimation.
m
Data from: Stacked Ensemble Model for Accurate Crop Yield Prediction Using...
data.mendeley.com
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ramesh V (2025). Stacked Ensemble Model for Accurate Crop Yield Prediction Using Machine Learning Techniques [Dataset]. http://doi.org/10.17632/ncw2vbcgnk.2
Explore at:
Unique identifier
https://doi.org/10.17632/ncw2vbcgnk.2
Dataset updated
Feb 5, 2025
Authors
Ramesh V
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We used historical data for crop yield in 27 Indian states and 3 Union Territories of India, covering the years 1997 to 2020. The dataset consists of 19,689 data points, each with ten features including Crop, Season, Crop_Year, State, Annual_Rainfall, Area, Production, Pesticide, Fertilizer, and Yield. The dataset encompasses 55 different types of crops cultivated across India. The crop yield dataset was used to prediction of crop yield using regression with stacking ensemble model. The dataset is split into training 80% and testing 20%.
Crop Yield Data India
kaggle.com
Updated Jul 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shahid Hussain (2024). Crop Yield Data India [Dataset]. https://www.kaggle.com/datasets/saincoder404/crop-yield-data-india
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 14, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shahid Hussain
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
India
Description
This dataset contains detailed information on crop yields across various states in India for the year 1997. It includes data on different crops, their production, area under cultivation, season of cultivation, and state-specific information. Additionally, the dataset provides supplementary details such as annual rainfall, fertilizer use, pesticide use, and yield for each crop. This comprehensive dataset can be used for agricultural analysis, trend prediction, and studying the impact of various factors on crop yields in India.
C
Crop Yield Forecasting Report
archivemarketresearch.com
doc, pdf, ppt
Updated Mar 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Crop Yield Forecasting Report [Dataset]. https://www.archivemarketresearch.com/reports/crop-yield-forecasting-52287
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Mar 6, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global crop yield forecasting market is experiencing robust growth, driven by increasing demand for efficient agricultural practices and the need to mitigate risks associated with climate change and resource scarcity. The market, valued at approximately $2.5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033. This substantial growth is fueled by several key factors. Technological advancements in remote sensing, artificial intelligence (AI), and machine learning (ML) are enabling the development of increasingly accurate and sophisticated forecasting models. The integration of these technologies into both software and service offerings is revolutionizing agricultural planning, enabling farmers to optimize resource allocation, improve yields, and reduce waste. Furthermore, government initiatives promoting sustainable agriculture and precision farming are providing further impetus to market expansion. The rising adoption of precision agriculture techniques, coupled with the growing awareness among farmers about the benefits of data-driven decision-making, is further contributing to the market's upward trajectory. The market segmentation reveals a strong preference for crop yield forecasting software, owing to its scalability and potential for integration with existing farm management systems. The commercial sector dominates the application segment, reflecting the higher adoption rate among large-scale agricultural businesses seeking to enhance operational efficiency and profitability. Geographically, North America and Europe currently hold significant market share, driven by early adoption of advanced technologies and well-established agricultural infrastructure. However, rapidly developing economies in Asia-Pacific, particularly India and China, represent lucrative growth opportunities, with increasing investment in agricultural modernization and technological advancements. The market's growth, while promising, faces certain restraints including the high initial investment costs associated with advanced technologies and the need for reliable internet connectivity and digital literacy among farmers, particularly in developing regions. Overcoming these barriers will be crucial to unlocking the full potential of the crop yield forecasting market in the coming years.
t
Machine Learning For Crop Yield Prediction Global Market Report 2025
thebusinessresearchcompany.com
pdf,excel,csv,ppt
Updated Mar 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Business Research Company (2025). Machine Learning For Crop Yield Prediction Global Market Report 2025 [Dataset]. https://www.thebusinessresearchcompany.com/report/machine-learning-for-crop-yield-prediction-global-market-report
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset updated
Mar 25, 2025
Dataset authored and provided by
The Business Research Company
License
https://www.thebusinessresearchcompany.com/privacy-policyhttps://www.thebusinessresearchcompany.com/privacy-policy
Description
Global Machine Learning For Crop Yield Prediction market size is expected to reach $2.58 billion by 2029 at 26.6%, rising demand for sustainable agriculture driving the growth of the market due to environmental and food security concerns
G
Crop Yield Forecast Variance Log
gomask.ai
csv
Updated Jul 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GoMask.ai (2025). Crop Yield Forecast Variance Log [Dataset]. https://gomask.ai/marketplace/datasets/crop-yield-forecast-variance-log
Explore at:
csv(Unknown)Available download formats
Dataset updated
Jul 12, 2025
Dataset provided by
GoMask.ai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
notes, field_id, variance, crop_type, field_name, yield_unit, season_year, actual_yield, harvest_date, forecast_date, and 5 more
Description
This dataset provides detailed logs comparing projected and actual crop yields at the field level, including forecast methods, timing, and variance calculations. It enables precise analysis of forecast accuracy, supports agricultural planning, and helps identify patterns or anomalies in yield prediction. The data is ideal for farm management optimization, agronomic research, and improving predictive models.
Crop Yield Prediction using Soil and Weather
kaggle.com
Updated Nov 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gurudath g (2024). Crop Yield Prediction using Soil and Weather [Dataset]. https://www.kaggle.com/datasets/gurudathg/crop-yield-prediction-using-soil-and-weather/discussion?sort=undefined
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 8, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Gurudath g
Description
This dataset is designed to help researchers and data scientists predict crop yield based on key agricultural factors. It contains columns representing the amount of fertilizer used, temperature, and soil nutrients (nitrogen, phosphorus, and potassium), along with the corresponding crop yield. This dataset can be used for machine learning projects focusing on agricultural optimization, yield forecasting, and resource management. The data is structured and ready for analysis, making it ideal for training regression models or conducting exploratory data analysis.

File Information: Number of Rows: 2,596 Number of Columns: 6 Data Types: Fertilizer: float64 temp: float64 N (Nitrogen): float64 P (Phosphorus): float64 K (Potassium): float64 yeild (Yield): float64 Missing Values: None (0 missing values in all columns) Summary Statistics: Fertilizer: Mean: 66.49, Min: 49.75, Max: 80.22 Temperature: Mean: 33.85, Min: 23.77, Max: 40.27 Nitrogen (N): Mean: 69.52, Min: 58.84, Max: 80.22 Phosphorus (P): Mean: 20.71, Min: 17.72, Max: 25.16 Potassium (K): Mean: 17.81, Min: 14.70, Max: 22.06 Yield: Mean: 8.53, Min: 5.15, Max: 12.34 This dataset is complete, with no missing values, and provides diverse statistics for various features important for crop yield prediction.
C
Crop Yield Forecasting Report
archivemarketresearch.com
doc, pdf, ppt
Updated Mar 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Crop Yield Forecasting Report [Dataset]. https://www.archivemarketresearch.com/reports/crop-yield-forecasting-52289
Explore at:
ppt, doc, pdfAvailable download formats
Dataset updated
Mar 6, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global crop yield forecasting market is experiencing robust growth, driven by the increasing need for efficient agricultural practices and enhanced food security in a changing climate. The market, estimated at $2.5 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching an estimated value of $8 billion by 2033. This expansion is fueled by several key factors. Technological advancements in remote sensing, data analytics, and artificial intelligence are enabling the development of sophisticated forecasting models, providing farmers with more accurate and timely predictions of crop yields. Furthermore, the rising adoption of precision agriculture techniques, coupled with the growing awareness of climate change's impact on crop production, is further propelling market growth. The increasing demand for higher crop yields to meet the rising global food demand also contributes significantly to this expansion. The market segmentation reveals strong growth across both software and service offerings, with the commercial application segment dominating due to its larger scale operations and greater investment capacity in advanced technologies. Geographic growth is anticipated to be particularly strong in regions like Asia-Pacific and North America, driven by higher technology adoption rates and significant agricultural sectors. However, challenges remain, including the high initial investment costs associated with implementing these technologies, a lack of digital literacy among some farming communities, and potential data security concerns. Despite these restraints, the long-term outlook for the crop yield forecasting market remains positive, with continued innovation and technological advancements expected to drive further market expansion in the coming years.
f
Table_1_Using machine learning for crop yield prediction in the past or the...
frontiersin.figshare.com
xlsx
Updated Jun 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alejandro Morales; Francisco J. Villalobos (2023). Table_1_Using machine learning for crop yield prediction in the past or the future.xlsx [Dataset]. http://doi.org/10.3389/fpls.2023.1128388.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fpls.2023.1128388.s001
Dataset updated
Jun 20, 2023
Dataset provided by
Frontiers
Authors
Alejandro Morales; Francisco J. Villalobos
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The use of ML in agronomy has been increasing exponentially since the start of the century, including data-driven predictions of crop yields from farm-level information on soil, climate and management. However, little is known about the effect of data partitioning schemes on the actual performance of the models, in special when they are built for yield forecast. In this study, we explore the effect of the choice of predictive algorithm, amount of data, and data partitioning strategies on predictive performance, using synthetic datasets from biophysical crop models. We simulated sunflower and wheat data using OilcropSun and Ceres-Wheat from DSSAT for the period 2001-2020 in 5 areas of Spain. Simulations were performed in farms differing in soil depth and management. The data set of farm simulated yields was analyzed with different algorithms (regularized linear models, random forest, artificial neural networks) as a function of seasonal weather, management, and soil. The analysis was performed with Keras for neural networks and R packages for all other algorithms. Data partitioning for training and testing was performed with ordered data (i.e., older data for training, newest data for testing) in order to compare the different algorithms in their ability to predict yields in the future by extrapolating from past data. The Random Forest algorithm had a better performance (Root Mean Square Error 35-38%) than artificial neural networks (37-141%) and regularized linear models (64-65%) and was easier to execute. However, even the best models showed a limited advantage over the predictions of a sensible baseline (average yield of the farm in the training set) which showed RMSE of 42%. Errors in seasonal weather forecasting were not taken into account, so real-world performance is expected to be even closer to the baseline. Application of AI algorithms for yield prediction should always include a comparison with the best guess to evaluate if the additional cost of data required for the model compensates for the increase in predictive power. Random partitioning of data for training and validation should be avoided in models for yield forecasting. Crop models validated for the region and cultivars of interest may be used before actual data collection to establish the potential advantage as illustrated in this study.
f
GPS Coordinates of 18,482 Crop Fields in East Africa with Improved Accuracy...
figshare.com
pdf
Updated Jun 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kareem Eissa; Karim Amer; Jacoby Jaeger; Mohamed ElHelw; David Guerena (2023). GPS Coordinates of 18,482 Crop Fields in East Africa with Improved Accuracy using Planet Imagery and Yolo v5 Object Detection Model [Dataset]. http://doi.org/10.6084/m9.figshare.15157263.v2
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.15157263.v2
Dataset updated
Jun 9, 2023
Dataset provided by
figshare
Authors
Kareem Eissa; Karim Amer; Jacoby Jaeger; Mohamed ElHelw; David Guerena
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
East Africa, Africa
Description
Georeferenced crop yield prediction is a valuable tool for agronomists and policymakers. One challenge with many existing datasets is that of location accuracy. GPS locations for fields can end up offset from the true location due to sensor inaccuracies or from locations being collected at the edges of fields rather than the field centers. This makes it harder to connect remote-sensed data to the yield values. The goal of this project was to produce a method that can help correct these location offsets by finding the most probable field center given an input location. We prepared and hosted a competition on Zindi (https://zindi.africa) where competitors model the problem using state-of-the-art data science techniques. We provided the competitors with satellite images of fields along with their corresponding manually annotated correct centers. Additionally, we also provided approximate plot size and measured yield in case these help with creating their solutions. Original positions are considered images' centers as (0,0) and a displacement vector for each field in the training set was provided. The goal of the competition was to predict these vectors for each vector in the test set. This dataset includes the locations of 18,481 crop fields across Kenya, Tanzania, and Rwanda, collected in 2016-2017 with mixed qualities and their error-corrected ones from the winning solution using Planet satellite imagery and the Yolo v5 object detection model.
C
Crop Yield Forecasting Report
datainsightsmarket.com
doc, pdf, ppt
Updated Jun 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data Insights Market (2025). Crop Yield Forecasting Report [Dataset]. https://www.datainsightsmarket.com/reports/crop-yield-forecasting-1967680
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Jun 10, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global crop yield forecasting market is experiencing robust growth, driven by the increasing need for precision agriculture and improved food security. The market's expansion is fueled by several key factors, including the rising adoption of advanced technologies like AI, machine learning, and remote sensing, which enable more accurate and timely yield predictions. Furthermore, climate change and its unpredictable impact on agricultural production are pushing farmers and stakeholders to leverage predictive analytics to mitigate risks and optimize resource allocation. The market is segmented based on various factors, including technology (satellite imagery, weather data, soil sensors), application (field-level forecasting, regional forecasting), and crop type. While the precise market size is not provided, considering a typical CAGR (Compound Annual Growth Rate) of 15-20% for such a technology-driven market, a reasonable estimate for the 2025 market size could be in the range of $500 million to $750 million, given the involvement of established players like Pessl Instruments and emerging companies like AgroMetShell and CropProphet. This growth trajectory is projected to continue throughout the forecast period (2025-2033), driven by continuous technological advancements and increasing adoption rates across different regions. Constraints such as high initial investment costs for technology and data accessibility limitations, especially in developing countries, may pose some challenges, but are likely to be offset by the significant economic and environmental benefits offered by accurate crop yield forecasting. The competitive landscape comprises a mix of established players and innovative startups. Companies like EOSDA and Agronomy Insights are leveraging their expertise in data analytics and remote sensing to offer comprehensive solutions, while others focus on niche applications or specific crop types. The market is also witnessing increasing collaboration between agricultural research institutions (like CCAFS and WUR) and technology companies, further accelerating innovation. Geographical expansion, particularly in developing economies with significant agricultural sectors, presents significant growth opportunities. The integration of crop yield forecasting with other precision agriculture technologies, such as variable-rate fertilization and irrigation, promises to further enhance efficiency and sustainability in agricultural practices, strengthening the market's long-term prospects. Overall, the crop yield forecasting market is poised for substantial growth, contributing significantly to enhancing agricultural productivity and addressing global food security concerns.
h
california-crop-yield-benchmark
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hamid Kamangir, california-crop-yield-benchmark [Dataset]. https://huggingface.co/datasets/hkaman/california-crop-yield-benchmark
Explore at:
Authors
Hamid Kamangir
Description
California Crop Yield Benchmark: Combining Satellite Image, Climate, Evapotranspiration, and Soil Data Layers for County-Level Yield Forecasting of Over 70 Crops

This benchmark offers a comprehensive, unified, and multi-modal dataset for county-level crop yield prediction across California. It integrates diverse data sources, including monthly time series from Landsat satellite imagery, monthly evapotranspiration (ET) data, daily DayMet climate variables, static soil attributes, annual… See the full description on the dataset page: https://huggingface.co/datasets/hkaman/california-crop-yield-benchmark.

Data from: CY-Bench: A comprehensive benchmark dataset for subnational crop...

zenodo.org
explore.openaire.eu

zip

Updated Sep 25, 2024

Facebook

Twitter

Click to copy link

Link copied

Cite

Dilli Paudel; Dilli Paudel; Hilmy Baja; Hilmy Baja; Ron van Bree; Michiel Kallenberg; Michiel Kallenberg; Stella Ofori-Ampofo; Aike Potze; Pratishtha Poudel; Pratishtha Poudel; Abdelrahman Saleh; Weston Anderson; Weston Anderson; Malte von Bloh; Andres Castellano; Oumnia Ennaji; Raed Hamed; Rahel Laudien; Donghoon Lee; Inti Luna; Dainius Masiliūnas; Dainius Masiliūnas; Michele Meroni; Janet Mumo Mutuku; Siyabusa Mkuhlani; Jonathan Richetti; Alex C. Ruane; Ritvik Sahajpal; Guanyuan Shuai; Vasileios Sitokonstantinou; Rogerio de Souza Noia Junior; Amit Kumar Srivastava; Robert Strong; Lily-belle Sweet; Lily-belle Sweet; Petar Vojnović; Allard de Wit; Allard de Wit; Maximilian Zachow; Ioannis N. Athanasiadis; Ron van Bree; Stella Ofori-Ampofo; Aike Potze; Abdelrahman Saleh; Malte von Bloh; Andres Castellano; Oumnia Ennaji; Raed Hamed; Rahel Laudien; Donghoon Lee; Inti Luna; Michele Meroni; Janet Mumo Mutuku; Siyabusa Mkuhlani; Jonathan Richetti; Alex C. Ruane; Ritvik Sahajpal; Guanyuan Shuai; Vasileios Sitokonstantinou; Rogerio de Souza Noia Junior; Amit Kumar Srivastava; Robert Strong; Petar Vojnović; Maximilian Zachow; Ioannis N. Athanasiadis (2024). CY-Bench: A comprehensive benchmark dataset for subnational crop yield forecasting [Dataset]. http://doi.org/10.5281/zenodo.13798797

Explore at:

zipAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.13798797

Dataset updated

Sep 25, 2024

Dataset provided by

AgML (https://www.agml.org/)

Authors

License

https://joinup.ec.europa.eu/page/eupl-text-11-12https://joinup.ec.europa.eu/page/eupl-text-11-12

Description

CY-Bench: A comprehensive benchmark dataset for sub-national crop yield forecasting

Overview

CY-Bench is a dataset and benchmark for subnational crop yield forecasting, with coverage of major crop growing countries of the world for maize and wheat. By subnational, we mean the administrative level where yield statistics are published. When statistics are available for multiple levels, we pick the highest resolution. The dataset combines sub-national yield statistics with relevant predictors, such as growing-season weather indicators, remote sensing indicators, evapotranspiration, soil moisture indicators, and static soil properties. CY-Bench has been designed and curated by agricultural experts, climate scientists, and machine learning researchers from the AgML Community, with the aim of facilitating model intercomparison across the diverse agricultural systems around the globe in conditions as close as possible to real-world operationalization. Ultimately, by lowering the barrier to entry for ML researchers in this crucial application area, CY-Bench will facilitate the development of improved crop forecasting tools that can be used to support decision-makers in food security planning worldwide.

* Crops : Wheat & Maize
* Spatial Coverage : Wheat (29 countries), Maize (38).
See CY-Bench paper appendix for the list of countries.
* Temporal Coverage : Varies. See country-specific data

Data

Data format

The benchmark data is organized as a collection of CSV files, with each file representing a specific category of variable for a particular country. Each CSV file is named according to the category and the country it pertains to, facilitating easy identification and retrieval. The data within each CSV file is structured in tabular format, where rows represent observations and columns represent different predictors related to a category of variable.

Data content

All data files are provided as .csv.

Data	Description	Variables (units)	Temporal Resolution	Data Source (Reference)
crop_calendar	Start and end of growing season	sos (day of the year), eos (day of the year)	Static	World Cereal (Franch et al, 2022)
fpar	fraction of absorbed photosynthetically active radiation	fpar (%)	Dekadal (3 times a month; 1-10, 11-20, 21-31)	European Commission's Joint Research Centre (EC-JRC, 2024)
ndvi	normalized difference vegetation index	-	approximately weekly	MOD09CMG (Vermote, 2015)
meteo	temperature, precipitation (prec), radiation, potential evapotranspiration (et0), climatic water balance (= prec - et0)	tmin (C), tmax (C), tavg (C), prec (mm0, et0 (mm), cwb (mm), rad (J m-2 day-1)	daily	AgERA5 (Boogaard et al, 2022), FAO-AQUASTAT for et0 (FAO-AQUASTAT, 2024)
soil_moisture	surface soil moisture, rootzone soil moisture	ssm (kg m-2), rsm (kg m-2)	daily	GLDAS (Rodell et al, 2004)
soil	available water capacity, bulk density, drainage class	awc (c m-1), bulk_density (kg dm-3), drainage class (category)	static	WISE Soil database (Batjes, 2016)
yield	end-of-season yield	yield (t ha-1)	yearly	Various country or region specific sources (see crop_statistics_... in https://github.com/BigDataWUR/AgML-CY-Bench/tree/main/data_preparation)

Folder structure

The CY-Bench dataset has been structure at first level by crop type and subsequently by country. For each country, the folder name follows the ISO 3166-1 alpha-2 two-character code. A separate .csv is available for each predictor data and crop calendar as shown below. The csv files are named to reflect the corresponding country and crop type e.g. **variable_croptype_country.csv**.
```
CY-Bench
│
└─── maize
│ │
│ └─── AO
│ │ -- crop_calendar_maize_AO.csv
│ │ -- fpar_maize_AO.csv
│ │ -- meteo_maize_AO.csv
│ │ -- ndvi_maize_AO.csv
│ │ -- soil_maize_AO.csv
│ │ -- soil_moisture_maize_AO.csv
│ │ -- yield_maize_AO.csv
│ │
│ └─── AR
│ -- crop_calendar_maize_AR.csv
│ -- fpar_maize_AR.csv
│ -- ...
│
└─── wheat
│ │
│ └─── AR
│ │ -- crop_calendar_wheat_AR.csv
│ │ -- fpar_wheat_AR.csv
│ │ ...
```

Example : CSV data content for maize in country X

```
X
└─── crop_calendar_maize_X.csv
│ -- crop_name (name of the crop)
│ -- adm_id (unique identifier for a subnational unit)
│ -- sos (start of crop season)
│ -- eos (end of crop season)
│
└─── fpar_maize_X.csv
│ -- crop_name
│ -- adm_id
│ -- date (in the format YYYYMMdd)
│ -- fpar
│
└─── meteo_maize_X.csv
│ -- crop_name
│ -- adm_id
│ -- date (in the format YYYYMMdd)

│ -- tmin (minimum temperature)
│ -- tmax (maximum temperature)
│ -- prec (precipitation)
│ -- rad (radiation)
│ -- tavg (average temperature)
│ -- et0 (evapotranspiration)
│ -- cwb (crop water balance)
│
└─── ndvi_maize_X.csv
│ -- crop_name
│ -- adm_id
│ -- date (in the format YYYYMMdd)
│ -- ndvi
│
└─── soil_maize_X.csv
│ -- crop_name
│ -- adm_id
│ -- awc (available water capacity)
│ -- bulk_density
│ -- drainage_class
│
└─── soil_moisture_maize_X.csv
│ -- crop_name
│ -- adm_id
│ -- date (in the format YYYYMMdd)
│ -- ssm (surface soil moisture)
│ -- rsm ()
│
└─── yield_maize_X.csv
│ -- crop_name
│ -- country_code
│ -- adm_id
│ -- harvest_year
│ -- yield
│ -- harvest_area
│ -- production

Data access

The full dataset can be downloaded directly from Zenodo or using the ```zenodo_get``` library

License and citation

We kindly ask all users of CY-Bench to properly respect licensing and citation conditions of the datasets included.

i
Machine Learning for Crop Yield Prediction Market - Global Size & Upcoming...
imrmarketreports.com
Updated Feb 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Swati Kalagate; Akshay Patil; Vishal Kumbhar (2025). Machine Learning for Crop Yield Prediction Market - Global Size & Upcoming Industry Trends [Dataset]. https://www.imrmarketreports.com/reports/machine-learning-for-crop-yield-prediction-market
Explore at:
Dataset updated
Feb 2025
Dataset provided by
IMR Market Reports
Authors
Swati Kalagate; Akshay Patil; Vishal Kumbhar
License
https://www.imrmarketreports.com/privacy-policy/https://www.imrmarketreports.com/privacy-policy/
Description
The Machine Learning for Crop Yield Prediction report features an extensive regional analysis, identifying market penetration levels across major geographic areas. It highlights regional growth trends and opportunities, allowing businesses to tailor their market entry strategies and maximize growth in specific regions.
Wild Blueberry Yield Prediction
kaggle.com
zip
Updated Feb 26, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saurabh Shahane (2021). Wild Blueberry Yield Prediction [Dataset]. https://www.kaggle.com/saurabhshahane/wild-blueberry-yield-prediction
Explore at:
zip(361235 bytes)Available download formats
Dataset updated
Feb 26, 2021
Authors
Saurabh Shahane
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Context

A number of research is underway in the agricultural sector to better predict crop yield using machine learning algorithms. Many machine learning algorithms require large amounts of data in order to give useful results. One of the major challenges in training and experimenting with machine learning algorithms is the availability of training data in sufficient quality and quantity remains a limiting factor. In the paper, “Wild blueberry yield prediction using a combination of computer simulation and machine learning algorithms”, we used dataset generated by the Wild Blueberry Pollination Model, a spatially explicit simulation model validated by field observation and experimental data collected in Maine USA during the last 30 years. The blueberry yields predictive models require data that sufficiently characterize the influence of plant spatial traits, bee species composition, and weather conditions on production. In a multi-step process, we designed simulation experiments and conducted the runs on the calibrated version of the blueberry simulation model. The simulated dataset was then examined, and important features were selected to build four machine-learning-based predictive models. This simulated data provides researchers who have actual data collected from field observation and those who wants to experiment the potential of machine learning algorithms response to real data and computer simulation modelling generated data as input for crop yield prediction models.

Acknowledgements

Qu, Hongchun; Obsie, Efrem; Drummond, Frank (2020), “Data for: Wild blueberry yield prediction using a combination of computer simulation and machine learning algorithms”, Mendeley Data, V1, doi: 10.17632/p5hvjzsvn8.1
Crop Yield Prediction
kaggle.com
Updated Sep 17, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aya Shabbar (2021). Crop Yield Prediction [Dataset]. https://www.kaggle.com/ayashabbar/crop-yield-prediction/metadata
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 17, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aya Shabbar
Description
Dataset

This dataset was created by Aya Shabbar

Contents
AI in agriculture: AI and traditional: crop yield forecasts accuracy rate...
statista.com
Updated Jul 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). AI in agriculture: AI and traditional: crop yield forecasts accuracy rate 2025 [Dataset]. https://www.statista.com/statistics/1619458/ai-crop-yield-forecasts-accuracy-rate/
Explore at:
Dataset updated
Jul 23, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2025
Area covered
Worldwide
Description
On average, AI in agriculture was estimated to significantly increase the crop yield prediction accuracy. It was estimated that AI can predict crop yield by ** percent accuracy rate.

Facebook

Twitter

Click to copy link

Link copied

Cite

Agricultural Research Service (2025). Estimated spring crop yields using Flex Cropping Tool [Dataset]. https://catalog.data.gov/dataset/estimated-spring-crop-yields-using-flex-cropping-tool-fdbfd

Data from: Estimated spring crop yields using Flex Cropping Tool

Explore at:

Dataset updated

Apr 21, 2025

Dataset provided by

Agricultural Research Servicehttps://www.ars.usda.gov/

Description

Average estimated yields and associated CV values for current (2018) model runs. Based on work done by Harsimran Kaur et al in 2017. The following is from her thesis: Agro-ecological classes (AECs) of dryland cropping systems in the inland Pacific Northwest have been predicted to become more dynamic with greater use of annual fallow under projected climate change. At the same time, initiatives are being taken by growers either to intensify or diversify their cropping systems using oilseed and grain legume crops. The main objective of this study was to use a mechanistic model (CropSyst) to provide yield and soil water forecasts at regional scales which could compare fallow versus spring crop choices (flex/opportunity crop). Model simulations were based on historic weather data (1981-2010) as well as combined with actual year weather data for simulations at pre-planting dates starting in Dec. for representative years. Yield forecasts of spring pea, canola and wheat were compared to yield simulations using only weather of the representative year via linear regression analysis to assess pre-plant forecasts. Crop yield projections on pre-plant forecast date of Feb 1st had higher R2 with yield simulated using actual years weather data and lower CVs across the region as compared to forecasts based on historic weather data and other pre-season forecast dates (Dec. 1st and Jan. 1st). Therefore, Feb. 1st was considered the most reliable time to predict yield and other relevant outputs such as available water forecasts on a regional scale. Regional forecast maps of predicted spring crop yields and CVs showed ranges of 1 to 4367 kg/ha and 11 to 293% for spring canola, 72 to 2646 kg/ha and 11 to 143% for spring pea and 39 to 5330 kg/ha and 11 to 158% for spring wheat across study region for a representative year. These data combined with predicted available water after fallow and following spring crop yield as well as estimates of winter wheat yield reduction would collectively serve as information contributing to decisions related to crop intensification and diversification. Resources in this dataset:Resource Title: GeoData catalog record. File Name: Web Page, url: https://geodata.nal.usda.gov/geonetwork/srv/eng/catalog.search#/metadata/459d2dba-a346-4e54-9750-ef3178c18f38

Clear search

Close search

Google apps

Main menu

Data from: Estimated spring crop yields using Flex Cropping Tool

‘Crop Yield Prediction Dataset’ analyzed by Analyst-2

Context

Acknowledgements

Sample data for "Machine learning for large-scale forecasting"

Drone-based-Agricultural-Dataset-for-Crop-Yield-Estimation

Data from: Stacked Ensemble Model for Accurate Crop Yield Prediction Using...

Crop Yield Data India

Crop Yield Forecasting Report

Machine Learning For Crop Yield Prediction Global Market Report 2025

Crop Yield Forecast Variance Log

Crop Yield Prediction using Soil and Weather

Crop Yield Forecasting Report

Table_1_Using machine learning for crop yield prediction in the past or the...

GPS Coordinates of 18,482 Crop Fields in East Africa with Improved Accuracy...

Crop Yield Forecasting Report

california-crop-yield-benchmark

Data from: CY-Bench: A comprehensive benchmark dataset for subnational crop...

CY-Bench: A comprehensive benchmark dataset for sub-national crop yield forecasting

Overview

Data

Data format

Data content

Folder structure

Example : CSV data content for maize in country X

Data access

License and citation

Machine Learning for Crop Yield Prediction Market - Global Size & Upcoming...

Wild Blueberry Yield Prediction

Context

Acknowledgements

Crop Yield Prediction

Dataset

Contents

AI in agriculture: AI and traditional: crop yield forecasts accuracy rate...

Data from: Estimated spring crop yields using Flex Cropping ToolSee More Versions

Data from: Estimated spring crop yields using Flex Cropping Tool