https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset is modified from the Wheat-Ears-Detection-Dataset to the format of the dataset provided by the Global Wheat Detection (https://www.kaggle.com/c/global-wheat-detection). This dataset can be easily used in any kernels for the Global Wheat Detection as an external data source.
https://github.com/simonMadec/Wheat-Ears-Detection-Dataset Madec, S., Jin, X., Lu, H., De Solan, B., Liu, S., Duyme, F., et al. (2019). Ear density estimation from high resolution RGB imagery using deep learning technique. Agric. For. Meteorol. 264, 225–234. doi:10.1016/j.agrformet.2018.10.013.
The original dataset was posted by the authors above. It has 236 6000px*4000px images. We cut each image to 6 3000px*3000px images with strides of 1500px for width and 1000px for height. Then we resize them to 1024px*1024px. This dataset is compatible with the dataset provided by the Global Wheat Detection (https://www.kaggle.com/c/global-wheat-detection). The size of bounding boxes are similar as well due to the resize operation.
train.csv - the training data train - (folder) training images
image_id - the unique image ID width, height - the width and height of the images bbox - a bounding box, formatted as a Python-style list of [xmin, ymin, width, height] etc.
See https://www.kaggle.com/c/global-wheat-detection/data for more information.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the full Global Wheat Head Dataset 2021. Labels are included in csv.
Tutorials available here: https://www.aicrowd.com/challenges/global-wheat-challenge-2021
🕵️ Introduction
Wheat is the basis of the diet of a large part of humanity. Therefore, this cereal is widely studied by scientists to ensure food security. A tedious, yet important part of this research is the measurement of different characteristics of the plants, also known as Plant Phenotyping. Monitoring plant architectural characteristics allow the breeders to grow better varieties and the farmers to make better decisions, but this critical step is still done manually. The emergence of UAV, camera and smartphone makes in-field RGB images more available and could be a solution to manual measurement. For instance, the counting of the wheat head can be done with Deep Learning. However, this task can be visually challenging. There is often an overlap of dense wheat plants, and the wind can blur the photographs, making identify single heads difficult. Additionally, appearances vary due to maturity, colour, genotype, and head orientation. Finally, because wheat is grown worldwide, different varieties, planting densities, patterns, and field conditions must be considered. To end manual counting, a robust algorithm must be created to address all these issues.
💾 Dataset
The dataset is composed of more than 6000 images of 1024x1024 pixels containing 300k+ unique wheat heads, with the corresponding bounding boxes. The images come from 11 countries and covers 44 unique measurement sessions. A measurement session is a set of images acquired at the same location, during a coherent timestamp (usually a few hours), with a specific sensor. In comparison to the 2020 competition on Kaggle, it represents 4 new countries, 22 new measurements sessions, 1200 new images and 120k new wheat heads. This amount of new situations will help to reinforce the quality of the test dataset. The 2020 dataset was labelled by researchers and students from 9 institutions across 7 countries. The additional data have been labelled by Human in the Loop, an ethical AI labelling company. We hope these changes will help in finding the most robust algorithms possible!
The task is to localize the wheat head contained in each image. The goal is to obtain a model which is robust to variation in shape, illumination, sensor and locations. A set of boxes coordinates is provided for each image.
The training dataset will be the images acquired in Europe and Canada, which cover approximately 4000 images and the test dataset will be composed of the images from North America (except Canada), Asia, Oceania and Africa and covers approximately 2000 images. It represents 7 new measurements sessions available for training but 17 new measurements sessions for the test!
📁 Files
Following files are available in the resources
section:
images: the folder contains all images
competition_train.csv , competition_val.csv, competition_test.csv : contains the splits used for the 2021 Global Wheat Challenge
Val contains the "public test", which is the test set of Global Wheat Head 2020
Test contains the "private test".
Metadata.csv : contains additional metadatas for each domain
💻 Labels
image_name
, BoxesString and domainimage_name
is the name of the image, without the suffix. All images have a .png extension
If you use the dataset for your research, please do not forget to quote:
@article{david2020global, title={Global Wheat Head Detection (GWHD) dataset: a large and diverse dataset of high-resolution RGB-labelled images to develop and benchmark wheat head detection methods}, author={David, Etienne and Madec, Simon and Sadeghi-Tehran, Pouria and Aasen, Helge and Zheng, Bangyou and Liu, Shouyang and Kirchgessner, Norbert and Ishikawa, Goro and Nagasawa, Koichi and Badhon, Minhajul A and others}, journal={Plant Phenomics}, volume={2020}, year={2020}, publisher={Science Partner Journal} }
@misc{david2021global,
title={Global Wheat Head Dataset 2021: more diversity to improve the benchmarking of wheat head localization methods},
author={Etienne David and Mario Serouart and Daniel Smith and Simon Madec and Kaaviya Velumani and Shouyang Liu and Xu Wang and Francisco Pinto Espinosa and Shahameh Shafiee and Izzat S. A. Tahir and Hisashi Tsujimoto and Shuhei Nasuda and Bangyou Zheng and Norbert Kichgessner and Helge Aasen and Andreas Hund and Pouria Sadhegi-Tehran and Koichi Nagasawa and Goro Ishikawa and Sébastien Dandrifosse and Alexis Carlier and Benoit Mercatoris and Ken Kuroki and Haozhou Wang and Masanori Ishii and Minhajul A. Badhon and Curtis Pozniak and David Shaner LeBauer and Morten Lilimo and Jesse Poland and Scott Chapman and Benoit de Solan and Frédéric Baret and Ian Stavness and Wei Guo},
year={2021},
eprint={2105.07660},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset is extracted from https://en.wikipedia.org/wiki/List_of_countries_by_wheat_exports. Context: There s a story behind every dataset and heres your opportunity to share yours.Content: What s inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too. Acknowledgements:We wouldn t be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.Inspiration: Your data will be in front of the world s largest data science community. What questions do you want to see answered?
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset is modified from the Wheat-Ears-Detection-Dataset to the format of the dataset provided by the Global Wheat Detection (https://www.kaggle.com/c/global-wheat-detection). This dataset can be easily used in any kernels for the Global Wheat Detection as an external data source.
https://github.com/simonMadec/Wheat-Ears-Detection-Dataset Madec, S., Jin, X., Lu, H., De Solan, B., Liu, S., Duyme, F., et al. (2019). Ear density estimation from high resolution RGB imagery using deep learning technique. Agric. For. Meteorol. 264, 225–234. doi:10.1016/j.agrformet.2018.10.013.
The original dataset was posted by the authors above. It has 236 6000px*4000px images. We cut each image to 6 3000px*3000px images with strides of 1500px for width and 1000px for height. Then we resize them to 1024px*1024px. This dataset is compatible with the dataset provided by the Global Wheat Detection (https://www.kaggle.com/c/global-wheat-detection). The size of bounding boxes are similar as well due to the resize operation.
train.csv - the training data train - (folder) training images
image_id - the unique image ID width, height - the width and height of the images bbox - a bounding box, formatted as a Python-style list of [xmin, ymin, width, height] etc.
See https://www.kaggle.com/c/global-wheat-detection/data for more information.