Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 22:2021:9846158.
doi: 10.34133/2021/9846158. eCollection 2021.

Global Wheat Head Detection 2021: An Improved Dataset for Benchmarking Wheat Head Detection Methods

Affiliations

Global Wheat Head Detection 2021: An Improved Dataset for Benchmarking Wheat Head Detection Methods

Etienne David et al. Plant Phenomics. .

Abstract

The Global Wheat Head Detection (GWHD) dataset was created in 2020 and has assembled 193,634 labelled wheat heads from 4700 RGB images acquired from various acquisition platforms and 7 countries/institutions. With an associated competition hosted in Kaggle, GWHD_2020 has successfully attracted attention from both the computer vision and agricultural science communities. From this first experience, a few avenues for improvements have been identified regarding data size, head diversity, and label reliability. To address these issues, the 2020 dataset has been reexamined, relabeled, and complemented by adding 1722 images from 5 additional countries, allowing for 81,553 additional wheat heads. We now release in 2021 a new version of the Global Wheat Head Detection dataset, which is bigger, more diverse, and less noisy than the GWHD_2020 version.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there is no conflict of interest regarding the publication of this article.

Figures

Figure 1
Figure 1
Sample images of the Global Wheat Head Detection 2021. The blue boxes correspond to the interactively labeled heads.
Figure 2
Figure 2
Distribution of the development stage. The x-axis presents the number of subdataset per development stage.
Figure 3
Figure 3
Distribution of the images in the two first dimensions defined by the UMAP algorithm for the GWHD 2021 dataset. The additional subdatasets as well as the training and test datasets from GWHD_2020 are represented by colors.

References

    1. Sambasivan N., Kapania S., Highfill H., Akrong D., Paritosh P., Aroyo L. M. Everyone wants to do the model work, not the data work: data cascades in high-stakes AI. New York, NY, USA: 2021. - DOI
    1. Deng J., Dong W., Socher R., Li L.-J., Li K., Fei-Fei L. ImageNet: a large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition; 2009; pp. 248–255.
    1. Lin T.-Y., et al. Microsoft coco: common objects in context. European conference on computer vision. 2014. pp. 740–755.
    1. Cruz J. A., Yin X., Liu X., et al. Multi-modality imagery database for plant phenotyping. Machine Vision and Applications. 2016;27(5):735–749. doi: 10.1007/s00138-015-0734-6. - DOI
    1. Guo W., Zheng B., Potgieter A. B., et al. Aerial imagery analysis – quantifying appearance and number of sorghum heads for applications in breeding and agronomy. Frontiers in Plant Science. 2018;9:p. 1544. doi: 10.3389/fpls.2018.01544. - DOI - PMC - PubMed

LinkOut - more resources