Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov 1:57:111097.
doi: 10.1016/j.dib.2024.111097. eCollection 2024 Dec.

A novel automated cloud-based image datasets for high throughput phenotyping in weed classification

Affiliations

A novel automated cloud-based image datasets for high throughput phenotyping in weed classification

Sunil G C et al. Data Brief. .

Abstract

Deep learning-based weed detection data management involves data acquisition, data labeling, model development, and model evaluation phases. Out of these data management phases, data acquisition and data labeling are labor-intensive and time-consuming steps for building robust models. In addition, low temporal variation of crop and weed in the datasets is one of the limiting factors for effective weed detection model development. This article describes the cloud-based automatic data acquisition system (CADAS) to capture the weed and crop images in fixed time intervals to take plant growth stages into account for weed identification. The CADAS was developed by integrating fifteen digital cameras in the visible spectrum with gphoto2 libraries, external storage, cloud storage, and a computer with Linux operating system. Dataset from CADAS system contain six weed species and eight crop species for weed and crop detection. A dataset of 2000 images per weed and crop species was publicly released. Raw RGB images underwent a cropping process guided by bounding box annotations to generate individual JPG images for crop and weed instances. In addition to cropped image 200 raw images with label files were released publicly. This dataset hold potential for investigating challenges in deep learning-based weed and crop detection in agricultural settings. Additionally, this data could be used by researcher along with field data to boost the model performance by reducing data imbalance problem.

Keywords: Automated data acquisition; Cloud computing; Computer vision; Deep learning; Weed and crop detection.

PubMed Disclaimer

Figures

Fig 1
Fig. 1
Folder structure of the data repository that is publicly available.
Fig 2
Fig. 2
Automated image acquisition system components: a) Canon EOS Rebel 7T, b) Canon EOS 90D, c) Desktop computer with Linux OS, d) AC battery adapters for continuous power, e) LCD display f) Data storage disk g) USB hubs and h) power extension cords.
Fig 3
Fig. 3
Block diagram showing automated image acquisition system with its component; external storage device, cloud, desktop with ubuntu 20.04, and canon camera; arrow directions show the direction of image flow from the camera.
Fig 4
Fig. 4
Automated image acquisition system greenhouse setup: a) Greenhouse benches with weed and crop pots, b) digital camera, c) Desktop computer with Linux OS attached to an external storage drive.
Fig 5
Fig. 5
Flowchart showing the Linux bash script algorithm for automatic image acquisition.
Fig 6
Fig. 6
Flowchart illustrating the process of extracting a cropped image containing six weeds and eight crops from the original image.
Fig 7
Fig. 7
Flowchart depicting the methodology for cropping raw images to extract individual object classes, including relevant mathematical formulations.

References

    1. Upadhyay A., S. G C. Zhang Y., Koparan C., Sun X. Development and evaluation of a machine vision and deep learning-based smart sprayer system for site-specific weed management in row crops: an edge computing approach. J. Agric. Food Res. 2024;18 doi: 10.1016/J.JAFR.2024.101331. - DOI
    1. GC S., Zhang Y., Howatt K., Schumacher L.G., Sun X. Multi-species weed and crop classification comparison using five different deep learning network architectures. J. ASABE. 2024;67(2):43–55. doi: 10.13031/ja.15590. - DOI
    1. S. G C. Koparan C., Ahmed M.R., Zhang Y., Howatt K., Sun X. A study on deep learning algorithm performance on weed and crop species identification under different image background. Artif. Intell. Agric. 2022;6:242–256. doi: 10.1016/J.AIIA.2022.11.001. - DOI
    1. T. Lin, “LabelImg,” Online: https://github.com/tzutalin/labelImg, 2015.

LinkOut - more resources