Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov 9;14(1):27406.
doi: 10.1038/s41598-024-74612-0.

Advancing mango leaf variant identification with a robust multi-layer perceptron model

Affiliations

Advancing mango leaf variant identification with a robust multi-layer perceptron model

Md Fahim-Ul-Islam et al. Sci Rep. .

Abstract

Mango, often regarded as the "king of fruits," holds a significant position in Bangladesh's agricultural landscape due to its popularity among the general population. However, identifying different types of mangoes, especially from mango leaves, poses a challenge for most people. While some studies have focused on mango type identification using fruit images, limited work has been done on classifying mango types based on leaf images. Early identification of mango types through leaf analysis is crucial for taking proactive steps in the cultivation process. This research introduces a novel multi-layer perceptron model called WaveVisionNet, designed to address this challenge using mango leaf datasets collected from five regions in Bangladesh. The MangoFolioBD dataset, comprising 16,646 annotated high-resolution images of mango leaves, has been curated and augmented to enhance robustness in real-world conditions. To validate the model, WaveVisionNet is evaluated on both the publicly available dataset and the MangoFolioBD dataset, achieving accuracy rates of 96.11% and 95.21%, respectively, outperforming state-of-the-art models such as Vision Transformer and transfer learning models. The model effectively combines the strengths of lightweight Convolutional Neural Networks and noise-resistant techniques, allowing for accurate analysis of mango leaf images while minimizing the impact of noise and environmental factors. The application of the WaveVisionNet model for automated mango leaf identification offers significant benefits to farmers, agricultural experts, agri-tech companies, government agencies, and consumers by enabling precise diagnosis of plant health, enhancing agricultural practices, and ultimately improving crop yields and quality.

Keywords: Agricultural AI; Mango leaf identification; MangoFolioBD dataset; Multi-layer perceptron (MLP); Noise-resistant image analysis; WaveVisionNet.

PubMed Disclaimer

Figures

Fig.1
Fig.1
Medicinal leaf dataset sample images (collected online).
Fig. 2
Fig. 2
Locations of mango orchards; (a) SOAS agro, (Feni) (b) Bangladesh Agricultural Research Institute (Rangpur), (c) Brac nursery (Demra), (d) Rajshahi division, (e) Munshiganj in Bangladesh selected for data collection.
Fig. 3
Fig. 3
Geographic locations of mango orchards in Bangladesh selected for data collection. The map was adapted from the disaster and climate risk information platform (DRIP), developed by the Center for Environmental and Geographic Information Services (CEGIS) under the Government of the People’s Republic of Bangladesh. Reproduced with permission. ©Government of the People’s Republic of Bangladesh. All Rights Reserved http://drip.plancomm.gov.bd/BasicMap/MapViewer.
Fig. 4
Fig. 4
Data collection process ensuring leaf sample accuracy and cleanliness. Visual examination, dusting, washing, and controlled drying techniques were employed.
Fig. 5
Fig. 5
Sample Images of our MangoFolioBD dataset (primary data).
Fig. 6
Fig. 6
Data augmentation and noise addition pipeline for our MangoFolioBD dataset.
Fig. 7
Fig. 7
The overall technical path of this study and WaveVisionNet architecture.
Fig. 8
Fig. 8
The working mechanism and receptive field of (a), ( b) dialated convolution (DC) and HDC with dilation rates of (3,3) and (2,2).
Fig. 9
Fig. 9
Schematic overview of WaveVisionNet architecture for mango leaf categorization. The model enhances classification tasks by incorporating hierarchical feature extraction, attention mechanisms, and positional encoding, with self-attention applied initially and hybrid dilated convolution for enhanced receptive field.
Fig. 10
Fig. 10
(a) ROC curves illustrating the performance of the top three models. (b) Confusion matrix depicting the assessment of our WaveVisionNet model across training iterations on the MangoFolioBD dataset.
Fig. 11
Fig. 11
(a) Validation accuracy curves showcasing the performance of all models utilizing the WaveVisionNet model architecture on the public dataset. (b) Validation accuracy curves illustrating the performance of all models utilizing the WaveVisionNet model architecture on the MangoFolioBD dataset.
Fig. 12
Fig. 12
Overall performance metrics evaluation on medicinal leaves dataset (a) (public) and MangoFolioBD dataset (b) (private).
Fig. 13
Fig. 13
Robustness measurement of different metrics of our proposed WaveVisionNet model in with noise and without noise condition.
Fig. 14
Fig. 14
(ac) Output probabilities displayed by our WaveVisionNet model during the pretraining phase on the public dataset. (df) Output probabilities depicted by our WaveVisionNet model for accurate performance on the MangoFolio dataset, utilizing validation data directory.
Fig. 15
Fig. 15
t-SNE visualizations of the MangoFolio dataset with 26 unique classes, highlighting insights into the model’s learned representation of high-dimensional data across different learning rates (0.5, 0.6, 0.7).
Fig. 16
Fig. 16
Heatmaps generated using Grad-CAM technique from our proposed WaveVisionNet model. Heatmap corresponding to the first convolutional layer and final convolutional layer. These heatmaps highlight the regions in the input image that significantly influence the model’s predictions, providing insights into the model’s decision-making process.

References

    1. Nkoko, N., Cronje, N. & Swanepoel, J. W. Factors associated with food security among small-holder farming households in Lesotho. Agric. Food Secur.13, 1–10. 10.1186/s40066-023-00454-0 (2024).
    1. Vuppalapati, C. Specialty Crop Mangoes. In Specialty Crops for Climate Change Adaptation: Strategies for Enhanced Food Security by Using Machine Learning and Artificial Intelligence 419–514 (Springer Nature Switzerland, Cham, 2023). 10.1007/978-3-031-38399-1_5.
    1. Rizvee, R. A. et al. LeafNet: A proficient convolutional neural network for detecting seven prominent mango leaf diseases. J. Agric. Food Res.14, 1–13 (2023).
    1. Aggarwal, K. et al. Studies on energy efficient techniques for agricultural monitoring by wireless sensor networks. Comput. Electr. Eng.113, 109052 (2024).
    1. Zhong, X., Zhang, M., Tang, T., Adhikari, B. & Ma, Y. Advances in intelligent detection, monitoring, and control for preserving the quality of fresh fruits and vegetables in the supply chain. Food Biosci.56, 103350 (2023).

MeSH terms

LinkOut - more resources