Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 1:14:1283127.
doi: 10.3389/fmicb.2023.1283127. eCollection 2023.

Dynamic geospatial modeling of mycotoxin contamination of corn in Illinois: unveiling critical factors and predictive insights with machine learning

Affiliations

Dynamic geospatial modeling of mycotoxin contamination of corn in Illinois: unveiling critical factors and predictive insights with machine learning

Lina Castano-Duque et al. Front Microbiol. .

Abstract

Mycotoxin contamination of corn is a pervasive problem that negatively impacts human and animal health and causes economic losses to the agricultural industry worldwide. Historical aflatoxin (AFL) and fumonisin (FUM) mycotoxin contamination data of corn, daily weather data, satellite data, dynamic geospatial soil properties, and land usage parameters were modeled to identify factors significantly contributing to the outbreaks of mycotoxin contamination of corn grown in Illinois (IL), AFL >20 ppb, and FUM >5 ppm. Two methods were used: a gradient boosting machine (GBM) and a neural network (NN). Both the GBM and NN models were dynamic at a state-county geospatial level because they used GPS coordinates of the counties linked to soil properties. GBM identified temperature and precipitation prior to sowing as significant influential factors contributing to high AFL and FUM contamination. AFL-GBM showed that a higher aflatoxin risk index (ARI) in January, March, July, and November led to higher AFL contamination in the southern regions of IL. Higher values of corn-specific normalized difference vegetation index (NDVI) in July led to lower AFL contamination in Central and Southern IL, while higher wheat-specific NDVI values in February led to higher AFL. FUM-GBM showed that temperature in July and October, precipitation in February, and NDVI values in March are positively correlated with high contamination throughout IL. Furthermore, the dynamic geospatial models showed that soil characteristics were correlated with AFL and FUM contamination. Greater calcium carbonate content in soil was negatively correlated with AFL contamination, which was noticeable in Southern IL. Greater soil moisture and available water-holding capacity throughout Southern IL were positively correlated with high FUM contamination. The higher clay percentage in the northeastern areas of IL negatively correlated with FUM contamination. NN models showed high class-specific performance for 1-year predictive validation for AFL (73%) and FUM (85%), highlighting their accuracy for annual mycotoxin prediction. Our models revealed that soil, NDVI, year-specific weekly average precipitation, and temperature were the most important factors that correlated with mycotoxin contamination. These findings serve as reliable guidelines for future modeling efforts to identify novel data inputs for the prediction of AFL and FUM outbreaks and potential farm-level management practices.

Keywords: Aspergillus; Fusarium; aflatoxin; fumonisin; gradient boosting; machine learning; neural network; soil.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Figures

Figure 1
Figure 1
Pair-wise correlation analysis of the significant input variables influencing GBM and summary of the GBM model using multinomial mycotoxin outcome. (A) Correlation analysis of the features used for AFL modeling; (B) top 20 influential input features and their relative influence over the model in the prediction of AFL; (C) correlation analysis of the features used for FUM modeling; (D) top 20 influential input features and their relative influence over the model in the prediction of FUM. The correlation is depicted from positive (blue) to negative (red), with blank squares representing non-significant p-values of correlation between variables. For the correlation analysis, the p-value cutoff was 0.05, and the confidence level was 0.95. The blue hue in GBM plots represents the relative influence of the input variables, with light blue high and dark blue low influence levels. The green line is the testing set multinomial deviance loss, the black line is the training set multinomial deviance loss, and the blue dotted vertical line is the number of iterations used.
Figure 2
Figure 2
ARI relationship with AFL contamination levels and their geospatial distribution in IL from 2003 to 2021. (A) Average ARI in week 4 (January); (B) average ARI in week 11 (March); (C) average ARI in week 27 (July); (D) average ARI 45 (November). Box–Whiskers plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile) and third (75th percentile) quantiles distribution; For AFL, high is >20 ppb and low ≤20 ppb. The violin plot is shaded in red and depicts the density distribution of the soil property in low and high levels of mycotoxin contamination; and the gray dots depict each data point. Maps are shaded in red in relation to the ARI value for a specific week in each year; and the y-axis is latitude and the x-axis is longitude.
Figure 3
Figure 3
Crop-specific NDVI relationship with AFL contamination levels and their spatial distribution in Illinois from 2003 to 2021. (A) Corn NDVI in week 30 (July); (B) wheat NDVI in week 5 (February). Box–Whisker plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile), and third (75th percentile) quantiles distribution. For AFL, high is >20 ppb and low ≤20 ppb. The violin plot is shaded in red and depicts the density distribution of the soil property in low and high levels of mycotoxin contamination. The gray dots depict each data point. Maps show the average NDVI values for corn (green) and wheat (yellow) for specific weeks each year.
Figure 4
Figure 4
Calcium carbonate (CaCO3 Kg/m2) relationship with (A) AFL contamination levels from 2003 to 2021 and its (B) spatial distribution in IL. Box–Whisker plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile), and third (75th percentile) quantiles distribution. For AFL, high is >20 ppb and low ≤20 ppb. The violin plot is shaded in red and depicts the density distribution of calcium carbonate in low and elevated levels of mycotoxin contamination; and the gray dots depict each data point. Maps show the weight percentage of calcium carbonate values. For color legend, a non-linear ramp was used.
Figure 5
Figure 5
Temperature, precipitation, and veg. index relationships with FUM contamination levels and their geospatial distribution in IL from 2003 to 2021. (A) Average temperature in week 31, July, (B) average temperature in week 44, October, (C) average precipitation in week 7, February, (D) average veg. index in week 11, March. Box–Whisker plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile) and third (75th percentile) quantiles distribution; For FUM, high is >5 ppm, and low ≤5 ppm. The violin plot is shaded in blue and depicts the density distribution of the temperature, precipitation, or veg. index in low and high levels of mycotoxin contamination; and the gray dots depict each data point. Maps of geography are shaded in relation to temperature (red), precipitation (blue), or vegetation index (green) values for specific weeks each year, the y-axis is latitude and the x-axis is longitude.
Figure 6
Figure 6
Crop-specific NDVI relationship with FUM contamination levels and their geospatial distribution in IL from 2003 to 2021. (A) Corn NDVI in week 5, February, (B) corn NDVI in week 19, May, (C) corn NDVI in week 42, October, (D) wheat NDVI in week 26, June. Box–Whisker plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile) and third (75th percentile) quantiles distribution; For FUM, high is >5 ppm, and low ≤5 ppm. The violin plot is shaded in blue and depicts the density distribution of the corn or wheat NDVI in low and high levels of mycotoxin contamination; and gray dots depict each data point. Maps of geography are shaded in relation to corn NDVI (green) or wheat NDVI (yellow) values for specific weeks in each year.
Figure 7
Figure 7
Soil properties relationship with FUM contamination levels and their geospatial distribution in IL. (A) soil moisture (m3 H2O/m3 soil), (B) available water-holding capacity (cm). Box–Whisker plot depicts the maximum (25th – 1.5 * interquartile range “IQR”) and minimum [75th percentile + 1.5 * interquartile range (IQR)], and the Box–Whisker plot depicts median, first (25th percentile) and third (75th percentile) quantiles distribution; (C) percentage of clay content from 0 to 5 cm below surface, (D) percentage of clay content from 25 to 50 cm below surface; For FUM, high is >5 ppm, and low ≤5 ppm. The violin plot is shaded in blue and depicts the density distribution of the soil properties in low and elevated levels of mycotoxin contamination; and the gray dots depict each data point. Maps of geographical shaded red in relation to soil properties values, scale is in miles. For color, legend of soil moisture, a linear ramp was used, and for available water-holding capacity, and clay content, and a non-linear ramp was used.

References

    1. Abdel-Hadi A., Schmidt-Heydt M., Parra R., Geisen R., Magan N. (2012). A. systems approach to model the relationship between aflatoxin gene cluster expression, environmental factors, growth and toxin production by Aspergillus flavus. J. R. Soc. Interface 9, 757–767. 10.1098/rsif.2011.0482 - DOI - PMC - PubMed
    1. Baechler M. C., Love P. M. (2004). Building America Best Practices Series: Volume 1; Builders and Buyers Handbook for Improving New Home Efficiency, Comfort, and Durability in the Hot and Humid Climate. Golden, CO, USA: National Renewable Energy Lab (NREL). 10.2172/15011705 - DOI
    1. Battilani P., Leggieri M. C., Rossi V., Giorni P. (2013). AFLA-maize, a mechanistic model for Aspergillus flavus infection and aflatoxin B1 contamination in maize. Comput. Electr. Agric. 94, 38–46. 10.1016/j.compag.2013.03.005 - DOI
    1. Battilani P., Toscano P., Van der Fels-Klerx H. J., Moretti A., Leggieri M., Brera C., et al. (2016). Aflatoxin B1 contamination in maize in Europe increases due to climate change. Sci. Rep. 6, 24328. 10.1038/srep24328 - DOI - PMC - PubMed
    1. Beck H. E., Zimmermann N. E., McVicar T. R., Vergopolan N., Berg A., Wood E. F., et al. (2018). Present and future Köppen-Geiger climate classification maps at 1-km resolution. Scient. Data 5, 180214. 10.1038/sdata.2018.214 - DOI - PMC - PubMed