Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 20;29(6):1381.
doi: 10.3390/molecules29061381.

Machine-Learning-Based Prediction of Plant Cuticle-Air Partition Coefficients for Organic Pollutants: Revealing Mechanisms from a Molecular Structure Perspective

Affiliations

Machine-Learning-Based Prediction of Plant Cuticle-Air Partition Coefficients for Organic Pollutants: Revealing Mechanisms from a Molecular Structure Perspective

Tianyun Tao et al. Molecules. .

Abstract

Accurately predicting plant cuticle-air partition coefficients (Kca) is essential for assessing the ecological risk of organic pollutants and elucidating their partitioning mechanisms. The current work collected 255 measured Kca values from 25 plant species and 106 compounds (dataset (I)) and averaged them to establish a dataset (dataset (II)) containing Kca values for 106 compounds. Machine-learning algorithms (multiple linear regression (MLR), multi-layer perceptron (MLP), k-nearest neighbors (KNN), and gradient-boosting decision tree (GBDT)) were applied to develop eight QSPR models for predicting Kca. The results showed that the developed models had a high goodness of fit, as well as good robustness and predictive performance. The GBDT-2 model (Radj2 = 0.925, QLOO2 = 0.756, QBOOT2 = 0.864, Rext2 = 0.837, Qext2 = 0.811, and CCC = 0.891) is recommended as the best model for predicting Kca due to its superior performance. Moreover, interpreting the GBDT-1 and GBDT-2 models based on the Shapley additive explanations (SHAP) method elucidated how molecular properties, such as molecular size, polarizability, and molecular complexity, affected the capacity of plant cuticles to adsorb organic pollutants in the air. The satisfactory performance of the developed models suggests that they have the potential for extensive applications in guiding the environmental fate of organic pollutants and promoting the progress of eco-friendly and sustainable chemical engineering.

Keywords: QSPR; machine learning; organic pollutants; plant cuticle–air partition coefficient.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Descriptor correlation graph of the QSPR models. (a) Datasets (I); (b) Datasets (II).
Figure 2
Figure 2
Plots of the observed versus predicted for log Kca based on dataset (I). (a) Training set; (b) Test set; Plots of the observed versus predicted for log Kca based on dataset (II). (c) Training set; (d) Test set.
Figure 3
Figure 3
Application domain characterized by Williams plots: the MLR-1 (a) and GBDT-2 (b) models for log Kca.
Figure 4
Figure 4
Relationship between SHAP value and the values of different descriptors for dataset (I) (a) and dataset (II) (b).
Figure 5
Figure 5
Distribution of experimental data of log Kca values. (a) Distribution of 255 experimental log Kca values for 106 compounds; (b) Distribution of average experimental log Kca values for 106 compounds.

References

    1. Talaiekhozani A., Rezania S., Kim K.-H., Sanaye R., Amani A.M. Recent advances in photocatalytic removal of organic and inorganic pollutants in air. J. Clean. Prod. 2021;278:123895. doi: 10.1016/j.jclepro.2020.123895. - DOI
    1. Welke B., Ettlinger K., Riederer M. Sorption of Volatile Organic Chemicals in Plant Surfaces. Environ. Sci. Technol. 1998;32:1099–1104. doi: 10.1021/es970763v. - DOI
    1. Li Q., Chen B. Organic Pollutant Clustered in the Plant Cuticular Membranes: Visualizing the Distribution of Phenanthrene in Leaf Cuticle Using Two-Photon Confocal Scanning Laser Microscopy. Environ. Sci. Technol. 2014;48:4774–4781. doi: 10.1021/es404976c. - DOI - PubMed
    1. Collins C.D., Finnegan E. Modeling the Plant Uptake of Organic Chemicals, Including the Soil−Air−Plant Pathway. Environ. Sci. Technol. 2010;44:998–1003. doi: 10.1021/es901941z. - DOI - PubMed
    1. Sabljic A., Guesten H., Schoenherr J., Riederer M. Modeling plant uptake of airborne organic chemicals. 1. Plant cuticle/water partitioning and molecular connectivity. Environ. Sci. Technol. 1990;24:1321–1326. doi: 10.1021/es00079a004. - DOI

MeSH terms

Substances

LinkOut - more resources