Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 12;31(4):483-495.
doi: 10.1007/s10068-022-01041-y. eCollection 2022 Apr.

In silico prediction models for thyroid peroxidase inhibitors and their application to synthetic flavors

Affiliations

In silico prediction models for thyroid peroxidase inhibitors and their application to synthetic flavors

Mihyun Seo et al. Food Sci Biotechnol. .

Abstract

Systematic toxicity tests are often waived for the synthetic flavors as they are added in a very small amount in foods. However, their safety for some endpoints such as endocrine disruption should be concerned as they are likely to be active in low levels. In this case, structure-activity-relationship (SAR) models are good alternatives. In this study, therefore, binary, ternary, and quaternary prediction models were designed using simple or complex machine-learning methods. Overall, hard-voting classifiers outperformed other methods. The test scores for the best binary, ternary, and quaternary models were 0.6635, 0.5083, and 0.5217, respectively. Along with model development, some substructures including primary aromatic amine, (enol)ether, phenol, heterocyclic sulfur, and heterocyclic nitrogen, dominantly occurred in the most highly active compounds. The best predicting models were applied to synthetic flavors, and 22 agents appeared to have a strong inhibitory potential towards TPO activities.

Supplementary information: The online version contains supplementary material available at 10.1007/s10068-022-01041-y.

Keywords: Machine learning; Quantitative structure–activity relationship (QSAR); Synthetic flavor; Thyroid peroxidase inhibitor (TPO); Toxicity prediction.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe authors declare no conflicts of interest.

Figures

Fig. 1
Fig. 1
Cross-validation (CV) and test scores for each feature, learning method, and grouping method (binary, ternary, or quaternary). For each grouping method, the black bar marked above with an asterisk indicates the test scores of the best-performing model. Each label is shown in “model name_feature extraction method (or voting methods in voting classifier)” form. [RF: Random forest; SVM: Support vector machine; ANN: Artificial neural network; AdaB: Adaptive boosting; XGB: Extreme gradient boosting; PCA: Principal component analysis; LDA: Linear discriminant analysis]
Fig. 2
Fig. 2
Confusion matrices for the best-performing models of each grouping. The color intensity of each cell represents the proportion of the number of compounds predicted to be a class with respect to the actual number of compounds in a class
Fig. 3
Fig. 3
Chemical space analysis between the input data for model development and synthetic flavors (red dots: synthetic flavors; green dots: input data for model development)

Similar articles

References

    1. Ai H, Wu X, Zhang L, Qi M, Zhao Y, Zhao Q, Zhao J, Liu H. QSAR modelling study of the bioconcentration factor and toxicity of organic compounds to aquatic organisms using machine learning and ensemble methods. Ecotoxicology and Environmental Safety. 2019;179:71–78. doi: 10.1016/j.ecoenv.2019.04.035. - DOI - PubMed
    1. Auld DS, Southall NT, Jadhav A, Johnson RL, Diller DJ, Simeonov A, Austin CP, Inglese J. Characterization of chemical libraries for luciferase inhibitory activity. Journal of Medical Chemistry. 2008;51:2372–2386. doi: 10.1021/jm701302v. - DOI - PubMed
    1. Blagus R, Lusa L. SMOTE for high-dimensional class-imbalanced data. BMC Bioinformatics. 2013;14:106. doi: 10.1186/1471-2105-14-106. - DOI - PMC - PubMed
    1. Capecchi A, Probst D, Reymond J. One molecular fingerprint to rule them all: drugs, biomolecules, and the metabolome. Journal of Cheminformatics. 12: 43 (2020) - PMC - PubMed
    1. Carvalho DP, Ferreira AC, Coelho SM, Moraes JM, Camacho MA, Rosenthal D. Thyroid peroxidase activity is inhibited by amino acids. Brazilian Journal of Medical and Biological Research. 2000;33(3):355–361. doi: 10.1590/S0100-879X2000000300015. - DOI - PubMed

LinkOut - more resources