Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 8;14(1):8252.
doi: 10.1038/s41598-024-58394-z.

Inferring molecular inhibition potency with AlphaFold predicted structures

Affiliations

Inferring molecular inhibition potency with AlphaFold predicted structures

Pedro F Oliveira et al. Sci Rep. .

Erratum in

Abstract

Even though in silico drug ligand-based methods have been successful in predicting interactions with known target proteins, they struggle with new, unassessed targets. To address this challenge, we propose an approach that integrates structural data from AlphaFold 2 predicted protein structures into machine learning models. Our method extracts 3D structural protein fingerprints and combines them with ligand structural data to train a single machine learning model. This model captures the relationship between ligand properties and the unique structural features of various target proteins, enabling predictions for never before tested molecules and protein targets. To assess our model, we used a dataset of 144 Human G-protein Coupled Receptors (GPCRs) with over 140,000 measured inhibition constants (Ki) values. Results strongly suggest that our approach performs as well as state-of-the-art ligand-based methods. In a second modeling approach that used 129 targets for training and a separate test set of 15 different protein targets, our model correctly predicted interactions for 73% of targets, with explained variances exceeding 0.50 in 22% of cases. Our findings further verified that the usage of experimentally determined protein structures produced models that were statistically indistinct from the Alphafold synthetic structures. This study presents a proteo-chemometric drug screening approach that uses a simple and scalable method for extracting protein structural information for usage in machine learning models capable of predicting protein-molecule interactions even for orphan targets.

Keywords: In silico drug discovery; Machine learning; Protein structure; Proteo-chemometrics; Quantitative structure-activity relationship modeling (QSAR); Structure based virtual screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Visual representation of the traditional QSAR approach (a) where the descriptors of the molecules with known activities are used to create a model capable of predicting activities for untested molecules. On the left (b), is a visual representation of the proposed proteo-chemometric methodology where fingerprints for multiple targets and molecules are used to create the model.
Figure 2
Figure 2
Generating proteins structural fingerprints from close amino acids.
Figure 3
Figure 3
The three modeling approaches followed. (A) Baseline individual QSAR models without any target based information; (B) Unified model with both ligand and target structure data with randomly selected data from all targets for validation; (C) Unified Blind model built with data from a subset of targets using data from an unknown target for validation.
Figure 4
Figure 4
Comparing validation results of Baseline QSAR models and Unified Model for Root Mean Squared Error (RMSE) and Ratio of Variance Explained (RVE) for 144 targets.

References

    1. Shaker B, Ahmad S, Lee J, Jung C, Na D. In silico methods and tools for drug discovery. Comput. Biol. Med. 2021;137:104851. doi: 10.1016/j.compbiomed.2021.104851. - DOI - PubMed
    1. Cui W, Aouidate A, Wang S, Qiuliyang Yu, Li Y, Yuan S. Discovering anti-cancer drugs via computational methods. Front. Pharmacol. 2020;11:733. doi: 10.3389/fphar.2020.00733. - DOI - PMC - PubMed
    1. Sabe VT, Ntombela T, Jhamba LA, Maguire GEM, Govender T, Naicker T, Kruger HG. Current trends in computer aided drug design and a highlight of drugs discovered via computational techniques: A review. Eur. J. Med. Chem. 2021;224(113705):113705. doi: 10.1016/j.ejmech.2021.113705. - DOI - PubMed
    1. Brogi S, Ramalho TC, Kuca K, Medina-Franco JL, Valko M. In silico methods for drug design and discovery. Front. Chem. 2020;8:612. doi: 10.3389/fchem.2020.00612. - DOI - PMC - PubMed
    1. Meng X-Y, Zhang H-X, Mezei M, Cui M. Molecular docking: A powerful approach for structure-based drug discovery. Curr. Comput. Aided Drug Des. 2011;7(2):146–157. doi: 10.2174/157340911795677602. - DOI - PMC - PubMed