Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Oct 6;3(6):465-486.
doi: 10.1515/mr-2023-0030. eCollection 2023 Dec.

An overview of recent advances and challenges in predicting compound-protein interaction (CPI)

Affiliations
Review

An overview of recent advances and challenges in predicting compound-protein interaction (CPI)

Yanbei Li et al. Med Rev (2021). .

Abstract

Compound-protein interactions (CPIs) are critical in drug discovery for identifying therapeutic targets, drug side effects, and repurposing existing drugs. Machine learning (ML) algorithms have emerged as powerful tools for CPI prediction, offering notable advantages in cost-effectiveness and efficiency. This review provides an overview of recent advances in both structure-based and non-structure-based CPI prediction ML models, highlighting their performance and achievements. It also offers insights into CPI prediction-related datasets and evaluation benchmarks. Lastly, the article presents a comprehensive assessment of the current landscape of CPI prediction, elucidating the challenges faced and outlining emerging trends to advance the field.

Keywords: artificial intelligence; chemogenomics; compound-protein interaction prediction; drug discovery; scoring function.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Authors state no conflict of interest.

Figures

Figure 1:
Figure 1:
Categorization of compound-protein interaction (CPI) prediction models. FASTA, FAST-All; SMILES, Simplified molecular input line entry system.
Figure 2:
Figure 2:
Feature engineering-based strategies. (A) Specific energy terms. (B) Protein-ligand atom pairwise counts. (C) Protein-ligand interaction fingerprints. (D) Mathematical features.
Figure 3:
Figure 3:
Feature learning-based representation strategies. (A) Atom context-based features. (B) Grid-based features. (C) Graph-based features.
Figure 4:
Figure 4:
Non-structure-based CPI prediction methodology. (A) CPI prediction model pipeline; (B) architecture of convolutional neural network; (C) architecture of recurrent neural network; (D) architecture of graph neural network; (E) three common input types of models: proteins are input as sequence and small molecules are constructed as graphs; protein pockets and small molecules are constructed into different graphs; protein pockets and small molecules are converted into the same graphs.

Similar articles

Cited by

References

    1. Zhang X, Wu F, Yang N, Zhan X, Liao J, Mai S, et al. In silico methods for identification of potential therapeutic targets. Interdiscip Sci. 2022;14:285–310. doi: 10.1007/s12539-021-00491-y. - DOI - PMC - PubMed
    1. Lounkine E, Keiser MJ, Whitebread S, Mikhailov D, Hamon J, Jenkins JL, et al. Large-scale prediction and testing of drug activity on side-effect targets. Nature. 2012;486:361–7. doi: 10.1038/nature11159. - DOI - PMC - PubMed
    1. Keiser MJ, Setola V, Irwin JJ, Laggner C, Abbas AI, Hufeisen SJ, et al. Predicting new molecular targets for known drugs. Nature. 2009;462:175–81. doi: 10.1038/nature08506. - DOI - PMC - PubMed
    1. Pahikkala T, Airola A, Pietilä S, Shakyawar S, Szwajda A, Tang J, et al. Toward more realistic drug–target interaction predictions. Briefings Bioinf. 2015;16:325–37. doi: 10.1093/bib/bbu010. - DOI - PMC - PubMed
    1. Jayatunga MK, Xie W, Ruder L, Schulze U, Meier C. AI in small-molecule drug discovery: a coming wave. Nat Rev Drug Discov. 2022;21:175–6. doi: 10.1038/d41573-022-00025-1. - DOI - PubMed

LinkOut - more resources