Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 14;10(14):e34244.
doi: 10.1016/j.heliyon.2024.e34244. eCollection 2024 Jul 30.

A deep learning drug screening framework for integrating local-global characteristics: A novel attempt for limited data

Affiliations

A deep learning drug screening framework for integrating local-global characteristics: A novel attempt for limited data

Ying Wang et al. Heliyon. .

Abstract

At the beginning of the "Disease X" outbreak, drug discovery and development are often challenged by insufficient and unbalanced data. To address this problem and maximize the information value of limited data, we propose a drug screening model, LGCNN, based on convolutional neural network (CNN), which enables rapid drug screening by integrating features of drug molecular structures and drug-target interactions at both local and global (LG) levels. Experimental results show that LGCNN exhibits better performance compared to other state-of-the-art classification methods under limited data. In addition, LGCNN was applied to anti-SARS-CoV-2 drug screening to realize therapeutic drug mining against COVID-19. LGCNN transcends the limitations of traditional models for predicting interactions between single drug targets and shows new advantages in predicting multi-target drug-target interactions. Notably, the cross-coronavirus generalizability of the model is also implied by the analysis of targets, drugs, and mechanisms in the prediction results. In conclusion, LGCNN provides new ideas and methods for rapid drug screening in emergency situations where data are scarce.

Keywords: COVID-19; Deep learning; Direct-acting antiviral drugs; Host-targeted antiviral drugs; Molecular docking; Virtual drug screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image 1
Graphical abstract
Fig. 1
Fig. 1
The overall workflow of the study. The combination of drug substructure features and drug-target association features is embedded as comprehensive feature vectors, which is trained by a convolutional neural network composed of double convolutional architecture. The performance of LGCNN and the accuracy of the predicted results are evaluated by multiple cross-validation and molecular docking simulation experiments. Drug-target network analysis and therapeutic mechanism analysis are used to explain the reliability of the predicted results.
Fig. 2
Fig. 2
Diagram of feature extraction and integration. Drug substructure features and drug-target association features are integrated into local-global comprehensive features.
Fig. 3
Fig. 3
The architecture of the CNN model.
Fig. 4
Fig. 4
LGCNN performance diagram. (A) Loss values and accuracy change figures of the training set and test set (Positive sample: negative sample = 1:1). (B) ROC and PR curves of the LGCNN. (C) LGCNN cross-validation results based on different datasets.
Fig. 5
Fig. 5
ROC curves obtained by LGCNN and other competing methods on the limited datasets based on 5-fold CV.
Fig. 6
Fig. 6
Drug-target interaction network. (A) Complete drug-target interaction network predicted by LGCNN. (B) Drug-target interaction subnetwork for multi-target drugs (cid). (C) Drug-target interaction subnetworks for targets modulated by multiple drugs (cid).
Fig. 7
Fig. 7
Virtual docking diagrams of ligands and receptors. 3D interactions of ligand-receptor complex (left) and 2D interactions of ligand-receptor complex (right) in each subdiagram. (A) CHEMBL2263447-DHODH docking complex. (B) SCHEMBL18785395-SIG1R docking complex. (C) CHEMBL473984-MAPK14 docking complex. (D) CHEMBL474178-MAPK14 docking complex.
Fig. 8
Fig. 8
Compound-mechanism-target diagram. This diagram illustrates the possible pathways through which high-scoring predictive drugs may treat COVID-19 via target engagement. Each target is assigned a serial number. Targets involved in viral infection replication processes are indicated by blue boxes, while targets implicated in the inflammatory response triggered by viral invasion are marked with brown boxes. Targets involved in both processes are labeled green. The example compound CHEMBL ID and the chemical formula corresponding to each pathway are highlighted in red. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

References

    1. Oberfeld B., Achanta A., Carpenter K., Chen P., Gilette N.M., Langat P., Said J.T., Schiff A.E., Zhou A.S., Barczak A.K., Pillai S. SnapShot: COVID-19. Cell. 2020;181:954–954.e951. - PMC - PubMed
    1. Pal M., Berhanu G., Desalegn C., Kandi V. Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2): an update. Cureus. 2020;12:e7423. - PMC - PubMed
    1. Rabaan A.A., Al-Ahmed S.H., Haque S., Sah R., Tiwari R., Malik Y.S., Dhama K., Yatoo M.I., Bonilla-Aldana D.K., Rodriguez-Morales A.J. SARS-CoV-2, SARS-CoV, and MERS-COV: a comparative overview. Infez Med. 2020;28:174–184. - PubMed
    1. Öztürk H., Özgür A., Ozkirimli E. DeepDTA: deep drug-target binding affinity prediction. Bioinformatics. 2018;34:i821–i829. - PMC - PubMed
    1. Joshi T., Joshi T., Pundir H., Sharma P., Mathpal S., Chandra S. vol. 39. 2021. pp. 6728–6746. (Predictive Modeling by Deep Learning, Virtual Screening and Molecular Dynamics Study of Natural Compounds against SARS-CoV-2 Main Protease). - PMC - PubMed

LinkOut - more resources