Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 15;39(1):79.
doi: 10.1007/s10822-025-00657-6.

Synergy of advanced machine learning and deep neural networks with consensus molecular docking for virtual screening of anaplastic lymphoma kinase inhibitors

Affiliations

Synergy of advanced machine learning and deep neural networks with consensus molecular docking for virtual screening of anaplastic lymphoma kinase inhibitors

The-Chuong Trinh et al. J Comput Aided Mol Des. .

Abstract

This study addresses the urgent need for an AI model to predict Anaplastic Lymphoma Kinase (ALK) inhibitors for Non-Small Cell Lung Cancer treatment, targeting the ALK-positive mutation. With only five Food and Drug Administration approved ALK inhibitors currently available, effective drugs remain in demand. Leveraging machine learning (ML) and deep learning (DL), our research accelerates the precise screening of novel ALK inhibitors using both ligand-based and structure-based approaches. In ligand-based approach, an ensemble voting model comprising three base learners to classify potential ALK inhibitors, achieving promising retrospective validation results. Notably, the ML-based XGBoost algorithm exhibited compelling results with external validation (EV)-f1 score of 0.921, EV-Average Precision (AP) of 0.961, cross-validation (CV)-f1 score of [Formula: see text] and CV-AP of [Formula: see text]. Besides, the DL-based Artificial Neural Network (ANN) model demonstrated comparative performance with EV-f1 score of 0.930, EV-AP of 0.955, CV-f1 score of [Formula: see text] and CV-AP of [Formula: see text]. For structure-based approach, an XGBoost consensus docking model utilized scores from three molecular docking programs (GNINA 1.0, Vina-GPU 2.0, and AutoDock-GPU) as features. Combining these two approaches, we virtually screened 120,571 compounds, identifying three promising ALK inhibitors, CHEMBL1689515, CHEMBL2380351, and CHEMBL102714, that bind to the protein's pocket and establish hydrophobic contacts in the hinge region through their ketone groups, resembling Alectinib's interaction. Comparative analysis revealed traditional ML models outperformed Graph Neural Networks (GNN), highlighting the critical role of feature engineering and dataset size importance. The study recommends further in vitro testing to validate the prospective screening performance of these models. A graphical user interface is available at https://huggingface.co/spaces/thechuongtrinh/ALK_inhibitors_classification .

Keywords: Anaplastic lymphoma kinase; Artificial intelligence; Benchmarking; Computer-aided drug design; Consensus molecular docking; Machine learning.

PubMed Disclaimer

Conflict of interest statement

Declarations. Conflict of interest: The authors declare no conflict of interest.

References

    1. Observatory G-GC (2020) Journal. https://gco.iarc.fr/ . Accessed 2020
    1. Society AC (2023) Journal. https://www.cancer.org/ . Accessed 2023
    1. Targeted Therapy to Treat Cancer (2023) https://www.cancer.gov/about-cancer/treatment/types/targeted-therapies . Accessed 15 May 2023
    1. Carpenter G, Cohen S (1979) Epidermal growth factor. Annu Rev Biochem 48(1):193–216 - PubMed - DOI
    1. Shaw AT, Engelman JA (2013) Alk in lung cancer: past, present, and future. J Clin Oncol 31(8):1105–1111 - PubMed - PMC - DOI

MeSH terms

LinkOut - more resources