Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 15;10(33):37096-37114.
doi: 10.1021/acsomega.5c01473. eCollection 2025 Aug 26.

StackNAFLD: An Accurate Stacking Ensemble Learning Targeting NAFLD Treatment

Affiliations

StackNAFLD: An Accurate Stacking Ensemble Learning Targeting NAFLD Treatment

Andi Endang Kusuma Intan et al. ACS Omega. .

Abstract

Nonalcoholic fatty liver disease (NAFLD) is a slow-progressing yet complex disease with multiple pathophysiological mechanisms that make it challenging to treat. In this study, we developed a machine learning (ML)-based stacking ensemble model to predict molecules that could inhibit NAFLD progression utilizing data from animal experiments. We systematically collected 75 agents from preclinical experiments and classified them as inducers and inhibitors based on each study end point. Then, we computed 12 sets of molecular fingerprints and trained them with three baseline ML models. After that, the stacked model was trained using the predictive features from the baseline models and validated with 5-fold cross-validation (5-CV) and leave-one-out cross-validation (LOOCV). We found that the stacked model outperformed its baseline model across various evaluation metrics, thereby improving the prediction of the NAFLD inhibitory activity. Additionally, we tested the robustness and applicability domain of the stacked model, ensuring that this model delivered a trustworthy prediction. Moreover, we highlighted key molecular features, such as carboxylic, alkene, or aromatic rings, underscoring their influence on the decision-making of the stacked model. In conclusion, we have provided an effective method for improving molecular property prediction by using the stacking ensemble learning approach. Furthermore, we hosted our software in an open-access GitHub repository for further reproducibility and use in the drug discovery pipeline.

PubMed Disclaimer

Figures

1
1
Stacking ensemble learning model architecture used in this study.
2
2
Chemical distribution of the in vivo NAFLD data set using the t-SNE distribution plot.
3
3
5-CV AUROC scores of RF baseline models based on 13 molecular fingerprints.
4
4
5-CV AUPRC scores of RF baseline models based on 13 molecular fingerprints.
5
5
5-CV AUROC scores of SVM baseline models based on 13 molecular fingerprints.
6
6
5-CV AUPRC scores of SVM baseline models based on 13 molecular fingerprints.
7
7
5-CV AUROC scores of XGB baseline models based on 13 molecular fingerprints.
8
8
5-CV AUPRC scores of XGB baseline models based on 13 molecular fingerprints.
9
9
5-CV AUROC and AUPRC metrics of the stacked model. (A) and (B) represent the average AUROC and AUPRC metrics of the stacked model without the Combined fingerprint, while (C) and (D) show the corresponding metrics when the Combined fingerprint is included.
10
10
y-Randomization experiment of the StackNAFLD model.
11
11
Top feature from (A) StackNAFLD model and (B) SubFP with XGB baseline model.
12
12
Example of structure importance for NAFLD classification.

Similar articles

References

    1. Buzzetti E., Pinzani M., Tsochatzis E. A.. The multiple-hit pathogenesis of non-alcoholic fatty liver disease (NAFLD) Metabolism. 2016;65:1038–1048. doi: 10.1016/j.metabol.2015.12.012. - DOI - PubMed
    1. Chalasani N., Younossi Z., Lavine J. E., Charlton M., Cusi K., Rinella M., Harrison S. A., Brunt E. M., Sanyal A. J.. The diagnosis and management of nonalcoholic fatty liver disease: Practice guidance from the American Association for the Study of Liver Diseases. Hepatology. 2018;67:328–357. doi: 10.1002/hep.29367. - DOI - PubMed
    1. Glen J., Floros L., Day C., Pryke R.. Non-alcoholic fatty liver disease (NAFLD): summary of NICE guidance. BMJ. 2016:i4428. doi: 10.1136/bmj.i4428. - DOI - PubMed
    1. European Association for the Study of the Liver (EASL); European Association for the Study of the Liver (EASL); European Association for the Study of the Liver (EASL) EASL–EASD–EASO Clinical Practice Guidelines for the management of non-alcoholic fatty liver disease. J. Hepatol. 2016;64:1388–1402. doi: 10.1016/j.jhep.2015.11.004. - DOI - PubMed
    1. Andrea T. A., Kalayeh H.. Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors. J. Med. Chem. 1991;34:2824–2836. doi: 10.1021/jm00113a022. - DOI - PubMed

LinkOut - more resources