Machine learning-based models for advanced fibrosis in non-alcoholic steatohepatitis patients: A cohort study
- PMID: 40061588
- PMCID: PMC11886044
- DOI: 10.3748/wjg.v31.i9.101383
Machine learning-based models for advanced fibrosis in non-alcoholic steatohepatitis patients: A cohort study
Abstract
Background: The global prevalence of non-alcoholic steatohepatitis (NASH) and its associated risk of adverse outcomes, particularly in patients with advanced liver fibrosis, underscores the importance of early and accurate diagnosis.
Aim: To develop a machine learning-based diagnostic model for advanced liver fibrosis in NASH patients.
Methods: A total of 749 patients who underwent liver biopsy at Beijing Ditan Hospital, Capital Medical University, between January 2010 and January 2020 were included. Patients were randomly divided into training (n = 522) and validation (n = 224) cohorts. Five machine learning models were applied to predict advanced liver fibrosis, with feature selection based on Shapley Additive Explanations (SHAP). The diagnostic performance of these models was compared to traditional scores such as the aspartate aminotransferase to platelet ratio index (APRI) and fibrosis index based on the 4 factors (FIB-4), using metrics including the area under the receiver operating characteristic curve (AUROC), decision curve analysis (DCA), and calibration curves.
Results: The Extreme Gradient Boosting (XGBoost) model outperformed all other machine learning models, achieving an AUROC of 0.934 (95%CI: 0.914-0.955) in the training cohort and 0.917 (95%CI: 0.880-0.953) in the validation cohort (P < 0.001). Incorporating liver stiffness measurement into the model further improved its performance, with an AUROC of 0.977 (95%CI: 0.966-0.980) in the training cohort and 0.970 (95%CI: 0.950-0.990) in the validation cohort, significantly surpassing APRI and FIB-4 scores (P < 0.001). The XGBoost model also demonstrated superior clinical utility, as evidenced by DCA and calibration curve analysis in both cohorts.
Conclusion: The XGBoost model provides a highly accurate, non-invasive diagnosis of advanced liver fibrosis in NASH patients, outperforming traditional methods. An online tool based on this model has been developed to assist clinicians in evaluating the risk of advanced liver fibrosis.
Keywords: Advanced fibrosis; Extreme Gradient Boosting; Machine learning; Non-alcoholic steatohepatitis; Non-invasive.
©The Author(s) 2025. Published by Baishideng Publishing Group Inc. All rights reserved.
Conflict of interest statement
Conflict-of-interest statement: All the authors report no relevant conflicts of interest for this article.
Figures
References
-
- Man S, Deng Y, Ma Y, Fu J, Bao H, Yu C, Lv J, Liu H, Wang B, Li L. Prevalence of Liver Steatosis and Fibrosis in the General Population and Various High-Risk Populations: A Nationwide Study With 5.7 Million Adults in China. Gastroenterology. 2023;165:1025–1040. - PubMed
-
- Kong AP, Lau ES, O CK, Luk AO, Yip TC, Chow EY, Kwok R, Lee HW, Wong GL, Ma RC, Chan HL, Wong VW, Chan JC. Advanced liver fibrosis predicts heart failure and hospitalizations in people with type 2 diabetes: A prospective cohort study from Hong Kong Diabetes Register. Diabetes Res Clin Pract. 2023;202:110825. - PubMed
-
- Mantovani A, Csermely A, Petracca G, Beatrice G, Corey KE, Simon TG, Byrne CD, Targher G. Non-alcoholic fatty liver disease and risk of fatal and non-fatal cardiovascular events: an updated systematic review and meta-analysis. Lancet Gastroenterol Hepatol. 2021;6:903–913. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
