Machine learning algorithms outperform conventional regression models in predicting development of hepatocellular carcinoma
- PMID: 24169273
- PMCID: PMC4610387
- DOI: 10.1038/ajg.2013.332
Machine learning algorithms outperform conventional regression models in predicting development of hepatocellular carcinoma
Abstract
Objectives: Predictive models for hepatocellular carcinoma (HCC) have been limited by modest accuracy and lack of validation. Machine-learning algorithms offer a novel methodology, which may improve HCC risk prognostication among patients with cirrhosis. Our study's aim was to develop and compare predictive models for HCC development among cirrhotic patients, using conventional regression analysis and machine-learning algorithms.
Methods: We enrolled 442 patients with Child A or B cirrhosis at the University of Michigan between January 2004 and September 2006 (UM cohort) and prospectively followed them until HCC development, liver transplantation, death, or study termination. Regression analysis and machine-learning algorithms were used to construct predictive models for HCC development, which were tested on an independent validation cohort from the Hepatitis C Antiviral Long-term Treatment against Cirrhosis (HALT-C) Trial. Both models were also compared with the previously published HALT-C model. Discrimination was assessed using receiver operating characteristic curve analysis, and diagnostic accuracy was assessed with net reclassification improvement and integrated discrimination improvement statistics.
Results: After a median follow-up of 3.5 years, 41 patients developed HCC. The UM regression model had a c-statistic of 0.61 (95% confidence interval (CI) 0.56-0.67), whereas the machine-learning algorithm had a c-statistic of 0.64 (95% CI 0.60-0.69) in the validation cohort. The HALT-C model had a c-statistic of 0.60 (95% CI 0.50-0.70) in the validation cohort and was outperformed by the machine-learning algorithm. The machine-learning algorithm had significantly better diagnostic accuracy as assessed by net reclassification improvement (P<0.001) and integrated discrimination improvement (P=0.04).
Conclusions: Machine-learning algorithms improve the accuracy of risk stratifying patients with cirrhosis and can be used to accurately identify patients at high-risk for developing HCC.
Figures
References
-
- El-Serag HB, Rudolph KL. Hepatocellular carcinoma: epidemiology and molecular carcinogenesis. Gastroenterology. 2007;132:2557–76. - PubMed
-
- Singal AG, Marrero JA. Recent advances in the treatment of hepatocellular carcinoma. Curr Opin Gastroenterol. 2010;26:189–95. - PubMed
-
- Llovet JM, Bustamante J, Castells A, et al. Natural history of untreated nonsurgical hepatocellular carcinoma: rationale for the design and evaluation of therapeutic trials. Hepatology. 1999;29:62–7. - PubMed
-
- Mazzaferro V, Regalia E, Doci R, et al. Liver transplantation for the treatment of small hepatocellular carcinomas in patients with cirrhosis. N Engl J Med. 1996;334:693–9. - PubMed
-
- Meissner HI, Smith RA, Rimer BK, et al. Promoting cancer screening: Learning from experience. Cancer. 2004;101:1107–17. - PubMed
Publication types
MeSH terms
Grants and funding
- UL1 TR001105/TR/NCATS NIH HHS/United States
- KL2TR000453/TR/NCATS NIH HHS/United States
- K23 DK064909/DK/NIDDK NIH HHS/United States
- UL1RR024986/RR/NCRR NIH HHS/United States
- R03 DK077707/DK/NIDDK NIH HHS/United States
- KL2 TR001103/TR/NCATS NIH HHS/United States
- DK064909/DK/NIDDK NIH HHS/United States
- IK2 HX000775/HX/HSRD VA/United States
- DK077707/DK/NIDDK NIH HHS/United States
- KL2 TR000453/TR/NCATS NIH HHS/United States
- R01 GM097117/GM/NIGMS NIH HHS/United States
- UL1 RR024986/RR/NCRR NIH HHS/United States
- KL2 RR024987/RR/NCRR NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
