Survival path model outperforms conventional static machine learning models in long-term dynamic prognosis prediction for patients with intermediate stage hepatocellular carcinoma

doi:10.1093/bioadv/vbaf027

. 2025 Feb 17;5(1):vbaf027.

doi: 10.1093/bioadv/vbaf027. eCollection 2025.

Survival path model outperforms conventional static machine learning models in long-term dynamic prognosis prediction for patients with intermediate stage hepatocellular carcinoma

Lujun Shen^{1

2}, Tao Zhang³, Jian Xu⁴, Yiquan Jiang^{1

2}, Fei Cao^{1

2}, Qifeng Chen^{1

2}, Chen Li^{1

2}, Gulijiayina Nuerhashi^{1

2}, Wang Li^{1

2}, Peihong Wu^{1

2}, Weijun Fan^{1

2}

Affiliations

¹ Department of Minimally Invasive Therapy, Sun Yat-sen University Cancer Center, Guangzhou 510060, P. R. China.
² State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou 510060, P. R. China.
³ Department of Information, Nanfang Hospital, Southern Medical University, Guangzhou 510060, P. R. China.
⁴ Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, P. R. China.

PMID: 40201235
PMCID: PMC11978388
DOI: 10.1093/bioadv/vbaf027

Survival path model outperforms conventional static machine learning models in long-term dynamic prognosis prediction for patients with intermediate stage hepatocellular carcinoma

Lujun Shen et al. Bioinform Adv. 2025.

. 2025 Feb 17;5(1):vbaf027.

doi: 10.1093/bioadv/vbaf027. eCollection 2025.

Authors

Lujun Shen^{1

2}, Tao Zhang³, Jian Xu⁴, Yiquan Jiang^{1

2}, Fei Cao^{1

2}, Qifeng Chen^{1

2}, Chen Li^{1

2}, Gulijiayina Nuerhashi^{1

2}, Wang Li^{1

2}, Peihong Wu^{1

2}, Weijun Fan^{1

2}

Affiliations

¹ Department of Minimally Invasive Therapy, Sun Yat-sen University Cancer Center, Guangzhou 510060, P. R. China.
² State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou 510060, P. R. China.
³ Department of Information, Nanfang Hospital, Southern Medical University, Guangzhou 510060, P. R. China.
⁴ Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, P. R. China.

PMID: 40201235
PMCID: PMC11978388
DOI: 10.1093/bioadv/vbaf027

Abstract

Motivation: Patients with intermediate stage hepatocellular carcinoma (HCC) require repeated disease monitoring, prognosis assessment, and treatment planning. A novel machine learning model called survival path mapping (SP) model was developed, while its performance as compared with conventional machine learning models remains unknown. Between January 2007 and December 2018, the time-series data of 2644 intermediate stage HCC patients from four medical centers in China were reviewed and included. Static machine learning models by Gaussian Naive Bayes (GNB), support vector machine (SVM), and random forest (RF) for the prediction of survivorship were built based on data at initial admission. Longitudinal data divided into different time slices were utilized for the construction of the SP model. The time-dependent c-index was compared between models.

Results: The training set, internal testing set, and external testing set consisted of 1560, 670, and 414 HCC patients, respectively. The survival path model had superior or non-inferior performance in prognosis prediction compared to GNB and RF models since the 12th month after initial diagnosis in the training set and the external testing set. The survival path model had higher time-dependent c-index over all conventional ML models since the 6th month in the external testing cohort. In conclusion, the survival path model had superior performance in long-term dynamic prognosis prediction compared to conventional static machine learning models for intermediate stage HCC.

Availability and implementation: The parameters of models are provided in the manuscript.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

**Figure 1.**
Flowchart of study design. Times-series data of 2230 patients with intermediate stage HCC were included in this study and were separated into training set (n = 1560) and internal testing set (n = 670), respectively. (A) The time-series data were converted into data of time slices (ts). (B) In building of conventional static ML models, the data of time slice no. 1 were extracted. The conventional models were built based on framework of Bayesian, random forest, or SVM to separately predict survival status of HCC patients at 6, 12, 24, 36, 48 months based on data of time slices no. 1. The construction of survival path model was based on time-series data from ts no. 1 to no. 9. The time-dependent c-index of different category of models in prediction of survival based on data of different time slices (ts 1–9) were compared. The results were also validated in a multicentric cohort (n = 414).

**Figure 2.**
Scatter plot of the correlation between the survival outcome and key features/variables using PCA. The scatter plot describes the correlations between the survival outcome and 16 key features/variables of intermediate-stage liver after dimensionality reduction using PCA. In all subgraphs, the data points for “death” and “survival” overlap, suggesting that using variables from a single time point alone is insufficient to accurately predict whether a patient will have died at a specific time point. An outlier point with PC1 more than two times of the maximal value of the rest patients in PC1 at time slice no. 7 was removed to gain a better visualization of the relationship between the distribution of points and the outcomes.

**Figure 3.**
The survival path model constructed based on the training set. The numbers above each node separated by slash refers to the node name, sample size, and median OS of the patients at the specific node, respectively. The “NA” refers to the condition that median OS was not reached.

**Figure 4.**
Comparison of the time-dependent c-index between different models. The change of time-dependent c-index along with different evaluation time for the four machine learning models in the training set, internal testing set, and multicentric testing set were compared. *** The SP model demonstrated superior c-index compared to other machine learning models, with P-value $<$ .0009.

See this image and copyright information in PMC

References

1. Brown ZJ, Tsilimigras DI, Ruff SM et al. Management of hepatocellular carcinoma: a review. JAMA Surg 2023;158:410–20. - PubMed
1. Cai Z-Q, Si S-B, Chen C et al. Analysis of prognostic factors for survival after hepatectomy for hepatocellular carcinoma based on a Bayesian network. PLoS One 2015;10:e0120805. - PMC - PubMed
1. Campani C, Vitale A, Dragoni G et al. Time-varying mHAP-III is the most accurate predictor of survival in patients with hepatocellular carcinoma undergoing transarterial chemoembolization. Liver Cancer 2021;10:126–36. - PMC - PubMed
1. Chen D, Liu J, Zang L et al. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int J Biol Sci 2022;18:360–73. - PMC - PubMed
1. Hosny A, Parmar C, Coroller TP et al. Deep learning for lung cancer prognostication: a retrospective multi-cohort radiomics study. PLoS Med 2018;15:e1002711. - PMC - PubMed

LinkOut - more resources

Full Text Sources
- PubMed Central
- Silverchair Information Systems

[1] Brown ZJ, Tsilimigras DI, Ruff SM et al. Management of hepatocellular carcinoma: a review. JAMA Surg 2023;158:410–20. - PubMed

[2] Brown ZJ, Tsilimigras DI, Ruff SM et al. Management of hepatocellular carcinoma: a review. JAMA Surg 2023;158:410–20. - PubMed

[3] Cai Z-Q, Si S-B, Chen C et al. Analysis of prognostic factors for survival after hepatectomy for hepatocellular carcinoma based on a Bayesian network. PLoS One 2015;10:e0120805. - PMC - PubMed

[4] Cai Z-Q, Si S-B, Chen C et al. Analysis of prognostic factors for survival after hepatectomy for hepatocellular carcinoma based on a Bayesian network. PLoS One 2015;10:e0120805. - PMC - PubMed

[5] Campani C, Vitale A, Dragoni G et al. Time-varying mHAP-III is the most accurate predictor of survival in patients with hepatocellular carcinoma undergoing transarterial chemoembolization. Liver Cancer 2021;10:126–36. - PMC - PubMed

[6] Campani C, Vitale A, Dragoni G et al. Time-varying mHAP-III is the most accurate predictor of survival in patients with hepatocellular carcinoma undergoing transarterial chemoembolization. Liver Cancer 2021;10:126–36. - PMC - PubMed

[7] Chen D, Liu J, Zang L et al. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int J Biol Sci 2022;18:360–73. - PMC - PubMed

[8] Chen D, Liu J, Zang L et al. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int J Biol Sci 2022;18:360–73. - PMC - PubMed

[9] Hosny A, Parmar C, Coroller TP et al. Deep learning for lung cancer prognostication: a retrospective multi-cohort radiomics study. PLoS Med 2018;15:e1002711. - PMC - PubMed

[10] Hosny A, Parmar C, Coroller TP et al. Deep learning for lung cancer prognostication: a retrospective multi-cohort radiomics study. PLoS Med 2018;15:e1002711. - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Survival path model outperforms conventional static machine learning models in long-term dynamic prognosis prediction for patients with intermediate stage hepatocellular carcinoma

Affiliations

Survival path model outperforms conventional static machine learning models in long-term dynamic prognosis prediction for patients with intermediate stage hepatocellular carcinoma

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

References

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

References

Related information

LinkOut - more resources

Full Text Sources