Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 5;7(10):101510.
doi: 10.1016/j.jhepr.2025.101510. eCollection 2025 Oct.

Improving risk stratification and detection of early HCC using ultrasound-based deep learning models

Affiliations

Improving risk stratification and detection of early HCC using ultrasound-based deep learning models

Jérémy Dana et al. JHEP Rep. .

Abstract

Background & aims: Hepatocellular carcinoma (HCC) surveillance programs are suboptimal. We aimed to design an ultrasound-based deep learning model for HCC risk stratification (STARHE-RISK) and early-stage HCC detection (STARHE-DETECT) in patients with compensated advanced chronic liver disease (cACLD).

Methods: This prospective multicentric study included 403 adult patients with cACLD of all causes enrolled in a surveillance program for at least 6 months without prior history of HCC. STARHE-RISK was trained on ultrasound cine clips of the non-tumoral liver parenchyma using two classes: cases (n = 152 patients with early-stage HCC; 137/152 [82%] male; median age 63 years) and controls (n = 170 patients without HCC at inclusion and during a subsequent 1-year follow-up; 120/170 [71%] male; median age 69 years). STARHE-DETECT was trained on tumour ultrasound cine clips. The training/validation and testing sets were stratified according to potential confounders, and 50 patients who were balanced in both groups were allocated to the independent testing set based on sample size calculation. Statistical analysis included classification and detection metrics.

Results: STARHE-RISK achieved good prediction performances in the testing set with a 0.72 accuracy (95% CI 0.57-0.84) and an odds ratio of 6.6 (95% CI 1.9-22.7; p = 0.003). The combination of STARHE-RISK and the FASTRAK score, a multi-aetiology HCC risk stratification score, achieved a higher specificity (0.86 [95% CI 0.65-0.97]) and odds ratio (8.9 [95% CI 2.1-38.3; p = 0.004]) for predicting a patient at high risk of HCC development. STARHE-DETECT achieved a 0.67 mAP10, a 0.68 sensitivity (95% CI 0.47-0.85), and a 0.82 specificity (95% CI 0.69-0.91) for detecting early-stage HCC.

Conclusions: STARHE-RISK and STARHE-DETECT achieved robust performances for HCC risk stratification and early-stage HCC detection, respectively. They could become valuable surveillance tools and pave the way for a risk-based personalised surveillance program.

Impact and implications: STARHE-RISK is a reliable ultrasound-based deep learning model for hepatocellular carcinoma (HCC) risk stratification in patients with compensated advanced chronic liver disease and can be associated with complementary scores integrating clinical and blood parameters. STARHE-DETECT could become a complementary tool to visual assessment for radiologists and sonographers in HCC surveillance. Both models are based on simple and easy-to-perform ultrasound cine clip acquisitions. This study paves the way for a risk-based personalised surveillance program that will not ultimately rely on a single test but rather on a combination of approaches mixing clinical, biological, and radiological data.

Clinical trials registration: The study is registered at ClinicalTrials.gov (NCT04802954).

Keywords: Deep learning; Hepatocellular carcinoma; Prediction; Risk stratification; Ultrasound.

PubMed Disclaimer

Conflict of interest statement

MR received speaker fees from Terumo, Guerbet, Sirtex, General Electrics, Servier, and Canon. PN has received honoraria from and/or consults for AstraZeneca, Bayer, Bristol-Myers Squibb, Eisai, Gilead, Guerbet, Ipsen, and Roche. He received research grants from AstraZeneca, AbbVie, Bristol-Myers Squibb and Eisai. TFB is founder, shareholder, and advisor and received research grant support from Alentis Therapeutics. He serves also as advisor and consultant to Pueros Bioventures and Novo Holding. The other authors have no conflicts of interest to declare. Please refer to the accompanying ICMJE disclosure forms for further details.

Figures

Image 1
Graphical abstract
Fig. 1
Fig. 1
Flow chart of the study.
Fig. 2
Fig. 2
Performance of STARHE-RISK and STARHE-DETECT models in the testing set. (A) Calibration curve and (B) ROC curve of the STARHE-RISK model and FASTRAK score for HCC risk stratification in the testing set. (C) Mean Average Precision (Intersection over Union of 10%, 50%, 75%) with precision-recall curve obtained by plotting the STARHE-DETECT model's precision and recall values as a function of the model's confidence score threshold.
Fig. 3
Fig. 3
Liver parenchyma echotexture patterns and HCC risk stratification performed by STARHE-RISK. (A) Increased homogeneous echostructure in a 55-year-old man with alcohol-related cirrhosis correctly predicted at low-risk (77% confidence) compared to (B) a macronodular echostructure (arrow) in a 42-year-old man with cured hepatitis B cirrhosis and BCLC 0 hepatocellular carcinoma (not shown on the image) correctly predicted at high-risk (on this view of the non-tumoral liver parenchyma).
Fig. 4
Fig. 4
Correctly detected HCC nodule and false positive prediction by STARHE-DETECT model in a 74-year-old patient with metabolic dysfunction-associated steatotic liver disease and 17 mm isoechoic hepatocellular carcinoma. The HCC nodule in Figure 4A was correctly detected with a confidence level of 79.4%. Figure 4B shows a false positive prediction in the non-tumoral parenchyma with a high confidence level (75%). In Figure 4B, the partially obscured lesion was detected with a lower confidence level (42%). Note: Red bounding boxes represent the manually annotated ground truth, and blue bounding boxes are the HCC nodules predicted by the deep learning model.

References

    1. Rumgay H., Arnold M., Ferlay J., et al. Global burden of primary liver cancer in 2020 and predictions to 2040. J Hepatol. 2022;77:1598–1606. - PMC - PubMed
    1. Tzartzeva K., Obi J., Rich N.E., et al. Surveillance imaging and alpha fetoprotein for early detection of hepatocellular carcinoma in patients with cirrhosis: a meta-analysis. Gastroenterology. 2018;154:1706–1718.e1701. - PMC - PubMed
    1. Park H.J., Jang H.Y., Kim S.Y., et al. Non-enhanced magnetic resonance imaging as a surveillance tool for hepatocellular carcinoma: comparison with ultrasound. J Hepatol. 2020;72:718–724. - PubMed
    1. Goutté N., Sogni P., Bendersky N., et al. Geographical variations in incidence, management and survival of hepatocellular carcinoma in a Western country. J Hepatol. 2017;66:537–544. - PubMed
    1. Nahon P., Najean M., Layese R., et al. Early hepatocellular carcinoma detection using magnetic resonance imaging is cost-effective in high-risk patients with cirrhosis. JHEP Rep. 2022;4 - PMC - PubMed

Associated data

LinkOut - more resources