Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Oct 1:arXiv:2404.08611v2.

Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

Affiliations

Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

Xin Tie et al. ArXiv. .

Update in

Abstract

Purpose: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients.

Materials and methods: This retrospective study included baseline (PET1) and interim (PET2) PET/CT images from 297 patients enrolled in two Children's Oncology Group clinical trials (AHOD1331 and AHOD0831). LAS-Net incorporates longitudinal cross-attention, allowing relevant features from PET1 to inform the analysis of PET2. Model performance was evaluated using Dice coefficients for PET1 and detection F1 scores for PET2. Additionally, we extracted and compared quantitative PET metrics, including metabolic tumor volume (MTV) and total lesion glycolysis (TLG) in PET1, as well as qPET and ΔSUVmax in PET2, against physician measurements. We quantified their agreement using Spearman's ρ correlations and employed bootstrap resampling for statistical analysis.

Results: LAS-Net detected residual lymphoma in PET2 with an F1 score of 0.606 (precision/recall: 0.615/0.600), outperforming all comparator methods (P<0.01). For baseline segmentation, LAS-Net achieved a mean Dice score of 0.772. In PET quantification, LAS-Net's measurements of qPET, ΔSUVmax, MTV and TLG were strongly correlated with physician measurements, with Spearman's ρ of 0.78, 0.80, 0.93 and 0.96, respectively. The quantification performance remained high, with a slight decrease, in an external testing cohort.

Conclusion: LAS-Net demonstrated significant improvements in quantifying PET metrics across serial scans, highlighting the value of longitudinal awareness in evaluating multi-time-point imaging datasets.

Keywords: Deep Learning; Image Segmentation; Longitudinal Analysis; Quantitative PET.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
The architecture of longitudinally-aware segmentation network (LAS-Net). (A) The dual-branch design accommodates baseline (PET1) and interim (PET2) PET/CT images. One branch is dedicated to processing PET1 while the other branch focuses on PET2, using features extracted from PET2 as well as the features from the PET1 branch. (B) The longitudinally-aware window attention (LAWA) module introduces multi-head cross-attention following two self-attention blocks. All attention layers have a window size of 7. (C) The longitudinally-aware attention gate (LAAG) introduces a learnable convolutional layer (kernel size=7) following the standard self-attention gate to refine the attention coefficients for PET2. Both LAWA and LAAG modules only allow one-way information flow from the PET1 to the PET2 branch.
Figure 2:
Figure 2:
Performance comparison of interim PET lesion detection in the internal cohort. Results are reported with and without mask propagation through deformable registration (MPDR). Notably, CKD-Trans and ST-Trans utilized baseline lesion masks predicted by DynUNet for MPDR. (A) and (B) present the results of detection F1 scores, precision, and recall using different criteria to classify true positives. In (A), a predicted lesion is classified as a true positive if it overlaps with at least one voxel of the reference lesion. In (B), a predicted lesion is considered a true positive if its SUVmax is matched with the reference lesion’s SUVmax. (C) quantifies the agreement between model predictions and physician measurements for interim PET metrics. In the plots, actual metric values and Spearman’s correlation values are marked by circles with error bars indicating 95% confidence intervals. LAS-Net showed significantly improved performance (P < 0.05) over all comparator methods in F1 scores and interim PET metrics. The only exception was for qPET where its performance did not significantly surpass SegResNet with MPDR (P = 0.057). SUVmax = maximum lesion standardized uptake value, ΔSUVmax = percentage difference of SUVmax between the baseline and interim scans.
Figure 3:
Figure 3:
Performance comparison of baseline PET lesion segmentation in the internal cohort. (A) shows violin plots of evaluation metrics, where vertical lines represent the interquartile ranges and white circles mark the median values. (B) compares the correlations between baseline PET metrics assessed by physicians and those measured by deep learning models. Actual Spearman’s correlation values are marked by circles and their 95% confidence intervals are denoted by error bars. FPV = false positive volume, FNV = false negative volume, MTV = metabolic tumor volume, TLG = total lesion glycolysis, SUVmax = maximum lesion standardized uptake value, Dmax = maximum tumor dissemination, Dspleen = maximum distance between the lesion and the spleen.
Figure 4:
Figure 4:
Comparison of physician-based and automatically extracted PET metrics. Spearman’s ρ correlations are shown in the top left corner of each plot. Correlation values are presented as mean [2.5th percentile, 97.5th percentile].
Figure 5:
Figure 5:
Nine different examples of longitudinally-aware segmentation network (LAS-Net) output. Each case has maximum intensity projections (MIPs) of baseline and interim PET images with overlaying MIPs of the reference and predicted lesion masks. DS = Deauville score.
Figure 6:
Figure 6:
Four examples comparing the proposed longitudinally-aware segmentation network (LAS-Net) with DynUNet, a model without longitudinal cross-attention. Each case has maximum intensity projections (MIPs) of baseline and interim PET images, overlaid with MIPs of reference and predicted lesion masks. For both LAS-Net and DynUNet output, results incorporating mask propagation through deformable registration (MPDR) are also included. DS = Deauville score.

References

    1. Mauz-Körholz C, Metzger ML, Kelly KM, et al. Pediatric Hodgkin Lymphoma. JCO. 2015;33(27):2975–2985. doi: 10.1200/JCO.2014.59.4853. - DOI - PubMed
    1. Castellino SM, Pei Q, Parsons SK, et al. Brentuximab Vedotin with Chemotherapy in Pediatric High-Risk Hodgkin’s Lymphoma. N Engl J Med. 2022;387(18):1649–1660. doi: 10.1056/NEJMoa2206660. - DOI - PMC - PubMed
    1. Friedman DL, Chen L, Wolden S, et al. Dose-Intensive Response-Based Chemotherapy and Radiation Therapy for Children and Adolescents With Newly Diagnosed Intermediate-Risk Hodgkin Lymphoma: A Report From the Children’s Oncology Group Study AHOD0031. JCO. 2014;32(32):3651–3658. doi: 10.1200/JCO.2013.52.5410. - DOI - PMC - PubMed
    1. Kelly KM, Cole PD, Pei Q, et al. Response-adapted therapy for the treatment of children with newly diagnosed high risk Hodgkin lymphoma (AHOD0831): a report from the Children’s Oncology Group. Br J Haematol. 2019;187(1):39–48. 10.1111/bjh.16014. - DOI - PMC - PubMed
    1. Cheson BD, Fisher RI, Barrington SF, et al. Recommendations for Initial Evaluation, Staging, and Response Assessment of Hodgkin and Non-Hodgkin Lymphoma: The Lugano Classification. JCO. 2014;32(27):3059–3067. doi: 10.1200/JCO.2013.54.8800. - DOI - PMC - PubMed

Publication types

LinkOut - more resources