Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul;6(4):e230218.
doi: 10.1148/ryai.230218.

Two-Stage Training Framework Using Multicontrast MRI Radiomics for IDH Mutation Status Prediction in Glioma

Affiliations

Two-Stage Training Framework Using Multicontrast MRI Radiomics for IDH Mutation Status Prediction in Glioma

Nghi C D Truong et al. Radiol Artif Intell. 2024 Jul.

Abstract

Purpose To develop a radiomics framework for preoperative MRI-based prediction of isocitrate dehydrogenase (IDH) mutation status, a crucial glioma prognostic indicator. Materials and Methods Radiomics features (shape, first-order statistics, and texture) were extracted from the whole tumor or the combination of nonenhancing, necrosis, and edema regions. Segmentation masks were obtained via the federated tumor segmentation tool or the original data source. Boruta, a wrapper-based feature selection algorithm, identified relevant features. Addressing the imbalance between mutated and wild-type cases, multiple prediction models were trained on balanced data subsets using random forest or XGBoost and assembled to build the final classifier. The framework was evaluated using retrospective MRI scans from three public datasets (The Cancer Imaging Archive [TCIA, 227 patients], the University of California San Francisco Preoperative Diffuse Glioma MRI dataset [UCSF, 495 patients], and the Erasmus Glioma Database [EGD, 456 patients]) and internal datasets collected from the University of Texas Southwestern Medical Center (UTSW, 356 patients), New York University (NYU, 136 patients), and University of Wisconsin-Madison (UWM, 174 patients). TCIA and UTSW served as separate training sets, while the remaining data constituted the test set (1617 or 1488 testing cases, respectively). Results The best performing models trained on the TCIA dataset achieved area under the receiver operating characteristic curve (AUC) values of 0.89 for UTSW, 0.86 for NYU, 0.93 for UWM, 0.94 for UCSF, and 0.88 for EGD test sets. The best performing models trained on the UTSW dataset achieved slightly higher AUCs: 0.92 for TCIA, 0.88 for NYU, 0.96 for UWM, 0.93 for UCSF, and 0.90 for EGD. Conclusion This MRI radiomics-based framework shows promise for accurate preoperative prediction of IDH mutation status in patients with glioma. Keywords: Glioma, Isocitrate Dehydrogenase Mutation, IDH Mutation, Radiomics, MRI Supplemental material is available for this article. Published under a CC BY 4.0 license. See also commentary by Moassefi and Erickson in this issue.

Keywords: Glioma; IDH Mutation; Isocitrate Dehydrogenase Mutation; MRI; Radiomics.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: N.C.D.T. No relevant relationships. C.G.B.Y. No relevant relationships. B.C.W. No relevant relationships. J.M.H. No relevant relationships. D.R. No relevant relationships. N.S. No relevant relationships. K.J.H. No relevant relationships. T.R.P. No relevant relationships. B.F. No relevant relationships. M.D.L. No relevant relationships. R.J. No relevant relationships. R.J.B. Supported by NIH 1R01LM013151-01A1. M.C.P. No relevant relationships. A.J.M. Supported by NIH/NCI U01CA207091. J.A.M. Supported by NIH/NCI U01CA207091 and R01CA260705.

Figures

None
Graphical abstract
Flowchart of proposed MRI radiomics-based framework for predicting
isocitrate dehydrogenase (IDH) mutation status in gliomas.
Figure 1:
Flowchart of proposed MRI radiomics-based framework for predicting isocitrate dehydrogenase (IDH) mutation status in gliomas.
Receiver operating characteristic (ROC) curves for the random forest
and XGBoost (XGB) models trained using relevant features from the whole
tumor; the nonenhancing tumor (NET), necrosis (NCR), and edema (ED) region
of interest; and the combined features from both regions of interest. (A)
ROC curves for the combined test sets from the University of Texas
Southwestern Medical Center (UTSW), New York University (NYU), University of
Wisconsin–Madison, and University of California San Francisco
Preoperative Diffuse Glioma MRI dataset (UCSF) obtained by models trained on
The Cancer Imaging Archive (TCIA) dataset. (B) ROC curves for the combined
test sets from TCIA, UTSW, NYU, and UCSF obtained by models trained on the
UTSW dataset. AUC = area under the receiver operating characteristic curve,
RF = random forest.
Figure 2:
Receiver operating characteristic (ROC) curves for the random forest and XGBoost (XGB) models trained using relevant features from the whole tumor; the nonenhancing tumor (NET), necrosis (NCR), and edema (ED) region of interest; and the combined features from both regions of interest. (A) ROC curves for the combined test sets from the University of Texas Southwestern Medical Center (UTSW), New York University (NYU), University of Wisconsin–Madison, and University of California San Francisco Preoperative Diffuse Glioma MRI dataset (UCSF) obtained by models trained on The Cancer Imaging Archive (TCIA) dataset. (B) ROC curves for the combined test sets from TCIA, UTSW, NYU, and UCSF obtained by models trained on the UTSW dataset. AUC = area under the receiver operating characteristic curve, RF = random forest.
Summary of the most frequently selected image types, feature classes,
and MRI sequences. (A) The most selected image types. (B) The most selected
feature classes. (C) The percentage of MRI sequences identified as relevant
features. GLCM = gray-level co-occurrence matrix, GLDM = gray-level
dependence matrix, GLRLM = gray-level run-length matrix, GLSZM = gray-level
size zone matrix, LoG = Laplacian of Gaussian, NGTDM = neighboring gray-tone
difference matrix, WL-HH = wavelet filtering with high-pass filters, WL-HL =
wavelet filtering with high-pass and low-pass filters, WL-LH = wavelet
filtering with low-pass and high-pass filters, WL-LL = wavelet filtering
with low-pass filters.
Figure 3:
Summary of the most frequently selected image types, feature classes, and MRI sequences. (A) The most selected image types. (B) The most selected feature classes. (C) The percentage of MRI sequences identified as relevant features. GLCM = gray-level co-occurrence matrix, GLDM = gray-level dependence matrix, GLRLM = gray-level run-length matrix, GLSZM = gray-level size zone matrix, LoG = Laplacian of Gaussian, NGTDM = neighboring gray-tone difference matrix, WL-HH = wavelet filtering with high-pass filters, WL-HL = wavelet filtering with high-pass and low-pass filters, WL-LH = wavelet filtering with low-pass and high-pass filters, WL-LL = wavelet filtering with low-pass filters.
Chart of the top 20 relevant features with their importance scores
measured by Shapley additive explanations (SHAP) values. The feature names
are labeled as region of interest 1 (ROI1) and ROI2, representing the whole
tumor and the nonenhancing tumor (NET), necrosis (NCR), and edema (ED)
region of interests, respectively. glcm = gray-level co-occurrence matrix,
glszm = gray-level size zone matrix, log = Laplacian of Gaussian, ROI =
region of interest, wavelet-LH = wavelet filtering with low-pass and
high-pass filters, wavelet-LL = wavelet filtering with low-pass
filters.
Figure 4:
Chart of the top 20 relevant features with their importance scores measured by Shapley additive explanations (SHAP) values. The feature names are labeled as region of interest 1 (ROI1) and ROI2, representing the whole tumor and the nonenhancing tumor (NET), necrosis (NCR), and edema (ED) region of interests, respectively. glcm = gray-level co-occurrence matrix, glszm = gray-level size zone matrix, log = Laplacian of Gaussian, ROI = region of interest, wavelet-LH = wavelet filtering with low-pass and high-pass filters, wavelet-LL = wavelet filtering with low-pass filters.

References

    1. Louis DN , Perry A , Reifenberger G , et al. . The 2016 World Health Organization Classification of Tumors of the Central Nervous System: a summary . Acta Neuropathol (Berl) 2016. ; 131 ( 6 ): 803 – 820 . - PubMed
    1. Gillies RJ , Kinahan PE , Hricak H . Radiomics: Images Are More than Pictures, They Are Data . Radiology 2016. ; 278 ( 2 ): 563 – 577 . - PMC - PubMed
    1. Li Y , Ammari S , Lawrance L , et al. . Radiomics-Based Method for Predicting the Glioma Subtype as Defined by Tumor Grade, IDH Mutation, and 1p/19q Codeletion . Cancers (Basel) 2022. ; 14 ( 7 ): 1778 . - PMC - PubMed
    1. Liu X , Li Y , Li S , et al. . IDH mutation-specific radiomic signature in lower-grade gliomas . Aging (Albany NY) . 2019. ; 11 ( 2 ): 673 – 696 . - PMC - PubMed
    1. Tan Y , Zhang S-T , Wei J-W , et al. . A radiomics nomogram may improve the prediction of IDH genotype for astrocytoma before surgery . Eur Radiol 2019. ; 29 ( 7 ): 3325 – 3337 . - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources