Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov;6(6):e230514.
doi: 10.1148/ryai.230514.

Addressing the Generalizability of AI in Radiology Using a Novel Data Augmentation Framework with Synthetic Patient Image Data: Proof-of-Concept and External Validation for Classification Tasks in Multiple Sclerosis

Affiliations

Addressing the Generalizability of AI in Radiology Using a Novel Data Augmentation Framework with Synthetic Patient Image Data: Proof-of-Concept and External Validation for Classification Tasks in Multiple Sclerosis

Gianluca Brugnara et al. Radiol Artif Intell. 2024 Nov.

Abstract

Artificial intelligence (AI) models often face performance drops after deployment to external datasets. This study evaluated the potential of a novel data augmentation framework based on generative adversarial networks (GANs) that creates synthetic patient image data for model training to improve model generalizability. Model development and external testing were performed for a given classification task, namely the detection of new fluid-attenuated inversion recovery lesions at MRI during longitudinal follow-up of patients with multiple sclerosis (MS). An internal dataset of 669 patients with MS (n = 3083 examinations) was used to develop an attention-based network, trained both with and without the inclusion of the GAN-based synthetic data augmentation framework. External testing was performed on 134 patients with MS from a different institution, with MR images acquired using different scanners and protocols than images used during training. Models trained using synthetic data augmentation showed a significant performance improvement when applied on external data (area under the receiver operating characteristic curve [AUC], 83.6% without synthetic data vs 93.3% with synthetic data augmentation; P = .03), achieving comparable results to the internal test set (AUC, 95.0%; P = .53), whereas models without synthetic data augmentation demonstrated a performance drop upon external testing (AUC, 93.8% on internal dataset vs 83.6% on external data; P = .03). Data augmentation with synthetic patient data substantially improved performance of AI models on unseen MRI data and may be extended to other clinical conditions or tasks to mitigate domain shift, limit class imbalance, and enhance the robustness of AI applications in medical imaging. Keywords: Brain, Brain Stem, Multiple Sclerosis, Synthetic Data Augmentation, Generative Adversarial Network Supplemental material is available for this article. © RSNA, 2024.

Keywords: Brain; Brain Stem; Generative Adversarial Network; Multiple Sclerosis; Synthetic Data Augmentation.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: G.B. Consulting fees, Cercare Medical (outside the submitted work). C.J.P. Support, University Hospital Heidelberg. K.D. Grants or contracts, EU Joint Programme – Neurodegenerative Disease Research, BMBF, University Bonn, SPRIND, WomenTech EU, Medical Imaging Center Bonn; shareholder, relios.vision GmbH; patent applications: EP 23216473.1, WO 2023/237742 A1; junior member, Contrast Media Safety Committee of the European Society of Uroradiology. R.H. No relevant relationships. T.P. No relevant relationships. M.F.D. No relevant relationships. M.A.M. No relevant relationships. B.W. Grants, Deutsche Forschungsgemeinschaft, German Ministry of Education and Research, Dietmar Hopp Foundation, Klaus Tschira Foundation, Novartis, Roche; honoraria, Alexion, INSTAND, Novartis, Roche. R.D. No relevant relationships. W.W. No relevant relationships. A. Radbruch No relevant relationships. M.B. Grants, European Union, DFG; consulting fees, Boehringer Ingelheim, NeuroScios, Guerbet, Seagen; fees for lectures, Novartis, Boehringer Ingelheim, Seagen; Editor in Chief for Clinical Neuroradiology (Springer). H.M. No relevant relationships. A. Rastogi No relevant relationships. P.V. Consulting fees and stock options, Need Inc; participation in advisory board for Cercare Medical (outside the submitted work).

Figures

(A) Schematic representation of the process for generation of
synthetic MS patient data. Briefly, T1 data from the OASIS dataset was
coregistered to a matched template patient T1 data from the same training
fold. The registered T1 data were then fed to the FAST segmentation
algorithm to generate three-dimensional segmentations of the anatomic
compartments (ie, white matter, gray matter, cerebrospinal fluid).
Simultaneously, the corresponding FLAIR image from the matched patient was
fed to a U-Net lesion segmentation algorithm to produce MS lesion
segmentation maps. Two different versions of the lesion maps (with and
without random elimination of lesion volumes) were generated and merged with
the brain segmentation. The registered T1 and the merged segmentation maps
are then fed to the cGAN model to create synthetic FLAIR images based on the
patient’s template. (B) Three example output cases of synthetic
patient data. Yellow arrows indicate simulated new lesions in the synthetic
follow-up examination. cGAN = conditional generative adversarial network,
FLAIR = fluid-attenuated inversion recovery, MS = multiple
sclerosis.
Figure 1:
(A) Schematic representation of the process for generation of synthetic MS patient data. Briefly, T1 data from the OASIS dataset was coregistered to a matched template patient T1 data from the same training fold. The registered T1 data were then fed to the FAST segmentation algorithm to generate three-dimensional segmentations of the anatomic compartments (ie, white matter, gray matter, cerebrospinal fluid). Simultaneously, the corresponding FLAIR image from the matched patient was fed to a U-Net lesion segmentation algorithm to produce MS lesion segmentation maps. Two different versions of the lesion maps (with and without random elimination of lesion volumes) were generated and merged with the brain segmentation. The registered T1 and the merged segmentation maps are then fed to the cGAN model to create synthetic FLAIR images based on the patient’s template. (B) Three example output cases of synthetic patient data. Yellow arrows indicate simulated new lesions in the synthetic follow-up examination. cGAN = conditional generative adversarial network, FLAIR = fluid-attenuated inversion recovery, MS = multiple sclerosis.
ROC curves with performance of the developed models across all
datasets. Our attention-based model achieved the significantly higher
performance as compared with the simpler ResNet architecture (model 1, P
< .001). Performance of the attention-based model significantly
improved on the external UKB test set after the inclusion of synthetic data
augmentation, demonstrating similar performance compared with the internal
test set (P = .53). AUC = area under the ROC curve, CNN = convolutional
neural network, ROC = receiver operating characteristic, UKB = Bonn
University Hospital.
Figure 2:
ROC curves with performance of the developed models across all datasets. Our attention-based model achieved the significantly higher performance as compared with the simpler ResNet architecture (model 1, P < .001). Performance of the attention-based model significantly improved on the external UKB test set after the inclusion of synthetic data augmentation, demonstrating similar performance compared with the internal test set (P = .53). AUC = area under the ROC curve, CNN = convolutional neural network, ROC = receiver operating characteristic, UKB = Bonn University Hospital.

References

    1. Widner K , Virmani S , Krause J , et al. . Lessons learned from translating AI from development to deployment in healthcare . Nat Med 2023. ; 29 ( 6 ): 1304 – 1306 . - PubMed
    1. Agarwal S , Wood D , Grzeda M , et al. . Systematic Review of Artificial Intelligence for Abnormality Detection in High-volume Neuroimaging and Subgroup Meta-analysis for Intracranial Hemorrhage Detection . Clin Neuroradiol 2023. ; 33 ( 4 ): 943 – 956 . - PMC - PubMed
    1. Rajpurkar P , Lungren MP . The Current and Future State of AI Interpretation of Medical Images . N Engl J Med 2023. ; 388 ( 21 ): 1981 – 1990 . - PubMed
    1. Eche T , Schwartz LH , Mokrane FZ , Dercle L . Toward Generalizability in the Deployment of Artificial Intelligence in Radiology: Role of Computation Stress Testing to Overcome Underspecification . Radiol Artif Intell 2021. ; 3 ( 6 ): e210097 . - PMC - PubMed
    1. Domingos P . A few useful things to know about machine learning . Commun ACM 2012. ; 55 ( 10 ): 78 – 87 .

Publication types

MeSH terms

LinkOut - more resources