Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2024 Sep;6(5):e240076.
doi: 10.1148/ryai.240076.

Artificial Intelligence Outcome Prediction in Neonates with Encephalopathy (AI-OPiNE)

Affiliations
Multicenter Study

Artificial Intelligence Outcome Prediction in Neonates with Encephalopathy (AI-OPiNE)

Christopher O Lew et al. Radiol Artif Intell. 2024 Sep.

Abstract

Purpose To develop a deep learning algorithm to predict 2-year neurodevelopmental outcomes in neonates with hypoxic-ischemic encephalopathy using MRI and basic clinical data. Materials and Methods In this study, MRI data of term neonates with encephalopathy in the High-dose Erythropoietin for Asphyxia and Encephalopathy (HEAL) trial (ClinicalTrials.gov: NCT02811263), who were enrolled from 17 institutions between January 25, 2017, and October 9, 2019, were retrospectively analyzed. The harmonized MRI protocol included T1-weighted, T2-weighted, and diffusion tensor imaging. Deep learning classifiers were trained to predict the primary outcome of the HEAL trial (death or any neurodevelopmental impairment at 2 years) using multisequence MRI and basic clinical variables, including sex and gestational age at birth. Model performance was evaluated on test sets comprising 10% of cases from 15 institutions (in-distribution test set, n = 41) and 10% of cases from two institutions (out-of-distribution test set, n = 41). Model performance in predicting additional secondary outcomes, including death alone, was also assessed. Results For the 414 neonates (mean gestational age, 39 weeks ± 1.4 [SD]; 232 male, 182 female), in the study cohort, 198 (48%) died or had any neurodevelopmental impairment at 2 years. The deep learning model achieved an area under the receiver operating characteristic curve (AUC) of 0.74 (95% CI: 0.60, 0.86) and 63% accuracy in the in-distribution test set and an AUC of 0.77 (95% CI: 0.63, 0.90) and 78% accuracy in the out-of-distribution test set. Performance was similar or better for predicting secondary outcomes. Conclusion Deep learning analysis of neonatal brain MRI yielded high performance for predicting 2-year neurodevelopmental outcomes. Keywords: Convolutional Neural Network (CNN), Prognosis, Pediatrics, Brain, Brain Stem Clinical trial registration no. NCT02811263 Supplemental material is available for this article. © RSNA, 2024 See also commentary by Rafful and Reis Teixeira in this issue.

Keywords: Brain; Brain Stem; Convolutional Neural Network (CNN); Pediatrics; Prognosis.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: C.O.L. No relevant relationships. E.C. No relevant relationships. J.V.C. No relevant relationships. F.T. No relevant relationships. G.C. No relevant relationships. A.L. No relevant relationships. J.F. No relevant relationships. S.J. Support for the present article from the National Institutes of Health (NIH) National Institute of Neurological Disorders and Stroke (NINDS) grants 1U01NS092764 and U01NS092553, paid to author’s institution; grants or contracts from the NINDS (1R13NS127525-01, 2023-2024 and R01HD101422-01A1, 2021-2026), the National Institute of Child Health and Human Development (NICHD) (P50 HD103524, 2020-2025 and 1R01HD107003-01, 2022-2027), and COOL Prime (2022), all paid to author’s institution; royalties from Elsevier for editing Avery’s Diseases of the Newborn, 10th edition, paid to author directly; support from the above-mentioned grants for attending meetings and/or travel, paid to author’s institution; participation on a Data Safety monitoring board or advisory board for ALBINO, COOL Prime, and 1K23HL150300-01A1 (Enteral iron supplementation and intestinal health in preterm infants), no payment; director of the Institute on Human Development and Disability, paid position, paid to author directly. A.M. NIH grant RO1: HEAL trial. R.C.M. Grant 5U01NS092764 High-dose Erythropoietin for Asphyxia and Encephalopathy (HEAL), payments made to Washington University; support from Siemens Healthcare and Philips Healthcare for travel and meals, to learn about MRI and CT scanners (author helps make purchasing decisions for health care system, unrelated to the research in this article); stock options from Turing Medical for medical advisory board participation (Turing makes software and hardware for MRI scanners, their products were not used in this article). J.L.W. Support for the present article provided by the NIH (U01NS092764 and U01NS092553), author received no additional funds toward this work. A.R. Grant support from GE HealthCare, the Hydrocephalus Association, UCSF Helen Diller Comprehensive Cancer Center, NCCN, and the Society for Pediatric Radiology; consulting fees from Arterys; stock options for MRImatch. Y.W.W. NIH grant U01NS092764; participation on the NICHD Neonatal Research Network Data Safety Monitoring Committee. Y.L. NIH grant U01 NS092764-01; leadership or fiduciary role on the American Society of Pediatric Radiology Board of Directors.

Figures

None
Graphical abstract
Flow diagram for participants included in each subset. Participants
were enrolled as part of the High-dose Erythropoietin for Asphyxia and
Encephalopathy (HEAL) study from 17 different institutions within the United
States. The out-of-distribution test set contains all participants from two
institutions. Participants from the remaining 15 institutions were randomly
split into a training set, validation set, and in-distribution test set with
an equal distribution of institutions in each subset.
Figure 1:
Flow diagram for participants included in each subset. Participants were enrolled as part of the High-dose Erythropoietin for Asphyxia and Encephalopathy (HEAL) study from 17 different institutions within the United States. The out-of-distribution test set contains all participants from two institutions. Participants from the remaining 15 institutions were randomly split into a training set, validation set, and in-distribution test set with an equal distribution of institutions in each subset.
Example MR images for a participant with moderate neurodevelopmental
impairment (NDI) (top row) and no death or NDI (bottom row) at 2-year
follow-up. Neither set of images reveals any definite focal areas of brain
injury, which highlights the difficulty in prognostication based on MRI. ADC
= apparent diffusion coefficient.
Figure 2:
Example MR images for a participant with moderate neurodevelopmental impairment (NDI) (top row) and no death or NDI (bottom row) at 2-year follow-up. Neither set of images reveals any definite focal areas of brain injury, which highlights the difficulty in prognostication based on MRI. ADC = apparent diffusion coefficient.
Flowchart of (A) imaging preprocessing steps and (B) model
architecture overview. *Only T1- and T2-weighted images underwent
bias correction.
Figure 3:
Flowchart of (A) imaging preprocessing steps and (B) model architecture overview. *Only T1- and T2-weighted images underwent bias correction.
Graph of receiver operating characteristic curves for predictions in
the in-distribution and out-of-distribution test sets. The Outcome
Prediction in Neonates with Encephalopathy (OPiNE) model used T1-weighted,
T2-weighted, apparent diffusion coefficient, trace, and readily available
clinical tabular data to perform predictions. The tabular data–only
model used logistic regression on the tabular data only. Radiologist scoring
used the same imaging sequences as the OPiNE model. All methods were
compared using the DeLong method, and there was a difference between the
OPiNE and tabular data–only model in the out-of-distribution test
set. All other comparisons demonstrated no difference. AUC = area under the
receiver operating characteristic curve.
Figure 4:
Graph of receiver operating characteristic curves for predictions in the in-distribution and out-of-distribution test sets. The Outcome Prediction in Neonates with Encephalopathy (OPiNE) model used T1-weighted, T2-weighted, apparent diffusion coefficient, trace, and readily available clinical tabular data to perform predictions. The tabular data–only model used logistic regression on the tabular data only. Radiologist scoring used the same imaging sequences as the OPiNE model. All methods were compared using the DeLong method, and there was a difference between the OPiNE and tabular data–only model in the out-of-distribution test set. All other comparisons demonstrated no difference. AUC = area under the receiver operating characteristic curve.
Gradient-weighted class activation mapping overlaid on T1-weighted,
T2-weighted, apparent diffusion coefficient (ADC), and trace images at the
level of the basal ganglia and thalami for cases of death or
neurodevelopmental impairment (NDI, top row) and no death or NDI (bottom
row). Gradients of the final convolutional layer were scaled between 0 and 1
and demonstrate salient areas of the image used in classification.
Gestational age and sex for each neonate included in the figure, from left
to right, are: upper row, 39-week male, 36-week male, 36-week male, and
39-week male; lower row, 40-week male, 39-week female, 41-week male, and
40-week female.
Figure 5:
Gradient-weighted class activation mapping overlaid on T1-weighted, T2-weighted, apparent diffusion coefficient (ADC), and trace images at the level of the basal ganglia and thalami for cases of death or neurodevelopmental impairment (NDI, top row) and no death or NDI (bottom row). Gradients of the final convolutional layer were scaled between 0 and 1 and demonstrate salient areas of the image used in classification. Gestational age and sex for each neonate included in the figure, from left to right, are: upper row, 39-week male, 36-week male, 36-week male, and 39-week male; lower row, 40-week male, 39-week female, 41-week male, and 40-week female.

Comment in

Similar articles

Cited by

References

    1. Lawn JE , Kerber K , Enweronu-Laryea C , Cousens S . 3.6 million neonatal deaths--what is progressing and what is not? Semin Perinatol 2010. ; 34 ( 6 ): 371 – 386 . - PubMed
    1. Heinz ER , Provenzale JM . Imaging findings in neonatal hypoxia: a practical review . AJR Am J Roentgenol 2009. ; 192 ( 1 ): 41 – 47 . - PubMed
    1. Miller SP , Ramaswamy V , Michelson D , et al. . Patterns of brain injury in term neonatal encephalopathy . J Pediatr 2005. ; 146 ( 4 ): 453 – 460 . - PubMed
    1. Dixon BJ , Reis C , Ho WM , Tang J , Zhang JH . Neuroprotective Strategies after Neonatal Hypoxic Ischemic Encephalopathy . Int J Mol Sci 2015. ; 16 ( 9 ): 22368 – 22401 . - PMC - PubMed
    1. Goergen SK , Ang H , Wong F , et al. . Early MRI in term infants with perinatal hypoxic-ischaemic brain injury: interobserver agreement and MRI predictors of outcome at 2 years . Clin Radiol 2014. ; 69 ( 1 ): 72 – 81 . - PubMed

Publication types

Associated data