Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep;40(9):2306-2317.
doi: 10.1109/TMI.2021.3075856. Epub 2021 Aug 31.

Results of the 2020 fastMRI Challenge for Machine Learning MR Image Reconstruction

Results of the 2020 fastMRI Challenge for Machine Learning MR Image Reconstruction

Matthew J Muckley et al. IEEE Trans Med Imaging. 2021 Sep.

Abstract

Accelerating MRI scans is one of the principal outstanding problems in the MRI research community. Towards this goal, we hosted the second fastMRI competition targeted towards reconstructing MR images with subsampled k-space data. We provided participants with data from 7,299 clinical brain scans (de-identified via a HIPAA-compliant procedure by NYU Langone Health), holding back the fully-sampled data from 894 of these scans for challenge evaluation purposes. In contrast to the 2019 challenge, we focused our radiologist evaluations on pathological assessment in brain images. We also debuted a new Transfer track that required participants to submit models evaluated on MRI scanners from outside the training set. We received 19 submissions from eight different groups. Results showed one team scoring best in both SSIM scores and qualitative radiologist evaluations. We also performed analysis on alternative metrics to mitigate the effects of background noise and collected feedback from the participants to inform future challenges. Lastly, we identify common failure modes across the submissions, highlighting areas of need for future research in the MRI reconstruction community.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Examples of 4X submissions evaluated by radiologists with slice-level SSIM scores. All methods reasonably reconstructed T2 and FLAIR images. The ATB and Neurospin methods struggled with a susceptibility region, exaggerating the focus of susceptibility and introducing a few false vessels between the susceptibility and the lateral ventricular wall. In other cases, radiologists observed mild smoothing of white matter regions on T1POST images.
Fig. 2.
Fig. 2.
Examples of 8X submissions evaluated by radiologists with slice-level SSIM scores. At this level of acceleration fine details are smoothed and obscured for all contrasts. On T1POST images, AIRS Medical was relatively more successful than ATB and Neurospin in showing fine details of the mass, particularly in its periphery. Noticeable on the FLAIR images are horizontal “banding” effects that arise from how neural networks interact with anisotropic sampling patterns.
Fig. 3.
Fig. 3.
Examples of 4X Transfer submissions evaluated by radiologists with slice-level SSIM scores. The T1POST and T2 examples are from GE scanners, whereas the FLAIR example is from a Philips scanner. All methods introduced blurring to the images. Several methods had trouble adapting to the GE data while performing relatively well on the Philips data, as seen in the form of aliasing artifacts in one of the T1POST images.
Fig. 4.
Fig. 4.
Summary of SSIM values across contestants. (top) Model perfomance for teams submitting to the main 4X and 8X Siemens competition tracks. (bottom) Model performance for teams submitting to the Transfer track (combination of GE and Philips data). The “AVG” model score for the Transfer track was a simple average across all volumes in the Transfer track.
Fig. 5.
Fig. 5.
Scatter plot of mean radiologist rank across cases. The horizontal axis has a separate tick for each case evaluated by the radiologist cohort. The scatter plot markers indicate whether that method was from the team with the highest, middle, or lowest SSIM scores. We generally observed radiologists awarding the best ranks to models with the best SSIM score.
Fig. 6.
Fig. 6.
Examples of reconstruction hallucinations among challenge submissions with SSIM scores over residual plots (residuals magnified by 5). (top) A 4X submission from Neurospin generated a false vessel, possibly related to susceptibilities introduced by surgical staples. (middle) An 8X submission from ATB introduced a linear bright signal mimicking a cleft of cerebrospinal fluid, as well as blurring of the boundaries of the extra-axial mass. (bottom) A submission from ResoNNance introduced a false sulcus or prominent vessel.

References

    1. Chetlur S et al., “cuDNN: Efficient primitives for deep learning,” 2014, arXiv:1410.0759 [Online]. Available: http://arxiv.org/abs/1410.0759
    1. Abadi M et al., “TensorFlow: A system for large-scale machine learning,” in Proc. USENIX Conf. Oper. Syst. Design Implement, 2016, pp. 265–283.
    1. Paszke A et al., “PyTorch: An imperative style, high-performance deep learning library,” in Proc. Adv. Neural Inf. Process. Syst, 2019, pp. 8026–8037.
    1. Yang Y, Sun J, Li H, and Xu Z, “Deep ADMM-Net for compressive sensing MRI,” in Proc. Adv. Neural Inf. Process. Syst, 2016, pp. 10–18.
    1. Hammernik K et al., “Learning a variational network for reconstruction of accelerated MRI data,” Magn. Reson. Med, vol. 79, no. 6, pp. 3055–3071, Jun. 2018. - PMC - PubMed

Publication types