Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet
- PMID: 30481176
- PMCID: PMC6258509
- DOI: 10.1371/journal.pmed.1002699
Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet
Abstract
Background: Magnetic resonance imaging (MRI) of the knee is the preferred method for diagnosing knee injuries. However, interpretation of knee MRI is time-intensive and subject to diagnostic error and variability. An automated system for interpreting knee MRI could prioritize high-risk patients and assist clinicians in making diagnoses. Deep learning methods, in being able to automatically learn layers of features, are well suited for modeling the complex relationships between medical images and their interpretations. In this study we developed a deep learning model for detecting general abnormalities and specific diagnoses (anterior cruciate ligament [ACL] tears and meniscal tears) on knee MRI exams. We then measured the effect of providing the model's predictions to clinical experts during interpretation.
Methods and findings: Our dataset consisted of 1,370 knee MRI exams performed at Stanford University Medical Center between January 1, 2001, and December 31, 2012 (mean age 38.0 years; 569 [41.5%] female patients). The majority vote of 3 musculoskeletal radiologists established reference standard labels on an internal validation set of 120 exams. We developed MRNet, a convolutional neural network for classifying MRI series and combined predictions from 3 series per exam using logistic regression. In detecting abnormalities, ACL tears, and meniscal tears, this model achieved area under the receiver operating characteristic curve (AUC) values of 0.937 (95% CI 0.895, 0.980), 0.965 (95% CI 0.938, 0.993), and 0.847 (95% CI 0.780, 0.914), respectively, on the internal validation set. We also obtained a public dataset of 917 exams with sagittal T1-weighted series and labels for ACL injury from Clinical Hospital Centre Rijeka, Croatia. On the external validation set of 183 exams, the MRNet trained on Stanford sagittal T2-weighted series achieved an AUC of 0.824 (95% CI 0.757, 0.892) in the detection of ACL injuries with no additional training, while an MRNet trained on the rest of the external data achieved an AUC of 0.911 (95% CI 0.864, 0.958). We additionally measured the specificity, sensitivity, and accuracy of 9 clinical experts (7 board-certified general radiologists and 2 orthopedic surgeons) on the internal validation set both with and without model assistance. Using a 2-sided Pearson's chi-squared test with adjustment for multiple comparisons, we found no significant differences between the performance of the model and that of unassisted general radiologists in detecting abnormalities. General radiologists achieved significantly higher sensitivity in detecting ACL tears (p-value = 0.002; q-value = 0.019) and significantly higher specificity in detecting meniscal tears (p-value = 0.003; q-value = 0.019). Using a 1-tailed t test on the change in performance metrics, we found that providing model predictions significantly increased clinical experts' specificity in identifying ACL tears (p-value < 0.001; q-value = 0.006). The primary limitations of our study include lack of surgical ground truth and the small size of the panel of clinical experts.
Conclusions: Our deep learning model can rapidly generate accurate clinical pathology classifications of knee MRI exams from both internal and external datasets. Moreover, our results support the assertion that deep learning models can improve the performance of clinical experts during medical imaging interpretation. Further research is needed to validate the model prospectively and to determine its utility in the clinical setting.
Conflict of interest statement
I have read the journal's policy and the authors of this manuscript have the following competing interests: CL is a shareholder of whiterabbit.ai and nines.ai. Since submitting this manuscript, RLB has joined and received stock options from Roam Analytics, whose mission is to use AI methodology to improve human health.
Figures






Similar articles
-
Deep learning to detect anterior cruciate ligament tear on knee MRI: multi-continental external validation.Eur Radiol. 2022 Dec;32(12):8394-8403. doi: 10.1007/s00330-022-08923-z. Epub 2022 Jun 21. Eur Radiol. 2022. PMID: 35726103
-
Deep learning-assisted detection of meniscus and anterior cruciate ligament combined tears in adult knee magnetic resonance imaging: a crossover study with arthroscopy correlation.Int Orthop. 2025 Jul;49(7):1689-1697. doi: 10.1007/s00264-025-06531-2. Epub 2025 Apr 28. Int Orthop. 2025. PMID: 40293511
-
Automatic Detection of Meniscus Tears Using Backbone Convolutional Neural Networks on Knee MRI.J Magn Reson Imaging. 2023 Mar;57(3):740-749. doi: 10.1002/jmri.28284. Epub 2022 Jun 1. J Magn Reson Imaging. 2023. PMID: 35648374
-
Diagnostic Performance of Magnetic Resonance Imaging for Detecting Meniscal Ramp Lesions in Patients With Anterior Cruciate Ligament Tears: A Systematic Review and Meta-analysis.Am J Sports Med. 2020 Jul;48(8):2051-2059. doi: 10.1177/0363546519880528. Epub 2019 Nov 4. Am J Sports Med. 2020. PMID: 31684739
-
Diagnostic Performance of Artificial Intelligence for Detection of Anterior Cruciate Ligament and Meniscus Tears: A Systematic Review.Arthroscopy. 2021 Feb;37(2):771-781. doi: 10.1016/j.arthro.2020.09.012. Epub 2020 Sep 18. Arthroscopy. 2021. PMID: 32956803
Cited by
-
Inconsistency in the use of the term "validation" in studies reporting the performance of deep learning algorithms in providing diagnosis from medical imaging.PLoS One. 2020 Sep 11;15(9):e0238908. doi: 10.1371/journal.pone.0238908. eCollection 2020. PLoS One. 2020. PMID: 32915901 Free PMC article.
-
A practical guide to the implementation of AI in orthopaedic research - part 1: opportunities in clinical application and overcoming existing challenges.J Exp Orthop. 2023 Nov 16;10(1):117. doi: 10.1186/s40634-023-00683-z. J Exp Orthop. 2023. PMID: 37968370 Free PMC article. Review.
-
Self-supervised pre-training with contrastive and masked autoencoder methods for dealing with small datasets in deep learning for medical imaging.Sci Rep. 2023 Nov 20;13(1):20260. doi: 10.1038/s41598-023-46433-0. Sci Rep. 2023. PMID: 37985685 Free PMC article.
-
Understanding the Research Landscape of Deep Learning in Biomedical Science: Scientometric Analysis.J Med Internet Res. 2022 Apr 22;24(4):e28114. doi: 10.2196/28114. J Med Internet Res. 2022. PMID: 35451980 Free PMC article.
-
LigaNET: A multi-modal deep learning approach to predict the risk of subsequent anterior cruciate ligament injury after surgery.medRxiv [Preprint]. 2023 Jul 27:2023.07.25.23293102. doi: 10.1101/2023.07.25.23293102. medRxiv. 2023. PMID: 37546855 Free PMC article. Preprint.
References
-
- Nacey NC, Geeslin MG, Miller GW, Pierce JL. Magnetic resonance imaging of the knee: an overview and update of conventional and state of the art imaging. J Magn Reson Imaging. 2017;45:1257–75. 10.1002/jmri.25620 - DOI - PubMed
-
- Naraghi AM, White LM. Imaging of athletic injuries of knee ligaments and menisci: sports imaging series. Radiology. 2016;281:23–40. 10.1148/radiol.2016152320 - DOI - PubMed
-
- Helms CA. Magnetic resonance imaging of the knee In: Brant WE, Helms CA, editors. Fundamentals of diagnostic radiology. Philadelphia: Lippincott Williams & Wilkins; 2007. pp. 1193–204.
-
- Oei EH, Nikken JJ, Verstijnen AC, Ginai AZ, Myriam Hunink MG. MR Imaging of the menisci and cruciate ligaments: a systematic review. Radiology. 2003;226:837–48. 10.1148/radiol.2263011892 - DOI - PubMed
-
- Rangger C, Klestil T, Kathrein A, Inderster A, Hamid L. Influence of magnetic resonance imaging on indications for arthroscopy of the knee. Clin Orthop Relat Res. 1996;330:133–42. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases