Comparison of Radiologists and Deep Learning for US Grading of Hepatic Steatosis

Pedro Vianna¹, Sara-Ivana Calce¹, Pamela Boustros¹, Cassandra Larocque-Rigney¹, Laurent Patry-Beaudoin¹, Yi Hui Luo¹, Emre Aslan¹, John Marinos¹, Talal M Alamri¹, Kim-Nhien Vu¹, Jessica Murphy-Lavallée¹, Jean-Sébastien Billiard¹, Emmanuel Montagnon¹, Hongliang Li¹, Samuel Kadoury¹, Bich N Nguyen¹, Shanel Gauthier¹, Benjamin Therien¹, Irina Rish¹, Eugene Belilovsky¹, Guy Wolf¹, Michaël Chassé¹, Guy Cloutier¹, An Tang¹

Affiliations

¹ From the Department of Imaging and Engineering (P.V., S.I.C., C.L.R., L.P.B., E.M., H.L., S.K., M.C., G.C., A.T.), Laboratory of Biorheology and Medical Ultrasonics (P.V., G.C.), and Clinical Laboratory of Image Processing (E.M., A.T.), Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CRCHUM), Montréal, Canada; Institute of Biomedical Engineering (P.V., G.C.) and Department of Computer Science and Operations Research (S.G., I.R., G.W.), Université de Montréal, Montréal, Canada; Departments of Radiology (S.I.C., P.B., C.L.R., L.P.B., Y.H.L., E.A., J.M., T.M.A., K.N.V., J.M.L., J.S.B., A.T.) and Pathology (B.N.N.), Centre Hospitalier de l'Université de Montréal (CHUM), 1058 rue Saint-Denis, Montréal, QC, Canada H2X 3J4; Department of Computer Engineering, École Polytechnique de Montréal, Montréal, Canada (S.K.); Mila-Quebec Artificial Intelligence Institute, Montréal, Canada (S.G., B.T., I.R., E.B., G.W.); and Department of Computer Science and Software Engineering, Concordia University, Montréal, Canada (B.T., E.B.).

PMID: 37787678
DOI: 10.1148/radiol.230659

Pedro Vianna et al. Radiology. 2023 Oct;309(1):e230659.

Abstract

Background: Screening for nonalcoholic fatty liver disease (NAFLD) is suboptimal due to the subjective interpretation of US images.

Purpose: To evaluate the agreement and diagnostic performance of radiologists and a deep learning model in grading hepatic steatosis in NAFLD at US, with biopsy as the reference standard.

Materials and Methods: This retrospective study included patients with NAFLD and control patients without hepatic steatosis who underwent abdominal US and contemporaneous liver biopsy from September 2010 to October 2019. Six readers visually graded steatosis on US images twice, 2 weeks apart. Reader agreement was assessed with use of κ statistics. Three deep learning techniques applied to B-mode US images were used to classify dichotomized steatosis grades. Classification performance of human radiologists and the deep learning model for dichotomized steatosis grades (S0, S1, S2, and S3) was assessed with area under the receiver operating characteristic curve (AUC) on a separate test set.

Results: The study included 199 patients (mean age, 53 years ± 13 [SD]; 101 men). On the test set (n = 52), radiologists had fair interreader agreement (0.34 [95% CI: 0.31, 0.37]) for classifying steatosis grades S0 versus S1 or higher, while AUCs were between 0.49 and 0.84 for radiologists and 0.85 (95% CI: 0.83, 0.87) for the deep learning model. For S0 or S1 versus S2 or S3, radiologists had fair interreader agreement (0.30 [95% CI: 0.27, 0.33]), while AUCs were between 0.57 and 0.76 for radiologists and 0.73 (95% CI: 0.71, 0.75) for the deep learning model. For S2 or lower versus S3, radiologists had fair interreader agreement (0.37 [95% CI: 0.33, 0.40]), while AUCs were between 0.52 and 0.81 for radiologists and 0.67 (95% CI: 0.64, 0.69) for the deep learning model.

Conclusion: Deep learning approaches applied to B-mode US images provided comparable performance with human readers for detection and grading of hepatic steatosis.

Published under a CC BY 4.0 license.
Supplemental material is available for this article. See also the editorial by Tuthill in this issue.
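
As an illustration of the dichotomized comparisons described in the abstract, the following minimal Python sketch shows one common way to split ordinal steatosis grades at each threshold (S0 versus S1 or higher, S0 or S1 versus S2 or S3, S2 or lower versus S3) and compute an AUC as the probability that a positive case is ranked above a negative one. This is not the authors' code: the function names `dichotomize` and `auc` and the toy grade lists are hypothetical and chosen only for this example, and the numbers are made up rather than taken from the study.

```python
from itertools import product


def dichotomize(grades, threshold):
    """Return 1 where the grade is >= threshold, else 0."""
    return [1 if g >= threshold else 0 for g in grades]


def auc(scores, labels):
    """Rank-based AUC: fraction of (positive, negative) pairs in which
    the positive case has the higher score; ties count as one half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Made-up toy values for illustration only; NOT data from the study.
biopsy_grades = [0, 0, 1, 1, 2, 2, 3, 3]  # hypothetical reference grades
reader_grades = [0, 1, 0, 1, 2, 1, 3, 2]  # hypothetical visual grades

comparisons = [
    (1, "S0 vs S1 or higher"),
    (2, "S0 or S1 vs S2 or S3"),
    (3, "S2 or lower vs S3"),
]
for threshold, name in comparisons:
    labels = dichotomize(biopsy_grades, threshold)
    print(name, round(auc(reader_grades, labels), 3))
```

The same `auc` function could be applied to continuous model outputs in place of `reader_grades`, since it only relies on the ordering of the scores.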

Comment in

Tuthill TA. Advancing AI-assisted US Screening for Fatty Liver. Radiology. 2023 Oct;309(1):e232442. doi: 10.1148/radiol.232442. PMID: 37787674.
