Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 13;5(6):e230085.
doi: 10.1148/ryai.230085. eCollection 2023 Nov.

Anonymizing Radiographs Using an Object Detection Deep Learning Algorithm

Affiliations

Anonymizing Radiographs Using an Object Detection Deep Learning Algorithm

Bardia Khosravi et al. Radiol Artif Intell. .

Abstract

Radiographic markers contain protected health information that must be removed before public release. This work presents a deep learning algorithm that localizes radiographic markers and selectively removes them to enable de-identified data sharing. The authors annotated 2000 hip and pelvic radiographs to train an object detection computer vision model. Data were split into training, validation, and test sets at the patient level. Extracted markers were then characterized using an image processing algorithm, and potentially useful markers (eg, "L" and "R") without identifying information were retained. The model achieved an area under the precision-recall curve of 0.96 on the internal test set. The de-identification accuracy was 100% (400 of 400), with a de-identification false-positive rate of 1% (eight of 632) and a retention accuracy of 93% (359 of 386) for laterality markers. The algorithm was further validated on an external dataset of chest radiographs, achieving a de-identification accuracy of 96% (221 of 231). After fine-tuning the model on 20 images from the external dataset to investigate the potential for improvement, a 99.6% (230 of 231, P = .04) de-identification accuracy and decreased false-positive rate of 5% (26 of 512) were achieved. These results demonstrate the effectiveness of a two-pass approach in image de-identification. Keywords: Conventional Radiography, Skeletal-Axial, Thorax, Experimental Investigations, Supervised Learning, Transfer Learning, Convolutional Neural Network (CNN) Supplemental material is available for this article. © RSNA, 2023 See also the commentary by Chang and Li in this issue.

Keywords: Conventional Radiography; Convolutional Neural Network (CNN); Experimental Investigations; Skeletal-Axial; Supervised Learning; Thorax; Transfer Learning.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: B.K. Member of the Radiology: Artificial Intelligence trainee editorial board. J.P.M. No relevant relationships. P.R. Member of RadioGraphics TEAM. M.J.T. No relevant relationships. A.N.L. Royalties from Globus to Mayo Orthopedics and to author; consulting fees from Globus, Orthopediatrics, Stryker, Depuy Synthes, Alexion, ZimVie, and Medtronic (all funds directed to Mayo orthopedic research); patent to Mayo on vertebral body tethering; participation on Medtronic Advisory Board and Steering Committee for Braive study; board member at POSNA and the Setting Scoliosis Straight Foundation; committee member at the Scoliosis Research Society; leadership or fiduciary role on the research council at the Pediatric Spine Study Group. B.J.E. Research chair for SIIM; stock/stock options in FlowSIGMA, VoiceIT, and Yunu; consultant to the editor for Radiology: Artificial Intelligence. C.C.W. No relevant relationships.

Figures

Overview of the proposed pipeline. * The patient identifiers
were in the same place, but they were taken out and replaced with fake
numbers, using the same font settings.
Figure 1:
Overview of the proposed pipeline. * The patient identifiers were in the same place, but they were taken out and replaced with fake numbers, using the same font settings.
(A) Marker detection on pelvic radiographs. Note that the marker areas
were obfuscated to ensure patient privacy. (B, C) Results of the localizer
model's performance on out-of-domain chest radiographs before (B) and
after (C) fine-tuning. The red boxes are zoomed-in versions of the marker areas
that were detected. As shown, the fine-tuned model draws tighter bounding boxes
with fewer nonmarker areas and also has more specificity, not mistaking
nonmarker densities, like electrocardiography leads, compared with the initial
model.
Figure 2:
(A) Marker detection on pelvic radiographs. Note that the marker areas were obfuscated to ensure patient privacy. (B, C) Results of the localizer model's performance on out-of-domain chest radiographs before (B) and after (C) fine-tuning. The red boxes are zoomed-in versions of the marker areas that were detected. As shown, the fine-tuned model draws tighter bounding boxes with fewer nonmarker areas and also has more specificity, not mistaking nonmarker densities, like electrocardiography leads, compared with the initial model.
Cases of model failure on the (A) internal and external datasets (B)
before and (C) after fine-tuning.
Figure 3:
Cases of model failure on the (A) internal and external datasets (B) before and (C) after fine-tuning.

References

    1. National Health Services Government Statistical Service (NHS-GSS) . Diagnostic Imaging Dataset 2021-22 Data . https://www.england.nhs.uk/statistics/statistical-work-areas/diagnostic-.... Published 2023. Accessed January 4, 2023.
    1. Barry K , Kumar S , Linke R , Dawes E . A clinical audit of anatomical side marker use in a paediatric medical imaging department . J Med Radiat Sci 2016. ; 63 ( 3 ): 148 – 154 . - PMC - PubMed
    1. Edwards A , Hollin I , Barry J , Kachnowski S . Barriers to cross--institutional health information exchange: a literature review . J Healthc Inf Manag 2010. ; 24 ( 3 ): 22 – 34 . - PubMed
    1. Institute of Medicine · Board on Health Care Services · Board on Health Sciences Policy · Committee on Health Research and the Privacy of Health Information: The HIPAA Privacy Rule . Beyond the HIPAA Privacy Rule: Enhancing Privacy, Improving Health Through Research . National Academies Press; ; 2009. . https://play.google.com/store/books/details?id=jUGcAgAAQBAJ. Accessed February 1, 2023. - PubMed
    1. Rutherford M , Mun SK , Levine B , et al. . A DICOM dataset for evaluation of medical image de-identification . Sci Data 2021. ; 8 ( 1 ): 183 . - PMC - PubMed

LinkOut - more resources