Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 3;13(1):21-30.
doi: 10.1007/s13534-022-00249-5. eCollection 2023 Feb.

Localization of lung abnormalities on chest X-rays using self-supervised equivariant attention

Affiliations

Localization of lung abnormalities on chest X-rays using self-supervised equivariant attention

Gavin D'Souza et al. Biomed Eng Lett. .

Abstract

Chest X-Ray (CXR) images provide most anatomical details and the abnormalities on a 2D plane. Therefore, a 2D view of the 3D anatomy is sometimes sufficient for the initial diagnosis. However, close to fourteen commonly occurring diseases are sometimes difficult to identify by visually inspecting the images. Therefore, there is a drift toward developing computer-aided assistive systems to help radiologists. This paper proposes a deep learning model for the classification and localization of chest diseases by using image-level annotations. The model consists of a modified Resnet50 backbone for extracting feature corpus from the images, a classifier, and a pixel correlation module (PCM). During PCM training, the network is a weight-shared siamese architecture where the first branch applies the affine transform to the image before feeding to the network, while the second applies the same transform to the network output. The method was evaluated on CXR from the clinical center in the ratio of 70:20 for training and testing. The model was developed and tested using the cloud computing platform Google Colaboratory (NVidia Tesla P100 GPU, 16 GB of RAM). A radiologist subjectively validated the results. Our model trained with the configurations mentioned in this paper outperformed benchmark results.

Supplementary information: The online version contains supplementary material available at 10.1007/s13534-022-00249-5.

Keywords: CAM; Pixel correlation module; ResNet50; Self-attention; Self-supervised equivariant attention; Siamese network; Weak supervision.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestNone.

Figures

Fig. 1
Fig. 1
Lung diseases and their appearance on anterior to posterior CXR pose a great challenge to validate in a CAD system (images from [1]). Few examples, a Atelatasis, b cardiomegaly, c effusion, d Infiltration
Fig. 2
Fig. 2
a Multi-resolution ResNet backbone architecture that sequentially reintegrates spatial information from previous layer outputs using the Merge Block. b shows the internal structure of the Merge Block which uses upsampling and concatenation to merge feature maps.
Fig. 3
Fig. 3
Structure of PCM. H, W, Cin /C denote the height, width and channel length of feature maps and original cams
Fig. 4
Fig. 4
Siamese Network for SEAM during PCM training. PCM refines the pixel-wise CAM predictions by using context information from feature maps produced by the backbone feature extractor. Image augmentations by affine transformations create inconsistencies in the generated CAMs compared to the original CAMs. Equivariant regularization enables the use of these inconsistencies for PCM training. MSML loss provides an additional source of supervision
Fig. 5
Fig. 5
a Shows the training loss curves during training for all models. b Shows the validation loss curves during training for all models.
Fig. 6
Fig. 6
ROC Curve for the performance of MultiResolution-ResNet50
Fig. 7
Fig. 7
Sample localization heatmaps and bounding boxes generated by standard ResNet50 (left), MultiRes-ResNet50 (Middle), and MultiRes-ResNet50 CAMs refined using PCM (right). In each figure, green bounding boxes indicate ground truths, whereas red bounding boxes indicate predicted predictions by the corresponding model

References

    1. Wang X, Peng Y, Lu L, Lu Z, Bagheri M, Summers R. ChestX-Ray8: hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. in 2017 IEEE Conference On Computer Vision And Pattern Recognition (CVPR). 2017; 10.1109/cvpr.2017.369
    1. Nijiati M, Zhang Z, Abulizi A, Miao H, Tuluhong A, Quan S, et al. Deep learning assistance for tuberculosis diagnosis with chest radiography in low-resource settings. J X-Ray Sci Technol. 2021;29(5):785–796. doi: 10.3233/xst-210894. - DOI - PubMed
    1. Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren M, Ng A, CheXNet: radiologist-level pneumonia detection on chest X-rays with deep learning. 2017; https://arxiv.org/abs/1711.05225.
    1. Huang G, Liu Z, van der Maaten L, Weinberger K, Densely connected convolutional networks, 2016, https://arxiv.org/abs/1608.06993
    1. Yao L, Poblenz E, Dagunts D, Covington B, Bernard D, Lyman K. Learning to diagnose from scratch by exploiting dependencies among labels, 2017. https://arxiv.org/abs/1710.10501

LinkOut - more resources