Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 29;14(15):1634.
doi: 10.3390/diagnostics14151634.

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Affiliations

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Yufeng Zhang et al. Diagnostics (Basel). .

Abstract

Pediatric respiratory disease diagnosis and subsequent treatment require accurate and interpretable analysis. A chest X-ray is the most cost-effective and rapid method for identifying and monitoring various thoracic diseases in children. Recent developments in self-supervised and transfer learning have shown their potential in medical imaging, including chest X-ray areas. In this article, we propose a three-stage framework with knowledge transfer from adult chest X-rays to aid the diagnosis and interpretation of pediatric thorax diseases. We conducted comprehensive experiments with different pre-training and fine-tuning strategies to develop transformer or convolutional neural network models and then evaluate them qualitatively and quantitatively. The ViT-Base/16 model, fine-tuned with the CheXpert dataset, a large chest X-ray dataset, emerged as the most effective, achieving a mean AUC of 0.761 (95% CI: 0.759-0.763) across six disease categories and demonstrating a high sensitivity (average 0.639) and specificity (average 0.683), which are indicative of its strong discriminative ability. The baseline models, ViT-Small/16 and ViT-Base/16, when directly trained on the Pediatric CXR dataset, only achieved mean AUC scores of 0.646 (95% CI: 0.641-0.651) and 0.654 (95% CI: 0.648-0.660), respectively. Qualitatively, our model excels in localizing diseased regions, outperforming models pre-trained on ImageNet and other fine-tuning approaches, thus providing superior explanations. The source code is available online and the data can be obtained from PhysioNet.

Keywords: chest X-ray; medical image analysis; model interpretability; self-supervised learning; transfer learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interests.

Figures

Figure 1
Figure 1
The overall workflow of PediCXR classification task. It consists of three stages. (a) Pre-training stage: self-supervised learning is performed using MAE on natural images or adult CXRs. (b) Adult CXR fine-tuning stage: the trained encoder undergoes supervised learning with the adult CXR dataset. (c) Knowledge-transferring stage: the trained encoder is further linear-probed/fine-tuned on the PediCXR dataset for specific knowledge acquisition.
Figure 2
Figure 2
Grad-CAM visualizations on four pediatric CXR samples. The first column on the left, featuring (ad) as four randomly drawn diseased samples, displays the original CXR, recognized as the ground truth, with the diseased areas highlighted in red boxes. The subsequent columns showcase saliency maps created with various initializations overlaying on the original X-ray images. The bright colors signify areas of relevance to the model’s predictions.
Figure 3
Figure 3
t-SNE comparison of image representations from ViT-Base/16 models (DBI is presented along with the title): (a) supervised training with random initialization; (b) pre-trained on ImageNet using MAE; (c) pre-trained on adult CXR using; (d) pre-trained on adult CXR with MAE and subsequently fine-tuned using CheXpert data.

Similar articles

Cited by

References

    1. Reyes M.A., Etinger V., Hronek C., Hall M., Davidson A., Mangione-Smith R., Kaiser S.V., Parikh K. Pediatric respiratory illnesses: An update on achievable benchmarks of care. Pediatrics. 2023;152:e2022058389. doi: 10.1542/peds.2022-058389. - DOI - PubMed
    1. World Health Organization . Stakeholder Consultative Meeting on Prevention and Management of Childhood Pneumonia and Diarrhoea: Report, 12–14 October 2021. World Health Organization; Geneva, Switzerland: 2022.
    1. Rahman T., Chowdhury M.E., Khandakar A., Islam K.R., Islam K.F., Mahbub Z.B., Kadir M.A., Kashem S. Transfer learning with deep convolutional neural network (CNN) for pneumonia detection using chest X-ray. Appl. Sci. 2020;10:3233. doi: 10.3390/app10093233. - DOI
    1. Banerjee A., Sarkar A., Roy S., Singh P.K., Sarkar R. COVID-19 chest X-ray detection through blending ensemble of CNN snapshots. Biomed. Signal Process. Control. 2022;78:104000. doi: 10.1016/j.bspc.2022.104000. - DOI - PMC - PubMed
    1. Chen S., Ren S., Wang G., Huang M., Xue C. Interpretable cnn-multilevel attention transformer for rapid recognition of pneumonia from chest X-ray images. IEEE J. Biomed. Health Inform. 2023;28:753–764. doi: 10.1109/JBHI.2023.3247949. - DOI - PubMed

LinkOut - more resources