. 2024 Jul 29;14(15):1634.

doi: 10.3390/diagnostics14151634.

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Yufeng Zhang¹, Joseph Kohne², Emily Wittrup¹, Kayvan Najarian^{1

3

4

5}

Affiliations

¹ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
² Department of Pediatrics, University of Michigan, Ann Arbor, MI 48103, USA.
³ Michigan Institute for Data Science (MIDAS), University of Michigan, Ann Arbor, MI 48109, USA.
⁴ Department of Emergency Medicine, University of Michigan, Ann Arbor, MI 48109, USA.
⁵ Max Harry Weil Institute for Critical Care Research and Innovation, University of Michigan, Ann Arbor, MI 48109, USA.

PMID: 39125510
PMCID: PMC11312211
DOI: 10.3390/diagnostics14151634

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Yufeng Zhang et al. Diagnostics (Basel). 2024.

. 2024 Jul 29;14(15):1634.

doi: 10.3390/diagnostics14151634.

Authors

Yufeng Zhang¹, Joseph Kohne², Emily Wittrup¹, Kayvan Najarian^{1

3

4

5}

Affiliations

¹ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
² Department of Pediatrics, University of Michigan, Ann Arbor, MI 48103, USA.
³ Michigan Institute for Data Science (MIDAS), University of Michigan, Ann Arbor, MI 48109, USA.
⁴ Department of Emergency Medicine, University of Michigan, Ann Arbor, MI 48109, USA.
⁵ Max Harry Weil Institute for Critical Care Research and Innovation, University of Michigan, Ann Arbor, MI 48109, USA.

PMID: 39125510
PMCID: PMC11312211
DOI: 10.3390/diagnostics14151634

Abstract

Pediatric respiratory disease diagnosis and subsequent treatment require accurate and interpretable analysis. A chest X-ray is the most cost-effective and rapid method for identifying and monitoring various thoracic diseases in children. Recent developments in self-supervised and transfer learning have shown their potential in medical imaging, including chest X-ray areas. In this article, we propose a three-stage framework with knowledge transfer from adult chest X-rays to aid the diagnosis and interpretation of pediatric thorax diseases. We conducted comprehensive experiments with different pre-training and fine-tuning strategies to develop transformer or convolutional neural network models and then evaluate them qualitatively and quantitatively. The ViT-Base/16 model, fine-tuned with the CheXpert dataset, a large chest X-ray dataset, emerged as the most effective, achieving a mean AUC of 0.761 (95% CI: 0.759-0.763) across six disease categories and demonstrating a high sensitivity (average 0.639) and specificity (average 0.683), which are indicative of its strong discriminative ability. The baseline models, ViT-Small/16 and ViT-Base/16, when directly trained on the Pediatric CXR dataset, only achieved mean AUC scores of 0.646 (95% CI: 0.641-0.651) and 0.654 (95% CI: 0.648-0.660), respectively. Qualitatively, our model excels in localizing diseased regions, outperforming models pre-trained on ImageNet and other fine-tuning approaches, thus providing superior explanations. The source code is available online and the data can be obtained from PhysioNet.

Keywords: chest X-ray; medical image analysis; model interpretability; self-supervised learning; transfer learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interests.

Figures

**Figure 1**
**The overall workflow of PediCXR classification task**. It consists of three stages. (a) Pre-training stage: self-supervised learning is performed using MAE on natural images or adult CXRs. (b) Adult CXR fine-tuning stage: the trained encoder undergoes supervised learning with the adult CXR dataset. (c) Knowledge-transferring stage: the trained encoder is further linear-probed/fine-tuned on the PediCXR dataset for specific knowledge acquisition.

**Figure 2**
**Grad-CAM visualizations on four pediatric CXR samples**. The first column on the left, featuring (a–d) as four randomly drawn diseased samples, displays the original CXR, recognized as the ground truth, with the diseased areas highlighted in red boxes. The subsequent columns showcase saliency maps created with various initializations overlaying on the original X-ray images. The bright colors signify areas of relevance to the model’s predictions.

**Figure 3**
**t-SNE comparison of image representations from ViT-Base/16 models (DBI is presented along with the title)**: (a) supervised training with random initialization; (b) pre-trained on ImageNet using MAE; (c) pre-trained on adult CXR using; (d) pre-trained on adult CXR with MAE and subsequently fine-tuned using CheXpert data.

See this image and copyright information in PMC

Cited by

Self-supervised learning framework application for medical image analysis: a review and summary.
Zeng X, Abdullah N, Sumari P. Zeng X, et al. Biomed Eng Online. 2024 Oct 27;23(1):107. doi: 10.1186/s12938-024-01299-9. Biomed Eng Online. 2024. PMID: 39465395 Free PMC article. Review.
Gradual poisoning of a chest x-ray convolutional neural network with an adversarial attack and AI explainability methods.
Lee SB. Lee SB. Sci Rep. 2025 Jul 1;15(1):21779. doi: 10.1038/s41598-025-02294-3. Sci Rep. 2025. PMID: 40593872 Free PMC article.

References

1. Reyes M.A., Etinger V., Hronek C., Hall M., Davidson A., Mangione-Smith R., Kaiser S.V., Parikh K. Pediatric respiratory illnesses: An update on achievable benchmarks of care. Pediatrics. 2023;152:e2022058389. doi: 10.1542/peds.2022-058389. - DOI - PubMed
1. World Health Organization . Stakeholder Consultative Meeting on Prevention and Management of Childhood Pneumonia and Diarrhoea: Report, 12–14 October 2021. World Health Organization; Geneva, Switzerland: 2022.
1. Rahman T., Chowdhury M.E., Khandakar A., Islam K.R., Islam K.F., Mahbub Z.B., Kadir M.A., Kashem S. Transfer learning with deep convolutional neural network (CNN) for pneumonia detection using chest X-ray. Appl. Sci. 2020;10:3233. doi: 10.3390/app10093233. - DOI
1. Banerjee A., Sarkar A., Roy S., Singh P.K., Sarkar R. COVID-19 chest X-ray detection through blending ensemble of CNN snapshots. Biomed. Signal Process. Control. 2022;78:104000. doi: 10.1016/j.bspc.2022.104000. - DOI - PMC - PubMed
1. Chen S., Ren S., Wang G., Huang M., Xue C. Interpretable cnn-multilevel attention transformer for rapid recognition of pneumonia from chest X-ray images. IEEE J. Biomed. Health Inform. 2023;28:753–764. doi: 10.1109/JBHI.2023.3247949. - DOI - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
- MDPI
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Affiliations

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Related information

Grants and funding

LinkOut - more resources

Full Text Sources