. 2021 Sep 7;16(9):e0256630.

doi: 10.1371/journal.pone.0256630. eCollection 2021.

Pneumonia detection in chest X-ray images using an ensemble of deep learning models

Rohit Kundu¹, Ritacheta Das², Zong Woo Geem³, Gi-Tae Han³, Ram Sarkar²

Affiliations

¹ Department of Electrical Engineering, Jadavpur University, Kolkata, India.
² Department of Computer Science & Engineering, Jadavpur University, Kolkata, India.
³ College of IT Convergence, Gachon University, Seongnam, South Korea.

PMID: 34492046
PMCID: PMC8423280
DOI: 10.1371/journal.pone.0256630

Pneumonia detection in chest X-ray images using an ensemble of deep learning models

Rohit Kundu et al. PLoS One. 2021.

. 2021 Sep 7;16(9):e0256630.

doi: 10.1371/journal.pone.0256630. eCollection 2021.

Authors

Rohit Kundu¹, Ritacheta Das², Zong Woo Geem³, Gi-Tae Han³, Ram Sarkar²

Affiliations

¹ Department of Electrical Engineering, Jadavpur University, Kolkata, India.
² Department of Computer Science & Engineering, Jadavpur University, Kolkata, India.
³ College of IT Convergence, Gachon University, Seongnam, South Korea.

PMID: 34492046
PMCID: PMC8423280
DOI: 10.1371/journal.pone.0256630

Abstract

Pneumonia is a respiratory infection caused by bacteria or viruses; it affects many individuals, especially in developing and underdeveloped nations, where high levels of pollution, unhygienic living conditions, and overcrowding are relatively common, together with inadequate medical infrastructure. Pneumonia causes pleural effusion, a condition in which fluids fill the lung, causing respiratory difficulty. Early diagnosis of pneumonia is crucial to ensure curative treatment and increase survival rates. Chest X-ray imaging is the most frequently used method for diagnosing pneumonia. However, the examination of chest X-rays is a challenging task and is prone to subjective variability. In this study, we developed a computer-aided diagnosis system for automatic pneumonia detection using chest X-ray images. We employed deep transfer learning to handle the scarcity of available data and designed an ensemble of three convolutional neural network models: GoogLeNet, ResNet-18, and DenseNet-121. A weighted average ensemble technique was adopted, wherein the weights assigned to the base learners were determined using a novel approach. The scores of four standard evaluation metrics, precision, recall, f1-score, and the area under the curve, are fused to form the weight vector, which in studies in the literature was frequently set experimentally, a method that is prone to error. The proposed approach was evaluated on two publicly available pneumonia X-ray datasets, provided by Kermany et al. and the Radiological Society of North America (RSNA), respectively, using a five-fold cross-validation scheme. The proposed method achieved accuracy rates of 98.81% and 86.85% and sensitivity rates of 98.80% and 87.02% on the Kermany and RSNA datasets, respectively. The results were superior to those of state-of-the-art methods and our method performed better than the widely used ensemble techniques. Statistical analyses on the datasets using McNemar's and ANOVA tests showed the robustness of the approach. The codes for the proposed work are available at https://github.com/Rohit-Kundu/Ensemble-Pneumonia-Detection.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Examples of two X-ray plates that display (a) a healthy lung and (b) a pneumonic lung.**
The red arrows in (b) indicate white infiltrates, a distinguishing feature of pneumonia. The images were taken from the Kermany dataset [4].

**Fig 2. Representation of the proposed pneumonia detection framework.**
*Pre* = Precision score, *Rec* = Recall score, F1 = F1-score, *AUC* = AUC score, and A⁽ⁱ⁾ = {*Pre*_i, *Rec*_i, F1_i, *AUC*_i}; w⁽ⁱ⁾ is the weight generated for the i^th base learner to compute the ensemble, $p_{j}^{(i)}$ is the probability score for the j^th sample by the i^th classifier, and *ens*_j is the fused probability score for the j^th sample; and the *argmax* function returns the position having the highest value in a 1D array, i.e., in this case it generates the predicted class of the sample.

**Fig 3. Inception modules in the GoogLeNet architecture.**
(a) The naive inception block that is replaced by (b) the dimension reduction inception block in the GoogLeNet architecture to improve computational efficiency.

**Fig 4. Architecture of the GoogLeNet model used in this study.**
The inception block is shown in Fig 3(b).

**Fig 5. Architecture of the ResNet-18 model used in this study.**

**Fig 6. Basic architecture of the DenseNet convolutional neural network model.**

**Fig 7. Confusion matrices obtained on the Kermany pneumonia chest X-ray dataset by the proposed method by 5-fold cross validation.**
a) Fold-1. (b) Fold-2. (c) Fold-3. (d) Fold-4. (e) Fold-5.

**Fig 8. Confusion matrices obtained on the Radiological Society of North America pneumonia challenge chest X-ray dataset by the proposed method by five-fold cross validation.**
a) Fold-1. (b) Fold-2. (c) Fold-3. (d) Fold-4. (e) Fold-5.

**Fig 9. Receiver operating characteristic curves obtained by the proposed ensemble method on the two pneumonia chest X-ray datasets used in this research.**
(a) Kermany dataset [4]. (b) RSNA challenge dataset [33].

Fig 10. Variation of accuracy rates on the Kermany dataset [4]) achieved by the three base learners, GoogLeNet, ResNet-18, and DenseNet-121 and their ensemble, according to the optimizers chosen for fine tuning.

**Fig 11. Variation in performance (accuracy rates) of the ensemble with respect to the number of fixed non-trainable layers in the base learners on the two datasets used in this study.**
(a) Kermany dataset [4]. (b) RSNA challenge dataset [33].

**Fig 12. Gradient-weighted class activation map (GradCAM) decision visualization of chest X-ray images when the three chosen base learners were used to form the ensemble.**
Different regions of the X-rays are the focus of the different models that capture complementary information. Case-1: (a)–(c) show a pneumonic lung X-ray analyzed using the three base learners; the confidence scores of the three base learners are GoogLeNet: 99.99%, ResNet-18: 75.21%, and DenseNet-121: 98.90% Case-2: (d)–(f) show a healthy lung X-ray analyzed using the three base learners; the confidence scores of the three base learners are GoogLeNet: 99.47%, ResNet-18: 97.61%, and DenseNet-121: 98.93%.

**Fig 13. Examples of samples from the Kermany dataset where two out of three base learners yielded incorrect predictions, but the ensemble yielded the correct prediction.**
Both images are of class “Normal”. **(a) Case-1**: GoogLeNet predicted “*Pneumonia*” with a confidence score of 53.1%, ResNet-18 predicted “*Pneumonia*” with a confidence score of 73.8%, and DenseNet-121 predicted “*Normal*” with a confidence score of 89.4%. The proposed ensemble framework predicted “*Normal*” (correct classification) with a confidence rate of 68.1 **(b) Case-2**: GoogLeNet predicted “*Normal*” with a confidence score of 98.6%, ResNet-18 predicted “*Pneumonia*” with a confidence score of 58.3%, and DenseNet-121 predicted “*Pneumonia*” with a confidence score of 69.3%. The proposed ensemble framework predicted “*Normal*” (correct classification) with a confidence rate of 66.3%.

**Fig 14. Examples of samples from the Kermany dataset [4] that were classified incorrectly by the proposed ensemble framework.**
Case-1: (a) shows an image originally belonging to class “Normal” but misclassified as “Pneumonia” by the framework. The GradCAM analysis images are shown in (c), (d), and (e) for GoogLeNet, ResNet-18, and DenseNet-121, respectively. Case-2: (b) shows an image of class “Pneumonia” predicted to belong to the “Normal” class by the framework. The GradCAM analysis images are shown in (f), (g), and (h)for GoogLeNet, ResNet-18, and DenseNet-121, respectively.

See this image and copyright information in PMC

Cited by

Detection of COVID-19 using deep learning on x-ray lung images.
Odeh A, Alomar A, Aljawarneh S. Odeh A, et al. PeerJ Comput Sci. 2022 Sep 7;8:e1082. doi: 10.7717/peerj-cs.1082. eCollection 2022. PeerJ Comput Sci. 2022. PMID: 36262134 Free PMC article.
In Vivo Prediction of Breast Muscle Weight in Broiler Chickens Using X-ray Images Based on Deep Learning and Machine Learning.
Zhu R, Li J, Yang J, Sun R, Yu K. Zhu R, et al. Animals (Basel). 2024 Feb 16;14(4):628. doi: 10.3390/ani14040628. Animals (Basel). 2024. PMID: 38396595 Free PMC article.
Chest X-Ray Images to Differentiate COVID-19 from Pneumonia with Artificial Intelligence Techniques.
Islam R, Tarique M. Islam R, et al. Int J Biomed Imaging. 2022 Dec 22;2022:5318447. doi: 10.1155/2022/5318447. eCollection 2022. Int J Biomed Imaging. 2022. PMID: 36588667 Free PMC article.
COVID-19 detection from lung CT-Scans using a fuzzy integral-based CNN ensemble.
Kundu R, Singh PK, Mirjalili S, Sarkar R. Kundu R, et al. Comput Biol Med. 2021 Nov;138:104895. doi: 10.1016/j.compbiomed.2021.104895. Epub 2021 Oct 1. Comput Biol Med. 2021. PMID: 34649147 Free PMC article.
Improving diagnosis accuracy with an intelligent image retrieval system for lung pathologies detection: a features extractor approach.
Souid A, Alsubaie N, Soufiene BO, Alqahtani MS, Abbas M, Jambi LK, Sakli H. Souid A, et al. Sci Rep. 2023 Oct 3;13(1):16619. doi: 10.1038/s41598-023-42366-w. Sci Rep. 2023. PMID: 37789095 Free PMC article.

See all "Cited by" articles

References

1. WHO Pneumonia. World Health Organization. (2019), https://www.who.int/news-room/fact-sheets/detail/pneumonia
1. Neuman M., Lee E., Bixby S., Diperna S., Hellinger J., Markowitz R., et al.. Variability in the interpretation of chest radiographs for the diagnosis of pneumonia in children. Journal Of Hospital Medicine. 7, 294–298 (2012) doi: 10.1002/jhm.955 - DOI - PubMed
1. Williams G., Macaskill P., Kerr M., Fitzgerald D., Isaacs D., Codarini M., et al.. Variability and accuracy in interpretation of consolidation on chest radiography for diagnosing pneumonia in children under 5 years of age. Pediatric Pulmonology. 48, 1195–1200 (2013) doi: 10.1002/ppul.22806 - DOI - PubMed
1. Kermany D., Zhang K. & Goldbaum M. Labeled Optical Coherence Tomography (OCT) and Chest X-Ray Images for Classification. (Mendeley,2018)
1. Lal S., Rehman S., Shah J., Meraj T., Rauf H., Damaševičius R., et al.. Adversarial Attack and Defence through Adversarial Training and Feature Fusion for Diabetic Retinopathy Recognition. Sensors. 21, 3922 (2021) doi: 10.3390/s21113922 - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- ClinicalTrials.gov
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Pneumonia detection in chest X-ray images using an ensemble of deep learning models

Affiliations

Pneumonia detection in chest X-ray images using an ensemble of deep learning models

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Medical