Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 15;22(1):178.
doi: 10.1186/s12880-022-00904-4.

Computer-aided diagnostic for classifying chest X-ray images using deep ensemble learning

Affiliations

Computer-aided diagnostic for classifying chest X-ray images using deep ensemble learning

Lara Visuña et al. BMC Med Imaging. .

Abstract

Background: Nowadays doctors and radiologists are overwhelmed with a huge amount of work. This led to the effort to design different Computer-Aided Diagnosis systems (CAD system), with the aim of accomplishing a faster and more accurate diagnosis. The current development of deep learning is a big opportunity for the development of new CADs. In this paper, we propose a novel architecture for a convolutional neural network (CNN) ensemble for classifying chest X-ray (CRX) images into four classes: viral Pneumonia, Tuberculosis, COVID-19, and Healthy. Although Computed tomography (CT) is the best way to detect and diagnoses pulmonary issues, CT is more expensive than CRX. Furthermore, CRX is commonly the first step in the diagnosis, so it's very important to be accurate in the early stages of diagnosis and treatment.

Results: We applied the transfer learning technique and data augmentation to all CNNs for obtaining better performance. We have designed and evaluated two different CNN-ensembles: Stacking and Voting. This system is ready to be applied in a CAD system to automated diagnosis such a second or previous opinion before the doctors or radiology's. Our results show a great improvement, 99% accuracy of the Stacking Ensemble and 98% of accuracy for the the Voting Ensemble.

Conclusions: To minimize missclassifications, we included six different base CNN models in our architecture (VGG16, VGG19, InceptionV3, ResNet101V2, DenseNet121 and CheXnet) and it could be extended to any number as well as we expect extend the number of diseases to detected. The proposed method has been validated using a large dataset created by mixing several public datasets with different image sizes and quality. As we demonstrate in the evaluation carried out, we reach better results and generalization compared with previous works. In addition, we make a first approach to explainable deep learning with the objective of providing professionals more information that may be valuable when evaluating CRXs.

Keywords: CNN; COVID-19 classification; Deep ensemble learning; Grad-CAM; Stacking; Voting.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Proposed architecture of computer-aided diagnosis system based on CNN-ensemble
Fig. 2
Fig. 2
CRX examples from the dataset
Fig. 3
Fig. 3
Stacking Ensemble architecture
Fig. 4
Fig. 4
Receiver operating characteristic (ROC) and precision-recall curve (PR) for the CNN (micro-averaging)
Fig. 5
Fig. 5
Grad-CAM heatmaps for pneumonia CRX images
Fig. 6
Fig. 6
Grad-CAM heatmaps for tuberculosis CRX images with radiological findings
Fig. 7
Fig. 7
Validation confusion matrix (classes: 0-COVID-19, 1-Healthy, 2-Pneumonia, and 3-Tuberculosis)
Fig. 8
Fig. 8
Training/validation loss and accuracy graph for the stacking ensemble
Fig. 9
Fig. 9
Test confusion matrix (classes: 0-COVID-19, 1-Healthy, 2-Pneumonia, and 3-Tuberculosis)
Fig. 10
Fig. 10
TSNE visualization for train and validation set. a Originals radiography images, b intermediate layer VGG19 (block3—pool), and c last layer VGG19 (dense layer). Classes: Blue-COVID-19, Green-Healthy, Purple-Pneumonia, and Orange-Tuberculosis
Fig. 11
Fig. 11
Receiver operating characteristic (ROC) for stacking ensemble (a) and voting ensemble (b) (classes: 0-COVID-19, 1-Healthy, 2-Pneumonia, and 3-Tuberculosis)
Fig. 12
Fig. 12
Precision-recall curve for stacking ensemble (a) and voting ensemble (b) (classes: 0-COVID-19, 1-Healthy, 2-Pneumonia, and 3-Tuberculosis)

Similar articles

Cited by

References

    1. World Health Organization. Weekly epidemiological update on COVID-19—29 June 2021. Edition 46 (2021). https://www.who.int/publications/m/item/weekly-epidemiological-update-on....
    1. World Health Organization. Weekly epidemiological update on COVID-19—4 May 2022. (2022). https://www.who.int/publications/m/item/weekly-epidemiological-update-on....
    1. Turkoglu M. COVIDetectioNet: COVID-19 diagnosis system based on X-ray images using features selected from pre-learned deep features ensemble. Appl Intell. 2021;51(3):1213–1226. doi: 10.1007/s10489-020-01888-w. - DOI - PMC - PubMed
    1. Niyaz U, Sambyal AS, et al. Advances in deep learning techniques for medical image analysis. In: 2018 fifth international conference on parallel, distributed and grid computing (PDGC). IEEE; 2018. p. 271–77.
    1. Ismael AM, Şengür A. Deep learning approaches for COVID-19 detection based on chest X-ray images. Expert Syst Appl. 2021;164:114054. doi: 10.1016/j.eswa.2020.114054. - DOI - PMC - PubMed

Publication types