Comput Biol Med. 2022 Nov:150:106092.

doi: 10.1016/j.compbiomed.2022.106092. Epub 2022 Sep 28.

SVD-CLAHE boosting and balanced loss function for Covid-19 detection from an imbalanced Chest X-Ray dataset

Santanu Roy¹, Mrinal Tyagi², Vibhuti Bansal³, Vikas Jain⁴

Affiliations

¹ School of Engineering and Technology, Christ (Deemed to be University), Bangalore 560074, India. Electronic address: santanuroy35@gmail.com.
² Bharati Vidyapeeth's College of Engineering, New Delhi 110063, India. Electronic address: mrinaltyagi24@gmail.com.
³ Bharati Vidyapeeth's College of Engineering, New Delhi 110063, India. Electronic address: bansal.vibhuti25@gmail.com.
⁴ School of Computer Science Engineering and Technology, Bennett University, Greater Noida, UP 201310, India. Electronic address: vikasjain.cse@gmail.com.

PMID: 36208598
PMCID: PMC9514969
DOI: 10.1016/j.compbiomed.2022.106092

Santanu Roy et al. Comput Biol Med. 2022 Nov.

Abstract

Covid-19 has had a disastrous effect on the health of the global population over the last two years. Automatic early detection of Covid-19 from Chest X-Ray (CXR) images is a crucial step in the fight against the disease. In this paper, we propose a novel data-augmentation technique, called SVD-CLAHE Boosting, and a novel loss function, Balanced Weighted Categorical Cross Entropy (BWCCE), in order to detect Covid-19 efficiently from a highly class-imbalanced Chest X-Ray image dataset. Our proposed SVD-CLAHE Boosting method comprises both over-sampling and under-sampling. First, a novel Singular Value Decomposition (SVD) based contrast enhancement method and Contrast Limited Adaptive Histogram Equalization (CLAHE) are employed for over-sampling the data in the minor classes. Simultaneously, Random Under Sampling (RUS) is applied to the major classes, so that the number of images per class is more balanced. Thereafter, the BWCCE loss function is proposed in order to further reduce the small class imbalance that remains after SVD-CLAHE Boosting. Experimental results reveal that the ResNet-50 model trained on the augmented dataset (by SVD-CLAHE Boosting) with the BWCCE loss function achieved 95% F1 score, 94% accuracy, 95% recall, 96% precision and 96% AUC, which is far better than the results of other conventional Convolutional Neural Network (CNN) models such as InceptionV3, DenseNet-121 and Xception, as well as other existing models such as Covid-Lite and Covid-Net. Hence, our proposed framework outperforms other existing methods for Covid-19 detection. Furthermore, the same experiment is conducted on the VGG-19 model in order to check the validity of our proposed framework. Both the ResNet-50 and VGG-19 models are pre-trained on the ImageNet dataset.
We have publicly shared our proposed augmented dataset on Kaggle (https://www.kaggle.com/tr1gg3rtrash/balanced-augmented-covid-cxr-dataset), so that the research community can make wide use of it. Our code is available on GitHub (https://github.com/MrinalTyagi/SVD-CLAHE-and-BWCCE).
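
The rebalancing idea in the abstract (under-sample the major classes, then use a class-weighted loss to absorb the imbalance that remains) can be illustrated with a minimal sketch. This is not the authors' implementation: the exact BWCCE formula is not given in the abstract, so the weights below use a common inverse-frequency scheme as a stand-in, the image over-sampling step (SVD contrast enhancement and CLAHE) is omitted, and the function names `random_under_sample`, `inverse_frequency_weights` and `weighted_cce`, as well as the toy class counts, are hypothetical.

```python
import math
import random
from collections import Counter


def random_under_sample(labels, cap, seed=0):
    """Return sorted indices keeping at most `cap` samples per class (RUS)."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    kept = []
    for indices in by_class.values():
        if len(indices) > cap:
            # Randomly discard the surplus samples of a major class.
            indices = rng.sample(indices, cap)
        kept.extend(indices)
    return sorted(kept)


def inverse_frequency_weights(labels):
    """Assumed weighting scheme: w_c = N / (K * n_c).

    N is the number of samples, K the number of classes and n_c the
    count of class c, so the average weight over all samples is 1.
    """
    counts = Counter(labels)
    total = sum(counts.values())
    return {c: total / (len(counts) * n) for c, n in counts.items()}


def weighted_cce(probs, true_class, weights, eps=1e-12):
    """Weighted categorical cross entropy for one sample: -w_c * log(p_c)."""
    return -weights[true_class] * math.log(max(probs[true_class], eps))


# Toy labels (not the paper's class counts): class 0 is major, class 2 is minor.
labels = [0] * 100 + [1] * 40 + [2] * 10
kept = random_under_sample(labels, cap=40)
balanced_labels = [labels[i] for i in kept]

# The minor class is still under-represented, so it receives a larger weight.
weights = inverse_frequency_weights(balanced_labels)
loss = weighted_cce({0: 0.2, 1: 0.3, 2: 0.5}, true_class=2, weights=weights)
```

In this toy setting the minor class keeps all of its samples, and an error on it costs more than the same error on a major class, which is the general effect a class-weighted cross entropy is meant to have.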

Keywords: Categorical Cross Entropy (CCE); Chest X-Ray (CXR) images; Class imbalance problem; Contrast Limited Adaptive Histogram Equalization (CLAHE); Covid-19 detection; Data augmentation; Singular Value Decomposition (SVD).

Conflict of interest statement

Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Fig. 1**
Block Diagram of entire proposed model (SVD-CLAHE Boosting + ResNet-50 + BWCCE).

**Fig. 2**
Example of proposed Augmented Dataset (by SVD-CLAHE Boosting).

**Fig. 3**
The entire scheme of proposed SVD-CLAHE Boosting.

**Fig. 4**
**Visualization of the entire proposed methodology.** Fig. 4(a) presents the distribution of major and minor classes in the original dataset, based on the number of images. Figs. 4(b) to 4(d) present the changes in distribution after employing the proposed methodology. Fig. 4(e) indicates the cluster representation of major and minor classes for the original dataset. Figs. 4(f) to 4(h) indicate the changes in the cluster representation of major and minor classes after employing RUS, SVD-CLAHE over-sampling, and the BWCCE loss function, respectively. All these distribution and cluster diagrams are purely illustrative and have not been taken from any statistical plot of the dataset.

**Fig. 5**
Comparison of the performance of several experiments on the ResNet-50 model: (a) training accuracy, (b) training F1 score, (c) training loss, (d) validation accuracy, (e) validation F1 score, (f) validation loss. The experiments, labeled in the diagram, are: ResNet-50 on the original dataset; ResNet-50 on the augmented dataset (by SVD-CLAHE Boosting) with an equal number of images per class; ResNet-50 on the augmented dataset (by the proposed SVD-CLAHE Boosting); ResNet-50 + SVD-CLAHE Boosting + WCCE; and the proposed method (ResNet-50 + SVD-CLAHE Boosting + BWCCE).

**Fig. 6**
Comparison of the performance of several experiments on the VGG-19 model: (a) training accuracy, (b) training F1 score, (c) training loss, (d) validation accuracy, (e) validation F1 score, (f) validation loss. The experiments, labeled in the diagram, are: VGG-19 on the original dataset; VGG-19 on the augmented dataset (by SVD-CLAHE Boosting) with an equal number of images per class; VGG-19 on the augmented dataset (by the proposed SVD-CLAHE Boosting); VGG-19 + SVD-CLAHE Boosting + WCCE; and the proposed method (VGG-19 + SVD-CLAHE Boosting + BWCCE).

**Fig. 7**
(a) Number of epochs to convergence for ResNet-50 with different loss functions on the proposed augmented dataset; (b) average time taken per epoch, in seconds, for ResNet-50 with different loss functions on the proposed augmented dataset.

**Fig. 8**
Confusion matrices for different experiments on the ResNet-50 model: (a) Confusion Matrix (CM1) for ResNet-50 on the original dataset, (b) Confusion Matrix (CM2) for ResNet-50 + SVD-CLAHE Boosting, (c) Confusion Matrix (CM3) for the proposed methodology (ResNet-50 + SVD-CLAHE Boosting + BWCCE).
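
Metrics of the kind reported in the abstract (accuracy, precision, recall, F1 score) can be derived from a multi-class confusion matrix such as those in Fig. 8. The sketch below shows the standard per-class definitions; the matrix `toy_cm` contains made-up values for illustration only, not the paper's results, and the function names are hypothetical.

```python
def per_class_metrics(cm):
    """Return (precision, recall, f1) per class.

    cm[i][j] is the number of samples of true class i predicted as class j.
    """
    n = len(cm)
    metrics = []
    for k in range(n):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp                        # rest of row k
        fp = sum(cm[i][k] for i in range(n)) - tp   # rest of column k
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics.append((precision, recall, f1))
    return metrics


def accuracy(cm):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in cm)
    return sum(cm[k][k] for k in range(len(cm))) / total


# Made-up 3-class confusion matrix (rows: true class, columns: predicted).
toy_cm = [
    [8, 2, 0],
    [1, 9, 0],
    [0, 1, 9],
]
metrics = per_class_metrics(toy_cm)
macro_f1 = sum(f1 for _, _, f1 in metrics) / len(metrics)
```

On an imbalanced test set, the macro-averaged F1 score weights every class equally, which is why it is often reported alongside accuracy.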

Cited by

Oltu B, Güney S, Yuksel SE, Dengiz B. Automated classification of chest X-rays: a deep learning approach with attention mechanisms. BMC Med Imaging. 2025 Mar 4;25(1):71. doi: 10.1186/s12880-025-01604-5. PMID: 40038588. Free PMC article.
Yoon SH, Kim J, Kim J, Lee JH, Choi I, Shin CW, Park CM. Improving Image Quality of Chest Radiography with Artificial Intelligence-Supported Dual-Energy X-Ray Imaging System: An Observer Preference Study in Healthy Volunteers. J Clin Med. 2025 Mar 19;14(6):2091. doi: 10.3390/jcm14062091. PMID: 40142899. Free PMC article.
