Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 12:11:1331517.
doi: 10.3389/fpubh.2023.1331517. eCollection 2023.

An enhanced diabetes prediction amidst COVID-19 using ensemble models

Affiliations

An enhanced diabetes prediction amidst COVID-19 using ensemble models

Deepak Thakur et al. Front Public Health. .

Abstract

In the contemporary landscape of healthcare, the early and accurate prediction of diabetes has garnered paramount importance, especially in the wake of the COVID-19 pandemic where individuals with diabetes exhibit increased vulnerability. This research embarked on a mission to enhance diabetes prediction by employing state-of-the-art machine learning techniques. Initial evaluations highlighted the Support Vector Machines (SVM) classifier as a promising candidate with an accuracy of 76.62%. To further optimize predictions, the study delved into advanced feature engineering techniques, generating interaction and polynomial features that unearthed hidden patterns in the data. Subsequent correlation analyses, visualized through heatmaps, revealed significant correlations, especially with attributes like Glucose. By integrating the strengths of Decision Trees, Gradient Boosting, and SVM in an ensemble model, we achieved an accuracy of 93.2%, showcasing the potential of harmonizing diverse algorithms. This research offers a robust blueprint for diabetes prediction, holding profound implications for early diagnosis, personalized treatments, and preventive care in the context of global health challenges and with the goal of increasing life expectancy.

Keywords: COVID-19; classification; correlation analysis; diabetes; ensemble models; feature engineering; interaction; polynomial.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Figures

Figure 1
Figure 1
The word cloud.
Figure 2
Figure 2
Various features original and median imputed distribution.
Figure 3
Figure 3
Box plot for all the features.
Figure 4
Figure 4
Corrected box plots for all the features.
Figure 5
Figure 5
Cumulative and individual explained variance by different principal components, 2D PCA of diabetes data, ROC curves and PR curves of four different classifiers—Logistic Regression, Random Forest, Gradient Boosting, and Support Vector Machines.
Figure 6
Figure 6
Confusion matrix for the classifiers used in this work.
Figure 7
Figure 7
Correlation matrix.
Figure 8
Figure 8
Visual representation of the model's performance across various metrics, trade-off between precision and recall at various thresholds and receiver operating characteristic (ROC) curve provides insights into the model's ability to discriminate between the classes.

References

    1. Adamidi ES, Mitsis K, Nikita KS. Artificial intelligence in clinical care amidst COVID-19 pandemic: a systematic review. Comput Struct Biotechnol J. (2021) 19:2833–50. 10.1016/j.csbj.2021.05.010 - DOI - PMC - PubMed
    1. Krishnamoorthi R. A novel diabetes healthcare disease prediction framework using machine learning techniques. J Healthc Eng. (2022) 2022:9872970. 10.1155/2022/1684017 - DOI - PMC - PubMed
    1. Saleem K, Saleem M, Ahmad RZ, Javed AR, Alazab M, Gadekallu TR, et al. . Situation-aware BDI reasoning to detect early symptoms of covid 19 using smartwatch. IEEE Sens J. (2022) 23:898–905. 10.1109/JSEN.2022.3156819 - DOI - PMC - PubMed
    1. Jalil Z, Abbasi A, Javed AR, Khan MB, Hasanat MHA, Malik KM, et al. . COVID-19 related sentiment analysis using state-of-the-art machine learning and deep learning techniques. Front Public Health. (2022) 9:812735. 10.3389/fpubh.2021.812735 - DOI - PMC - PubMed
    1. Cahn A. Prediction of progression from pre-diabetes to diabetes: development and validation of a machine learning model. Diabetes Metab Res Rev. (2020) 36:3252. 10.1002/dmrr.3252 - DOI - PubMed

Publication types