Review
PLoS One. 2024 Jan 18;19(1):e0292100.
doi: 10.1371/journal.pone.0292100. eCollection 2024.

Hybrid feature selection and classification technique for early prediction and severity of diabetes type 2

Praveen Talari et al.

Expression of concern in

Abstract

Diabetes prediction is an ongoing research topic in which medical specialists are attempting to forecast the condition with greater precision. Diabetes often remains dormant, and when patients are diagnosed with another condition, such as damage to the kidney vessels, problems with the retina of the eye, or heart disease, it can cause metabolic disorders and various complications in the body. Several ensemble learning techniques, including voting, boosting, and bagging, were applied in this study. The Synthetic Minority Oversampling Technique (SMOTE), along with K-fold cross-validation, was used to balance the classes and validate the findings. The Pima Indian Diabetes (PID) dataset, obtained from the UCI Machine Learning (UCI ML) repository, was used for this study. A feature engineering technique was used to calculate the influence of lifestyle factors. A two-phase classification model was developed to predict insulin resistance using the Sequential Minimal Optimisation (SMO) and SMOTE approaches together. SMOTE is used to preprocess the data in the model's first phase, while the SMO classifier is used in the second phase. Bagging decision trees outperformed all other classification techniques in terms of misclassification error rate, accuracy, specificity, precision, recall, F1 measure, and ROC curve. The model built with the combined SMOTE and SMO strategy achieved 99.07% accuracy with 0.1 ms of runtime. The proposed system aims to improve the classifier's performance in detecting the disease early.
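
The oversampling step named in the abstract can be illustrated with a minimal sketch of the general SMOTE idea: each synthetic sample is placed at a random point on the line segment between a minority-class sample and one of its k nearest minority-class neighbours. This is not the authors' implementation. The function name `smote`, its parameters, and the toy data are assumptions made for illustration, and only the Python standard library is used.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic samples interpolated between minority samples.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the remaining minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        # Choose one of the k nearest neighbours at random.
        neighbour = rng.choice(others[:k])
        # Interpolate: base + gap * (neighbour - base), with gap in [0, 1).
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy two-feature data (made up): 4 minority samples against 10 majority samples.
minority = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
majority_count = 10

new_points = smote(minority, n_new=majority_count - len(minority), k=3)
balanced_minority = minority + new_points
print(len(balanced_minority))
```

Because every synthetic point lies between two existing minority samples, the minority class is enlarged without duplicating records exactly. In a K-fold setting, oversampling is normally applied to the training folds only, so that synthetic samples do not leak into the validation fold.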

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1. Steps for traditional pre-processing of data.
Fig 2. Proposed architecture for diagnosing diabetes.
Fig 3. Description of parameters used in the dataset.
Fig 4. Correlation coefficient matrix.
Fig 5. Confusion matrix. a. Training: BDT, b. Testing: BDT, c. Training: RF, d. Testing: RF, e. Training: ET, f. Testing: ET, g. Training: AB, h. Testing: AB.
Fig 6. Feature importance towards prediction of T2DM.
