Predicting Diabetes Mellitus With Machine Learning Techniques
- PMID: 30459809
- PMCID: PMC6232260
- DOI: 10.3389/fgene.2018.00515
Predicting Diabetes Mellitus With Machine Learning Techniques
Abstract
Diabetes mellitus is a chronic disease characterized by hyperglycemia. It may cause many complications. According to the growing morbidity in recent years, in 2040, the world's diabetic patients will reach 642 million, which means that one of the ten adults in the future is suffering from diabetes. There is no doubt that this alarming figure needs great attention. With the rapid development of machine learning, machine learning has been applied to many aspects of medical health. In this study, we used decision tree, random forest and neural network to predict diabetes mellitus. The dataset is the hospital physical examination data in Luzhou, China. It contains 14 attributes. In this study, five-fold cross validation was used to examine the models. In order to verity the universal applicability of the methods, we chose some methods that have the better performance to conduct independent test experiments. We randomly selected 68994 healthy people and diabetic patients' data, respectively as training set. Due to the data unbalance, we randomly extracted 5 times data. And the result is the average of these five experiments. In this study, we used principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) to reduce the dimensionality. The results showed that prediction with random forest could reach the highest accuracy (ACC = 0.8084) when all the attributes were used.
Keywords: decision tree; diabetes mellitus; feature ranking; machine learning; neural network; random forest.
Figures





Similar articles
-
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25. Artif Intell Med. 2019. PMID: 31521253
-
Accurate Diabetes Risk Stratification Using Machine Learning: Role of Missing Value and Outliers.J Med Syst. 2018 Apr 10;42(5):92. doi: 10.1007/s10916-018-0940-7. J Med Syst. 2018. PMID: 29637403 Free PMC article.
-
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26. Artif Intell Med. 2019. PMID: 31383477 Review.
-
Two Machine-learning Hybrid Models for Predicting Type 2 Diabetes Mellitus.J Med Signals Sens. 2025 Apr 19;15:11. doi: 10.4103/jmss.jmss_29_24. eCollection 2025. J Med Signals Sens. 2025. PMID: 40351779 Free PMC article.
-
Predicting the Onset of Diabetes with Machine Learning Methods.J Pers Med. 2023 Feb 24;13(3):406. doi: 10.3390/jpm13030406. J Pers Med. 2023. PMID: 36983587 Free PMC article. Review.
Cited by
-
Construction of a 3-year risk prediction model for developing diabetes in patients with pre-diabetes.Front Endocrinol (Lausanne). 2024 Jun 13;15:1410502. doi: 10.3389/fendo.2024.1410502. eCollection 2024. Front Endocrinol (Lausanne). 2024. PMID: 38938520 Free PMC article.
-
An enhanced diabetes prediction amidst COVID-19 using ensemble models.Front Public Health. 2023 Dec 12;11:1331517. doi: 10.3389/fpubh.2023.1331517. eCollection 2023. Front Public Health. 2023. PMID: 38155892 Free PMC article.
-
Genetic Risk Score Increased Discriminant Efficiency of Predictive Models for Type 2 Diabetes Mellitus Using Machine Learning: Cohort Study.Front Public Health. 2021 Feb 17;9:606711. doi: 10.3389/fpubh.2021.606711. eCollection 2021. Front Public Health. 2021. PMID: 33681127 Free PMC article.
-
Prognostic Modeling and Prevention of Diabetes Using Machine Learning Technique.Sci Rep. 2019 Sep 24;9(1):13805. doi: 10.1038/s41598-019-49563-6. Sci Rep. 2019. PMID: 31551457 Free PMC article.
-
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x. Sci Rep. 2020. PMID: 32157171 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Other Literature Sources