A Comparison between Decision Tree and Random Forest in Determining the Risk Factors Associated with Type 2 Diabetes
- PMID: 29784893
A Comparison between Decision Tree and Random Forest in Determining the Risk Factors Associated with Type 2 Diabetes
Abstract
Background: We aimed to identify the associated risk factors of type 2 diabetes mellitus (T2DM) using data mining approach, decision tree and random forest techniques using the Mashhad Stroke and Heart Atherosclerotic Disorders (MASHAD) Study program.
Study design: A cross-sectional study.
Methods: The MASHAD study started in 2010 and will continue until 2020. Two data mining tools, namely decision trees, and random forests, are used for predicting T2DM when some other characteristics are observed on 9528 subjects recruited from MASHAD database. This paper makes a comparison between these two models in terms of accuracy, sensitivity, specificity and the area under ROC curve.
Results: The prevalence rate of T2DM was 14% among these subjects. The decision tree model has 64.9% accuracy, 64.5% sensitivity, 66.8% specificity, and area under the ROC curve measuring 68.6%, while the random forest model has 71.1% accuracy, 71.3% sensitivity, 69.9% specificity, and area under the ROC curve measuring 77.3% respectively.
Conclusions: The random forest model, when used with demographic, clinical, and anthropometric and biochemical measurements, can provide a simple tool to identify associated risk factors for type 2 diabetes. Such identification can substantially use for managing the health policy to reduce the number of subjects with T2DM .
Keywords: Decision tree; Diabetes mellitus; Iran; Random forest; data mining.
Similar articles
-
Type 2 Diabetes Mellitus Screening and Risk Factors Using Decision Tree: Results of Data Mining.Glob J Health Sci. 2015 Mar 18;7(5):304-10. doi: 10.5539/gjhs.v7n5p304. Glob J Health Sci. 2015. PMID: 26156928 Free PMC article.
-
Applying decision tree for identification of a low risk population for type 2 diabetes. Tehran Lipid and Glucose Study.Diabetes Res Clin Pract. 2014 Sep;105(3):391-8. doi: 10.1016/j.diabres.2014.07.003. Epub 2014 Jul 18. Diabetes Res Clin Pract. 2014. PMID: 25085758
-
Using Random Forest Models to Identify Correlates of a Diabetic Peripheral Neuropathy Diagnosis from Electronic Health Record Data.Pain Med. 2017 Jan 1;18(1):107-115. doi: 10.1093/pm/pnw096. Pain Med. 2017. PMID: 27252307
-
Comparing Three Data Mining Algorithms for Identifying the Associated Risk Factors of Type 2 Diabetes.Iran Biomed J. 2018 Sep;22(5):303-11. doi: 10.29252/ibj.22.5.303. Epub 2018 Jan 27. Iran Biomed J. 2018. PMID: 29374085 Free PMC article.
-
The application of a decision tree to establish the parameters associated with hypertension.Comput Methods Programs Biomed. 2017 Feb;139:83-91. doi: 10.1016/j.cmpb.2016.10.020. Epub 2016 Oct 24. Comput Methods Programs Biomed. 2017. PMID: 28187897
Cited by
-
Identification of Potential Key Genes for the Comorbidity of Myasthenia Gravis With Thymoma by Integrated Bioinformatics Analysis and Machine Learning.Bioinform Biol Insights. 2024 Sep 26;18:11779322241281652. doi: 10.1177/11779322241281652. eCollection 2024. Bioinform Biol Insights. 2024. PMID: 39345724 Free PMC article.
-
Machine learning and deep learning techniques to support clinical diagnosis of arboviral diseases: A systematic review.PLoS Negl Trop Dis. 2022 Jan 13;16(1):e0010061. doi: 10.1371/journal.pntd.0010061. eCollection 2022 Jan. PLoS Negl Trop Dis. 2022. PMID: 35025860 Free PMC article.
-
Association between Multiple Recurrent Events with Multivariate Modeling: A Retrospective Cohort Study.J Res Health Sci. 2018 Dec 29;18(4):e00433. J Res Health Sci. 2018. PMID: 30728319 Free PMC article.
-
Environmental exposures in machine learning and data mining approaches to diabetes etiology: A scoping review.Artif Intell Med. 2023 Jan;135:102461. doi: 10.1016/j.artmed.2022.102461. Epub 2022 Nov 30. Artif Intell Med. 2023. PMID: 36628796 Free PMC article.
-
Application of machine learning to predict aneuploidy and mosaicism in embryos from in vitro fertilization cycles.AJOG Glob Rep. 2022 Sep 19;2(4):100103. doi: 10.1016/j.xagr.2022.100103. eCollection 2022 Nov. AJOG Glob Rep. 2022. PMID: 36275401 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical