Identification of type 2 diabetes-associated combination of SNPs using support vector machine
- PMID: 20416077
- PMCID: PMC2875201
- DOI: 10.1186/1471-2156-11-26
Identification of type 2 diabetes-associated combination of SNPs using support vector machine
Abstract
Background: Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility single markers using a simple disease model. Recent studies are trying to estimate the effects of multiple genes and multi-locus in genome-wide association. However, estimating the effects of association is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways.
Results: We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with the prediction rates of 70.9% and 70.6%, respectively. As the high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with biologically more interesting combination of SNPs can be acquired by using this method.
Conclusions: Support Vector Machine based feature selection method in this research found novel association between combinations of SNPs and T2D in a Korean population.
Figures


Similar articles
-
Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data.BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1472-6947-13-S1-S3. Epub 2013 Apr 5. BMC Med Inform Decis Mak. 2013. PMID: 23566118 Free PMC article.
-
Shared genetic etiology underlying Alzheimer's disease and type 2 diabetes.Mol Aspects Med. 2015 Jun-Oct;43-44:66-76. doi: 10.1016/j.mam.2015.06.006. Epub 2015 Jun 23. Mol Aspects Med. 2015. PMID: 26116273 Free PMC article. Review.
-
Mapping of Diabetes Susceptibility Loci in a Domestic Cat Breed with an Unusually High Incidence of Diabetes Mellitus.Genes (Basel). 2020 Nov 19;11(11):1369. doi: 10.3390/genes11111369. Genes (Basel). 2020. PMID: 33228033 Free PMC article.
-
Linking Alzheimer's disease and type 2 diabetes: Novel shared susceptibility genes detected by cFDR approach.J Neurol Sci. 2017 Sep 15;380:262-272. doi: 10.1016/j.jns.2017.07.044. Epub 2017 Aug 1. J Neurol Sci. 2017. PMID: 28870582 Free PMC article.
-
Understanding Genetic Heterogeneity in Type 2 Diabetes by Delineating Physiological Phenotypes: SIRT1 and its Gene Network in Impaired Insulin Secretion.Rev Diabet Stud. 2016 Spring;13(1):17-34. doi: 10.1900/RDS.2016.13.17. Epub 2016 May 10. Rev Diabet Stud. 2016. PMID: 27563694 Free PMC article. Review.
Cited by
-
Eye-color and Type-2 diabetes phenotype prediction from genotype data using deep learning methods.BMC Bioinformatics. 2021 Apr 19;22(1):198. doi: 10.1186/s12859-021-04077-9. BMC Bioinformatics. 2021. PMID: 33874881 Free PMC article.
-
Statistical and Computational Methods for Genetic Diseases: An Overview.Comput Math Methods Med. 2015;2015:954598. doi: 10.1155/2015/954598. Epub 2015 May 28. Comput Math Methods Med. 2015. PMID: 26106440 Free PMC article. Review.
-
Novel insights through the integration of structural and functional genomics data with protein networks.J Struct Biol. 2012 Sep;179(3):320-6. doi: 10.1016/j.jsb.2012.02.001. Epub 2012 Feb 11. J Struct Biol. 2012. PMID: 22343087 Free PMC article.
-
Genome-wide association studies identified novel loci for non-high-density lipoprotein cholesterol and its postprandial lipemic response.Hum Genet. 2014 Jul;133(7):919-30. doi: 10.1007/s00439-014-1435-3. Epub 2014 Mar 7. Hum Genet. 2014. PMID: 24604477 Free PMC article.
-
A single nucleotide polymorphism panel for individual identification and ancestry assignment in Caucasians and four East and Southeast Asian populations using a machine learning classifier.Forensic Sci Med Pathol. 2019 Mar;15(1):67-74. doi: 10.1007/s12024-018-0071-y. Epub 2019 Jan 16. Forensic Sci Med Pathol. 2019. PMID: 30649693
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Molecular Biology Databases
Research Materials