Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data
- PMID: 23566118
- PMCID: PMC3618247
- DOI: 10.1186/1472-6947-13-S1-S3
Abstract
Background: Because individual markers from a genome-wide association study (GWAS) have low statistical power, detecting causal single nucleotide polymorphisms (SNPs) for complex diseases is a challenge. SNP combinations have been suggested as a way to compensate for this low power, but evaluating SNP combinations across a GWAS is computationally expensive.
Methods: We aim to detect type 2 diabetes (T2D) causal SNP combinations from a GWAS dataset with optimal filtration and to discover the biological meaning of the detected SNP combinations. Optimal filtration can enhance the statistical power of SNP combinations; the optimal filter is chosen by comparing the error rates of SNP combinations obtained under various Bonferroni thresholds and p-value range-based thresholds, each combined with linkage disequilibrium (LD) pruning. T2D causal SNP combinations are selected from the resulting optimal SNP dataset using random forests with variable selection. T2D causal SNP combinations and genome-wide SNPs are mapped into functional modules using expanded gene set enrichment analysis (GSEA), which considers pathway, transcription factor (TF)-target, miRNA-target, gene ontology, and protein complex functional modules. Prediction error rates are also measured for SNP sets from functional module-based filtration, which selects SNPs within functional modules from genome-wide SNPs based on expanded GSEA.
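The filtration step described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: it applies a single Bonferroni cutoff followed by a greedy LD pruning pass, whereas the study compares several Bonferroni and p-value range-based thresholds. The function names, the SNP identifiers, the genotype coding (0/1/2 minor-allele counts), and the `alpha` and `r2_max` defaults are all assumptions made for illustration.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance cutoff under Bonferroni correction."""
    return alpha / n_tests


def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors,
    used here as a simple stand-in for pairwise LD (r^2)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic SNP carries no LD information
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return (sxy * sxy) / (sxx * syy)


def filter_snps(p_values, genotypes, alpha=0.05, r2_max=0.8):
    """Keep SNPs that pass the Bonferroni cutoff, then greedily drop any
    SNP whose r^2 with an already kept, more significant SNP exceeds r2_max."""
    cutoff = bonferroni_threshold(alpha, len(p_values))
    passing = sorted(
        (snp for snp, p in p_values.items() if p <= cutoff),
        key=p_values.get,
    )
    kept = []
    for snp in passing:
        if all(r_squared(genotypes[snp], genotypes[k]) <= r2_max for k in kept):
            kept.append(snp)
    return kept


# Hypothetical toy data: four SNPs genotyped in six individuals.
example_p = {"rs_a": 1e-6, "rs_b": 2e-6, "rs_c": 1e-3, "rs_d": 0.5}
example_g = {
    "rs_a": [0, 1, 2, 0, 1, 2],
    "rs_b": [0, 1, 2, 0, 1, 2],  # in perfect LD with rs_a
    "rs_c": [2, 0, 1, 1, 0, 2],
    "rs_d": [1, 1, 0, 2, 0, 1],
}
selected = filter_snps(example_p, example_g)
```

In this toy example `rs_d` fails the cutoff of 0.05 / 4 and `rs_b` is pruned because it duplicates `rs_a`. In a workflow like the one the abstract describes, the retained SNPs would then be passed to a random forest with variable selection, and the resulting error rate would be compared across candidate thresholds.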
Results: A T2D causal SNP combination containing 101 SNPs is selected from the Wellcome Trust Case Control Consortium (WTCCC) GWAS dataset using optimal filtration criteria, with an error rate of 10.25%. Matching the 101 SNPs with known T2D genes and functional modules reveals the relationships between T2D and SNP combinations. The prediction error rates of SNP sets from functional module-based filtration show no significant difference from the prediction error rates of randomly selected SNP sets or of T2D causal SNP combinations from optimal filtration.
Conclusions: We propose a detection method for complex disease causal SNP combinations from an optimal SNP dataset by using random forests with variable selection. Mapping the biological meanings of detected SNP combinations can help uncover complex disease mechanisms.