Comparison of artificial neural network analysis with other multimarker methods for detecting genetic association
- PMID: 17640352
- PMCID: PMC1940019
- DOI: 10.1186/1471-2156-8-49
Abstract
Background: Debate remains as to the optimal method for utilising genotype data obtained from multiple markers in case-control association studies. Colleagues and I have previously described a method of association analysis using artificial neural networks (ANNs), whose performance compared favourably with that of single-marker methods. Here, the performance of ANN analysis is compared with that of other multi-marker methods, comprising different haplotype-based analyses and locus-based analyses.
Results: Of the methods applied to simulated SNP datasets, heterogeneity testing of estimated haplotype frequencies using asymptotic p values rather than permutation testing had the lowest power and ANN analysis had the highest power. The difference in power to detect association between these two methods was statistically significant (p = 0.001), but other comparisons between methods were not significant. The raw t statistic obtained from ANN analysis correlated highly with the empirical statistical significance obtained from permutation testing of the ANN results and with the p value obtained from the heterogeneity test.
Conclusion: Although ANN analysis was more powerful than the standard haplotype-based test, it is unlikely to be taken up widely. The permutation testing necessary to obtain a valid p value makes it slow to perform, and it is not underpinned by a theoretical model relating marker genotypes to disease phenotype. Nevertheless, the superior performance of this method implies that the widely used haplotype-based methods for detecting association with multiple markers are not optimal and that efforts could be made to improve upon them. Because the t statistic obtained from ANN analysis is highly correlated with the statistical significance, ANN analysis could be used in situations where large numbers of markers have been genotyped, with the t value serving as a proxy for the p value in preliminary analyses.
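To illustrate why permutation testing is slow and how a raw t statistic relates to an empirical p value, the following is a minimal sketch of a label-permutation test. It is not the author's ANN procedure: the per-subject `scores` merely stand in for whatever output a fitted model would give, and the function names, the Welch form of the t statistic, the number of permutations, and the toy data are all assumptions made for illustration.

```python
import random
from statistics import mean, variance


def welch_t(cases, controls):
    """Welch t statistic comparing the mean score of cases and controls."""
    se2 = variance(cases) / len(cases) + variance(controls) / len(controls)
    return (mean(cases) - mean(controls)) / se2 ** 0.5


def permutation_p(scores, labels, n_perm=999, seed=0):
    """Return (observed |t|, empirical p) by shuffling case/control labels.

    scores: one hypothetical model output per subject.
    labels: 1 for case, 0 for control, in the same order as scores.
    """
    rng = random.Random(seed)

    def stat(lab):
        cases = [s for s, l in zip(scores, lab) if l == 1]
        controls = [s for s, l in zip(scores, lab) if l == 0]
        return abs(welch_t(cases, controls))

    observed = stat(labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        # Each permutation breaks any real link between score and status.
        rng.shuffle(shuffled)
        if stat(shuffled) >= observed:
            hits += 1
    # Add one to numerator and denominator so p is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)


# Toy data (an assumption): 50 cases with shifted scores, 50 controls.
data_rng = random.Random(1)
labels = [1] * 50 + [0] * 50
scores = [data_rng.gauss(0.8, 1.0) for _ in range(50)]
scores += [data_rng.gauss(0.0, 1.0) for _ in range(50)]

t_obs, p_emp = permutation_p(scores, labels)
print(f"|t| = {t_obs:.2f}, empirical p = {p_emp:.3f}")
```

In this sketch the statistic is recomputed once per permutation. In a real ANN analysis each recomputation would presumably involve rerunning the network analysis, which is what makes a valid p value expensive and a raw t value attractive as a preliminary screen.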