Gene-centric genomewide association study via entropy
- PMID: 18458106
- PMCID: PMC2390640
- DOI: 10.1534/genetics.107.082370
Gene-centric genomewide association study via entropy
Abstract
Genes are the functional units in most organisms. Compared to genetic variants located outside genes, genic variants are more likely to affect disease risk. The development of the human HapMap project provides an unprecedented opportunity for genetic association studies at the genomewide level for elucidating disease etiology. Currently, most association studies at the single-nucleotide polymorphism (SNP) or the haplotype level rely on the linkage information between SNP markers and disease variants, with which association findings are difficult to replicate. Moreover, variants in genes might not be sufficiently covered by currently available methods. In this article, we present a gene-centric approach via entropy statistics for a genomewide association study to identify disease genes. The new entropy-based approach considers genic variants within one gene simultaneously and is developed on the basis of a joint genotype distribution among genetic variants for an association test. A grouping algorithm based on a penalized entropy measure is proposed to reduce the dimension of the test statistic. Type I error rates and power of the entropy test are evaluated through extensive simulation studies. The results indicate that the entropy test has stable power under different disease models with a reasonable sample size. Compared to single SNP-based analysis, the gene-centric approach has greater power, especially when there is more than one disease variant in a gene. As the genomewide genic SNPs become available, our entropy-based gene-centric approach would provide a robust and computationally efficient way for gene-based genomewide association study.
Figures





Similar articles
-
The application of the entropy-based statistic for genomic association study of QTL.J Genet Genomics. 2008 Mar;35(3):183-8. doi: 10.1016/S1673-8527(08)60025-9. J Genet Genomics. 2008. PMID: 18355762
-
A gene-centric approach to genome-wide association studies.Nat Rev Genet. 2006 Nov;7(11):885-91. doi: 10.1038/nrg1962. Nat Rev Genet. 2006. PMID: 17047687 Review.
-
Generalized T2 test for genome association studies.Am J Hum Genet. 2002 May;70(5):1257-68. doi: 10.1086/340392. Epub 2002 Mar 29. Am J Hum Genet. 2002. PMID: 11923914 Free PMC article.
-
Genotype-based association analysis via entropy.J Hum Genet. 2012 Nov 26;57(11):734-7. doi: 10.1038/jhg.2012.102. Epub 2012 Aug 23. J Hum Genet. 2012. PMID: 22914671
-
Tag SNP selection for association studies.Genet Epidemiol. 2004 Dec;27(4):365-74. doi: 10.1002/gepi.20028. Genet Epidemiol. 2004. PMID: 15372618 Review.
Cited by
-
A new permutation strategy of pathway-based approach for genome-wide association study.BMC Bioinformatics. 2009 Dec 18;10:429. doi: 10.1186/1471-2105-10-429. BMC Bioinformatics. 2009. PMID: 20021635 Free PMC article.
-
Sparse group variable selection for gene-environment interactions in the longitudinal study.Genet Epidemiol. 2022 Jul;46(5-6):317-340. doi: 10.1002/gepi.22461. Epub 2022 Jun 29. Genet Epidemiol. 2022. PMID: 35766061 Free PMC article.
-
Integrative analysis of gene-environment interactions under a multi-response partially linear varying coefficient model.Stat Med. 2014 Dec 10;33(28):4988-98. doi: 10.1002/sim.6287. Epub 2014 Aug 21. Stat Med. 2014. PMID: 25146388 Free PMC article.
-
A modified generalized Fisher method for combining probabilities from dependent tests.Front Genet. 2014 Feb 20;5:32. doi: 10.3389/fgene.2014.00032. eCollection 2014. Front Genet. 2014. PMID: 24600471 Free PMC article.
-
A versatile gene-based test for genome-wide association studies.Am J Hum Genet. 2010 Jul 9;87(1):139-45. doi: 10.1016/j.ajhg.2010.06.009. Am J Hum Genet. 2010. PMID: 20598278 Free PMC article.
References
-
- Anteby, E. Y., C. Greenfield, S. Natanson-Yaron, D. Goldman-Wohl, Y. Hamani et al., 2004. Vascular endothelial growth factor, epidermal growth factor and fibroblast growth factor-4 and -10 stimulate trophoblast plasminogen activator system and metalloproteinase-9. Mol. Hum. Reprod. 10 229–235. - PubMed
-
- Benjamini, Y., and Y. Hochberg, 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57 289–300.
-
- Conley, Y. P., A. Thalamuthu, J. Jacobsdottir, D. E. Weeks, T. Mah et al., 2005. Candidate gene analysis suggests a role for fatty acid biosynthesis and regulation of the complement system in the etiology of age-related maculopathy. Hum. Mol. Genet. 14 1991–2002. - PubMed
-
- Cover, T. M., and J. A. Thomas, 1991. Elements of Information Theory, pp. 12–15. Wiley, New York.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials