Designing a GWAS: power, sample size, and data structure
- PMID: 23756887
- DOI: 10.1007/978-1-62703-447-0_3
Designing a GWAS: power, sample size, and data structure
Abstract
In this chapter we describe a novel Bayesian approach to designing GWAS studies with the goal of ensuring robust detection of effects of genomic loci associated with trait variation.The goal of GWAS is to detect loci associated with variation in traits of interest. Finding which of 500,000-1,000,000 loci has a practically significant effect is a difficult statistical problem, like finding a needle in a haystack. We address this problem by designing experiments to detect effects with a given Bayes factor, where the Bayes factor is chosen sufficiently large to overcome the low prior odds for genomic associations. Methods are given for various possible data structures including random population samples, case-control designs, transmission disequilibrium tests, sib-based transmission disequilibrium tests, and other family-based designs including designs for plants with clonal replication. We also consider the problem of eliciting prior information from experts, which is necessary to quantify prior odds for loci. We advocate a "subjective" Bayesian approach, where the prior distribution is considered as a mathematical representation of our prior knowledge, while also giving generic formulae that allow conservative computations based on low prior information, e.g., equivalent to the information in a single sample point. Examples using R and the R packages ldDesign are given throughout.
Similar articles
-
Statistical analysis of genomic data.Methods Mol Biol. 2013;1019:171-92. doi: 10.1007/978-1-62703-447-0_7. Methods Mol Biol. 2013. PMID: 23756891
-
Implementing a QTL detection study (GWAS) using genomic prediction methodology.Methods Mol Biol. 2013;1019:275-98. doi: 10.1007/978-1-62703-447-0_11. Methods Mol Biol. 2013. PMID: 23756895
-
Detailed analysis of the relative power of direct and indirect association studies and the implications for their interpretation.Hum Hered. 2007;64(1):63-73. doi: 10.1159/000101424. Epub 2007 Apr 27. Hum Hered. 2007. PMID: 17483598
-
Overview of Statistical Methods for Genome-Wide Association Studies (GWAS).Methods Mol Biol. 2013;1019:149-69. doi: 10.1007/978-1-62703-447-0_6. Methods Mol Biol. 2013. PMID: 23756890 Review.
-
[A review of power and sample size estimation in genomewide association studies].J Prev Med Public Health. 2007 Mar;40(2):114-21. doi: 10.3961/jpmph.2007.40.2.114. J Prev Med Public Health. 2007. PMID: 17426422 Review. Korean.
Cited by
-
Genome-wide association study of footrot in Texel sheep.Genet Sel Evol. 2015 Apr 30;47(1):35. doi: 10.1186/s12711-015-0119-3. Genet Sel Evol. 2015. PMID: 25926335 Free PMC article.
-
AccuCalc: A Python Package for Accuracy Calculation in GWAS.Genes (Basel). 2023 Jan 1;14(1):123. doi: 10.3390/genes14010123. Genes (Basel). 2023. PMID: 36672864 Free PMC article.
-
A Parallel Population Genomic and Hydrodynamic Approach to Fishery Management of Highly-Dispersive Marine Invertebrates: The Case of the Fijian Black-Lip Pearl Oyster Pinctada margaritifera.PLoS One. 2016 Aug 25;11(8):e0161390. doi: 10.1371/journal.pone.0161390. eCollection 2016. PLoS One. 2016. PMID: 27559735 Free PMC article.
-
Optimizing Translational Research for Exceptional Health and Life Span: A Systematic Narrative of Studies to Identify Translatable Therapeutic Target(s) for Exceptional Health Span in Humans.J Gerontol A Biol Sci Med Sci. 2022 Nov 21;77(11):2272-2280. doi: 10.1093/gerona/glac065. J Gerontol A Biol Sci Med Sci. 2022. PMID: 35279027 Free PMC article.
-
Reviewing the essential roles of remote phenotyping, GWAS and explainable AI in practical marker-assisted selection for drought-tolerant winter wheat breeding.Front Plant Sci. 2024 Apr 18;15:1319938. doi: 10.3389/fpls.2024.1319938. eCollection 2024. Front Plant Sci. 2024. PMID: 38699541 Free PMC article. Review.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources