SNP prioritization using a Bayesian probability of association
- PMID: 23280596
- PMCID: PMC3725584
- DOI: 10.1002/gepi.21704
SNP prioritization using a Bayesian probability of association
Abstract
Prioritization is the process whereby a set of possible candidate genes or SNPs is ranked so that the most promising can be taken forward into further studies. In a genome-wide association study, prioritization is usually based on the P-values alone, but researchers sometimes take account of external annotation information about the SNPs such as whether the SNP lies close to a good candidate gene. Using external information in this way is inherently subjective and is often not formalized, making the analysis difficult to reproduce. Building on previous work that has identified 14 important types of external information, we present an approximate Bayesian analysis that produces an estimate of the probability of association. The calculation combines four sources of information: the genome-wide data, SNP information derived from bioinformatics databases, empirical SNP weights, and the researchers' subjective prior opinions. The calculation is fast enough that it can be applied to millions of SNPS and although it does rely on subjective judgments, those judgments are made explicit so that the final SNP selection can be reproduced. We show that the resulting probability of association is intuitively more appealing than the P-value because it is easier to interpret and it makes allowance for the power of the study. We illustrate the use of the probability of association for SNP prioritization by applying it to a meta-analysis of kidney function genome-wide association studies and demonstrate that SNP selection performs better using the probability of association compared with P-values alone.
© 2012 WILEY PERIODICALS, INC.
Conflict of interest statement
None of the authors declares any conflict of interest.
Similar articles
-
A latent model for prioritization of SNPs for functional studies.PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8. PLoS One. 2011. PMID: 21687685 Free PMC article.
-
Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1. Genet Epidemiol. 2016. PMID: 26833494 Free PMC article.
-
Importance of different types of prior knowledge in selecting genome-wide findings for follow-up.Genet Epidemiol. 2013 Feb;37(2):205-13. doi: 10.1002/gepi.21705. Genet Epidemiol. 2013. PMID: 23307621 Free PMC article.
-
Bayesian statistical methods in genetic association studies: Empirical examination of statistically non-significant Genome Wide Association Study (GWAS) meta-analyses in cancers: A systematic review.Gene. 2019 Feb 15;685:170-178. doi: 10.1016/j.gene.2018.10.057. Epub 2018 Oct 26. Gene. 2019. PMID: 30416053
-
Bayesian statistical methods for genetic association studies.Nat Rev Genet. 2009 Oct;10(10):681-90. doi: 10.1038/nrg2615. Nat Rev Genet. 2009. PMID: 19763151 Review.
Cited by
-
iFunMed: Integrative functional mediation analysis of GWAS and eQTL studies.Genet Epidemiol. 2019 Oct;43(7):742-760. doi: 10.1002/gepi.22217. Epub 2019 Jul 22. Genet Epidemiol. 2019. PMID: 31328826 Free PMC article.
-
Biologically Enhanced Genome-Wide Association Study Provides Further Evidence for Candidate Loci and Discovers Novel Loci That Influence Risk of Anterior Cruciate Ligament Rupture in a Dog Model.Front Genet. 2021 Mar 5;12:593515. doi: 10.3389/fgene.2021.593515. eCollection 2021. Front Genet. 2021. PMID: 33763109 Free PMC article.
-
Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects.Genet Epidemiol. 2017 May;41(4):320-331. doi: 10.1002/gepi.22038. Epub 2017 Apr 10. Genet Epidemiol. 2017. PMID: 28393391 Free PMC article.
-
The Evolving Field of Genetic Epidemiology: From Familial Aggregation to Genomic Sequencing.Am J Epidemiol. 2019 Dec 31;188(12):2069-2077. doi: 10.1093/aje/kwz193. Am J Epidemiol. 2019. PMID: 31509181 Free PMC article. Review.
-
Exploring the underlying biology of intrinsic cardiorespiratory fitness through integrative analysis of genomic variants and muscle gene expression profiling.J Appl Physiol (1985). 2019 May 1;126(5):1292-1314. doi: 10.1152/japplphysiol.00035.2018. Epub 2019 Jan 3. J Appl Physiol (1985). 2019. PMID: 30605401 Free PMC article.
References
-
- Goodman SN. A comment on replication, p-values and evidence. Statistics in Medicine. 2007;11:875–879. - PubMed
-
- Gögele M, Minelli C, Thakkinstian A, Yurkiewich A, Pattaro C, Pramstaller P, Little J, Attia J, Thompson JR. Methods for meta-analysis of genome-wide association studies: critical assessment of empirical evidence. American Journal of Epidemiology. 2012;175:739–749. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources