Review

Nat Rev Genet. 2018 Aug;19(8):491-504.

doi: 10.1038/s41576-018-0016-z.

From genome-wide associations to candidate causal variants by statistical fine-mapping

Daniel J Schaid¹, Wenan Chen², Nicholas B Larson³

Affiliations

¹ Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN, USA. schaid@mayo.edu.
² Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, TN, USA.
³ Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN, USA.

PMID: 29844615
PMCID: PMC6050137
DOI: 10.1038/s41576-018-0016-z

Daniel J Schaid et al. Nat Rev Genet. 2018 Aug.

Abstract

Advancing from statistical associations of complex traits with genetic markers to understanding the functional genetic variants that influence traits is often a complex process. Fine-mapping can select and prioritize genetic variants for further study, yet the multitude of analytical strategies and study designs makes it challenging to choose an optimal approach. We review the strengths and weaknesses of different fine-mapping approaches, emphasizing the main factors that affect performance. Topics include interpreting results from genome-wide association studies (GWAS), the role of linkage disequilibrium, statistical fine-mapping approaches, trans-ethnic studies, genomic annotation and data integration, and other analysis and design issues.

Conflict of interest statement

Competing Interests: None

Figures

**Figure 1. Flow of a typical process from initial GWAS to annotation of SNPs selected from fine-mapping analyses**
Based on GWAS p-values summarized in a Manhattan plot, a list of single-nucleotide polymorphisms (SNPs) that achieve genome-wide statistical significance (i.e., p-value <5×10⁻⁸) is used to determine regions of interest for fine-mapping. Each region is typically explored according to the structure of linkage disequilibrium among SNPs using Haploview plots. Statistical associations are viewed with LocusZoom plots that illustrate the patterns of association of each SNP with the lead SNP, as well as annotation of genes in the region. The regions can then be partitioned into independent sub-regions to ease computational burden, based on statistical models that evaluate the simultaneous effects of multiple SNPs on a trait. Statistical fine-mapping is conducted in each region, using one of the methods illustrated in Figure 2. The SNPs selected from fine-mapping are then annotated with genomic features to prioritize follow-up functional studies. Figure is adapted from REF.
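
The first step of this workflow, turning genome-wide significant SNPs into regions of interest, can be sketched in a few lines of Python. This is only an illustration, not the authors' procedure: the function name `define_regions`, the input layout `(chrom, pos, pvalue)`, and the 500 kb flanking window are all assumptions chosen for the example. Only the 5×10⁻⁸ threshold comes from the caption.

```python
GENOME_WIDE_P = 5e-8  # genome-wide significance threshold from the caption


def define_regions(snps, flank=500_000):
    """Group genome-wide significant SNPs into candidate fine-mapping regions.

    snps  : iterable of (chrom, pos, pvalue) tuples (hypothetical layout).
    flank : bases added on each side of a significant SNP (assumed value).
    Overlapping windows on the same chromosome are merged into one region,
    and the SNP with the smallest p-value is recorded as the lead SNP.
    """
    hits = sorted((c, p, pv) for c, p, pv in snps if pv < GENOME_WIDE_P)
    regions = []
    for chrom, pos, pval in hits:
        start, end = max(0, pos - flank), pos + flank
        last = regions[-1] if regions else None
        if last and last["chrom"] == chrom and start <= last["end"]:
            # Window overlaps the previous region: extend it.
            last["end"] = max(last["end"], end)
            if pval < last["lead_p"]:
                last["lead_pos"], last["lead_p"] = pos, pval
        else:
            regions.append({"chrom": chrom, "start": start, "end": end,
                            "lead_pos": pos, "lead_p": pval})
    return regions


if __name__ == "__main__":
    toy = [("1", 1_000_000, 1e-9), ("1", 1_200_000, 1e-12),
           ("1", 5_000_000, 1e-8), ("2", 100_000, 0.01),
           ("2", 300_000, 3e-8)]
    for region in define_regions(toy):
        print(region)
```

In practice, region boundaries are usually refined using the linkage disequilibrium structure described in the caption rather than a fixed physical distance.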

**Figure 3. Power of conditional analysis**
This figure illustrates how conditional analyses have weaker power to detect secondary associated single-nucleotide polymorphisms (SNPs) compared to the power of an initial genome-wide association study (GWAS). Power of conditional analyses diminishes as the correlation of a primary SNP (indicated by SNP₁) and a secondary SNP (indicated by SNP₂) increases, and when the effect size of a secondary SNP is weaker than that for a primary SNP. For this figure, the power for an initial GWAS to detect a primary SNP₁ is 90% for an effect size of R² = 1% of explained trait variation. The effect size of a secondary SNP₂ is varied from 100% to 50% of the effect size of primary SNP₁.
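
The qualitative pattern in this caption can be reproduced with a short calculation. This is a sketch under stated assumptions, not the computation behind the published figure. It assumes a 1-degree-of-freedom test in an additive linear model, the same threshold of 5×10⁻⁸ for the conditional test, and a non-centrality parameter for SNP₂ given SNP₁ that is attenuated by the factor (1 − r²), where r is the correlation between the two SNPs. The function names are hypothetical.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()


def two_sided_power(ncp, alpha=5e-8):
    """Power of a two-sided 1-df test with non-centrality parameter ncp."""
    z_crit = -STD_NORMAL.inv_cdf(alpha / 2.0)
    shift = ncp ** 0.5
    return STD_NORMAL.cdf(shift - z_crit) + STD_NORMAL.cdf(-shift - z_crit)


def ncp_for_power(power, alpha=5e-8):
    """Non-centrality giving the requested power (opposite tail ignored)."""
    z_crit = -STD_NORMAL.inv_cdf(alpha / 2.0)
    return (z_crit + STD_NORMAL.inv_cdf(power)) ** 2


def conditional_power(r, effect_ratio, primary_power=0.90, alpha=5e-8):
    """Approximate power to detect SNP2 after conditioning on SNP1.

    r            : correlation between SNP1 and SNP2.
    effect_ratio : effect size (R^2) of SNP2 relative to SNP1, e.g. 0.5.
    Assumption: conditioning scales the non-centrality by (1 - r**2).
    """
    ncp_primary = ncp_for_power(primary_power, alpha)
    ncp_secondary = ncp_primary * effect_ratio * (1.0 - r * r)
    return two_sided_power(ncp_secondary, alpha)


if __name__ == "__main__":
    for ratio in (1.0, 0.75, 0.5):
        row = [round(conditional_power(r, ratio), 3) for r in (0.0, 0.3, 0.6, 0.9)]
        print(f"effect ratio {ratio}: {row}")
```

With r = 0 and an equal effect size, the sketch returns the 90% power of the initial GWAS. Power then falls as r grows or as the secondary effect shrinks, in line with the pattern the caption describes.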

**Figure 4. Posterior probability for a single causal SNP when 5–40 SNPs are in a region of interest**
The prior probability that a SNP is causal is assumed to be equal for all SNPs. Sample size (N) ranges from 500–20,000, and the percent of trait variation explained by the causal variant, R², is 1%. SNPs are assumed to be equally correlated with magnitude ρ. The horizontal dotted line is for equal prior probabilities for SNPs, and the posterior probability approaches this line when the data have little information to distinguish causal from non-causal SNPs.
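
A rough version of this calculation can be written down under the single-causal-SNP assumption, where each SNP's posterior probability is proportional to its prior times its Bayes factor. The sketch below is illustrative only and is not claimed to be the method used for the figure. It plugs in expected z-scores (√(N·R²) for the causal SNP, ρ times that for the others) instead of averaging over sampling noise, and it uses an approximate Bayes factor with a hypothetical prior effect variance `w`.

```python
import math


def log_abf(z, n, w=0.04):
    """Log approximate Bayes factor for one SNP from its z-score.

    Assumes z ~ N(0, 1) under the null and z ~ N(0, 1 + n*w) under the
    alternative; w is an assumed prior variance of the standardized effect.
    """
    shrink = n * w / (1.0 + n * w)
    return -0.5 * math.log(1.0 + n * w) + 0.5 * z * z * shrink


def posterior_single_causal(z_scores, n, w=0.04):
    """Posterior probability that each SNP is the single causal SNP,
    with equal prior probabilities across SNPs."""
    logs = [log_abf(z, n, w) for z in z_scores]
    top = max(logs)  # subtract the maximum for numerical stability
    weights = [math.exp(v - top) for v in logs]
    total = sum(weights)
    return [x / total for x in weights]


def expected_z(n, r2, rho, m):
    """Expected z-scores: causal SNP first, then m - 1 SNPs correlated rho."""
    z_causal = math.sqrt(n * r2)
    return [z_causal] + [rho * z_causal] * (m - 1)


if __name__ == "__main__":
    for n in (500, 5000, 20000):
        z = expected_z(n, r2=0.01, rho=0.9, m=5)
        print(n, round(posterior_single_causal(z, n)[0], 3))
```

When ρ = 1 the SNPs are indistinguishable and every posterior equals the prior 1/m, which corresponds to the horizontal dotted line in the figure. Larger N or smaller ρ moves the causal SNP's posterior toward 1.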

Grants and funding

R01 GM065450/GM/NIGMS NIH HHS/United States
