Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul;36(5):419-29.
doi: 10.1002/gepi.21637. Epub 2012 Apr 30.

Is it rare or common?

Affiliations

Is it rare or common?

Kaustubh Adhikari et al. Genet Epidemiol. 2012 Jul.

Abstract

Many genome-wide association studies (GWAS) have signals with unknown etiology. This paper addresses the question-is such an association signal caused by rare or common variants that lead to increased disease risk? For a genomic region implicated by a GWAS, we use single nucleotide polymorphism (SNP) data in a case-control setting to predict how many common or rare variants there are, using a Bayesian analysis. Our objective is to compute posterior probabilities for configurations of rare and/or common variants. We use an extension of coalescent trees--the ancestral recombination graphs--to model the genealogical history of the samples based on marker data. As we expect SNPs to be in linkage disequilibrium with common disease variants, we can expect the trees to reflect the type of variants. To demonstrate the application, we apply our method to candidate gene sequencing data from a German case-control study on nonsyndromic cleft lip with or without cleft palate.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Two scenarios where a complex disease is caused by a common or two rare variants
Figure 2
Figure 2. Flowchart for the algorithm
Figure 3
Figure 3. A simulated dendrogram for the dataset with cases and controls shown corresponding to the leaves, as red and black dots respectively
Note that some subjects have identical genotypes and therefore are grouped together.
Figure 4
Figure 4. Smoothed posterior for real dataset
The X and Y axes denote counts of rare and common variants, and the Z axis shows the (un-normalized) posterior density. The density is smoothed with a Gaussian kernel.
Figure 5
Figure 5. Posterior for simulated dataset with rare variants
Figure 6
Figure 6. Posterior for simulated dataset with common variants
Figure 7
Figure 7. Posterior for simulated dataset with no variants
Figure 8
Figure 8. Some possible DSL configurations

References

    1. Asimit J, Zeggini E. Rare variant association analysis methods for complex traits. Annual Review of Genetics. 2010;44(1):293–308. - PubMed
    1. Balding DJ, Bishop M, Cannings C. Handbook of Statistical Genetics. 3 edition. Wiley-Interscience; 2007.
    1. Bodmer W, Bonilla C. Common and rare variants in multifactorial susceptibility to common diseases. Nature Genetics. 2008;40(6):695–701. - PMC - PubMed
    1. Chen GK, Marjoram P, Wall JD. Fast and flexible simulation of dna sequence data. Genome Research. 2009;19(1):136–142. - PMC - PubMed
    1. Crawford DC, Bhangale T, Li N, Hellenthal G, Rieder MJ, Nickerson DA, Stephens M. Evidence for substantial fine-scale variation in recombination rates across the human genome. Nature Genetics. 2004;36(7):700–706. - PubMed

Publication types

Substances