Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr;190(4):1521-31.
doi: 10.1534/genetics.111.136937. Epub 2012 Jan 20.

Detecting rare variant associations by identity-by-descent mapping in case-control studies

Affiliations

Detecting rare variant associations by identity-by-descent mapping in case-control studies

Sharon R Browning et al. Genetics. 2012 Apr.

Abstract

Identity-by-descent (IBD) mapping tests whether cases share more segments of IBD around a putative causal variant than do controls. These segments of IBD can be accurately detected from genome-wide SNP data. We investigate the power of IBD mapping relative to that of SNP association testing for genome-wide case-control SNP data. Our focus is particularly on rare variants, as these tend to be more recent and hence more likely to have recent shared ancestry. We simulate data from both large and small populations and find that the relative performance of IBD mapping and SNP association testing depends on population demographic history and the strength of selection against causal variants. We also present an IBD mapping analysis of a type 1 diabetes data set. In those data we find that we can detect association only with the HLA region using IBD mapping. Overall, our results suggest that IBD mapping may have higher power than association analysis of SNP data when multiple rare causal variants are clustered within a gene. However, for outbred populations, very large sample sizes may be required for genome-wide significance unless the causal variants have strong effects.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Simulation scheme. Each simulated region is made up of 100 simulated segments of length 1 kb with gaps of length 1 kb between them. The central five segments can contain causal SNPs. Causal SNPs are those that the simulation program designates as protein-changing mutations. These SNPs have been subject to negative selection at a specified rate. Only the causal SNPs and one SNP per segment with highest minor allele frequency (MAF) are retained. The causal SNPs are used to determine disease status, while the high MAF SNPs are tested in the association analysis. IBD status is determined through further simulation, as described in the main text.
Figure 2
Figure 2
Distribution of lengths of detected IBD segments in the WTCCC type 1 diabetes data. IBD segments were detected using BEAGLE fastIBD. Lengths greater than 8 cM are not shown.
Figure 3
Figure 3
Permutation P-values for the IBD test in the WTCCC type 1 diabetes data. P-values were calculated at every tenth marker along the autosomes. The smallest possible P-value from the 5,000,000 permutations (2 × 10−7) is shown by the black horizontal line. The genome-wide significance level determined by 1000 permutations (6 × 10−6) is shown by the blue horizontal line.

Similar articles

Cited by

References

    1. Albrechtsen A., Korneliussen T. S., Moltle I., Hansen T. V., Nielsen F. C., et al. , 2009. Relatedness mapping and tracts of relatedness for genome-wide data in the presence of linkage disequilibrium. Genet. Epidemiol. 33(3): 266–274 - PubMed
    1. Aldous D. J., 2010. Probability Approximations via the Poisson Clumping Heuristic. Springer-Verlag, New York
    1. Baker P. R., Steck A. K., 2011. The past, present, and future of genetic associations in type 1 diabetes. Curr. Diab. Rep. 11(5): 445–453 - PubMed
    1. Barrett J. C., Clayton D. G., Concannon P., Akolkar B., Cooper J. D., et al. , 2009. Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes. Nat. Genet. 41(6): 703–707 - PMC - PubMed
    1. Bercovici S., Meek C., Wexler Y., Geiger D., 2010. Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping. Bioinformatics 26(12): i175–i182 - PMC - PubMed

Publication types