Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Jun;68(6):1447-56.
doi: 10.1086/320607. Epub 2001 May 16.

The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures

Affiliations

The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures

J M Akey et al. Am J Hum Genet. 2001 Jun.

Abstract

The rapid development of a dense single-nucleotide-polymorphism marker map has stimulated numerous studies attempting to characterize the magnitude and distribution of background linkage disequilibrium (LD) within and between human populations. Although genotyping errors are an inherent problem in all LD studies, there have been few systematic investigations documenting their consequences on estimates of background LD. Therefore, we derived simple deterministic formulas to investigate the effect that genotyping errors have on four commonly used LD measures-D', r, Q, and d-in studies of background LD. We have found that genotyping error rates as small as 3% can have serious affects on these LD measures, depending on the allele frequencies and the assumed error model. Furthermore, we compared the robustness of D', r, Q, and d, in the presence of genotyping errors. In general, Q and d are more robust than D' and r, although exceptions do exist. Finally, through stochastic simulations, we illustrate how genotyping errors can lead to erroneous inferences when measures of LD between two samples are compared.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Symbolic and mathematical representations of error models used in analyses. The genotyping-error rates at loci 1 and 2 are denoted as μ and ν, respectively. Panels A and B correspond to the SEM and the DEM, respectively.
Figure  2
Figure 2
The effect that genotyping errors have on FT values, as a function of genotyping-error rate. In panel A, the haplotype frequencies are PAB=.3, PAb=.3, PaB=.1, and Pab=.3; in panel B, the haplotype frequencies are PAB=.1, PAb=.4, PaB=.3, and Pab=.2. Note that, in panel B, the lines corresponding to r and Q for the SEM are nearly superimposable.
Figure  3
Figure 3
Comparison of the robustness of D′, r, Q, and d. The ratio of FT values is plotted versus PAB. The curves correspond to the ratios d/r (blue), r/D (green), r/Q (orange), Q/d (red), Q/D (black), and D/d (brown). In panel A, the haplotype frequencies are PAb=PaB=0 and Pab=1-PAB. In panel B, the haplotype frequencies are PAb=PaB=.25 and Pab=.5-PAB. The genotyping-error rate was set at 5%.
Figure 4
Figure 4
Distribution of the difference, in absolute value of D′, between two simulated samples, in the presence of genotyping errors. In panel A, the allele frequencies are PA=PB=.5; in panel B, the allele frequencies are PA=PB=.8. For both panels, the value of formula image in the absence of genotyping errors is 0. Note that as the error rates increase the distribution begins to shift away from the expected difference of 0.

Similar articles

Cited by

References

    1. Abecasis GR, Noguchi E, Heinzmann A, Traherne JA, Bhattacharyya S, Leaves NI, Anderson GG, Zhang Y, Lench NJ, Carey A, Cardon LR, Moffatt MF, Cookson WO (2001) Extent and distribution of linkage disequilibrium in three genomic regions. Am J Hum Genet 68:191–197 - PMC - PubMed
    1. Akey J, Jin L, Xiong M (2001a) Haplotypes vs. single marker linkage disequilibrium tests: what do we gain? Eur J Hum Genet 9:291–300 - PubMed
    1. Akey JM, Sosnoski D, Parra E, Dios S, Hiester K, Su B, Bonilla C, Jin L, Shriver MD (2001b) Melting curve analysis of SNPs (McSNP): a gel-free and inexpensive approach for SNP genotyping. Biotechniques 30:358–367 - PubMed
    1. Buetow KH (1991) Influence of aberrant observations on high-resolution linkage analysis outcomes. Am J Hum Genet 49:985–994 - PMC - PubMed
    1. Cho RJ, Mindrinos M, Richards DR, Sapolsky RJ, Anderson M, Drenkard E, Dewdney J, Reuber TL, Stammers M, Federspiel N (1999) Genome-wide mapping with biallelic markers in Arabidopsis thaliana. Nat Genet 23:203–207 - PubMed

Publication types