Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Jun;74(6):1136-53.
doi: 10.1086/421329. Epub 2004 May 6.

Linkage analysis of a complex disease through use of admixed populations

Affiliations

Linkage analysis of a complex disease through use of admixed populations

Xiaofeng Zhu et al. Am J Hum Genet. 2004 Jun.

Abstract

Linkage disequilibrium arising from the recent admixture of genetically distinct populations can be potentially useful in mapping genes for complex diseases. McKeigue has proposed a method that conditions on parental admixture to detect linkage. We show that this method tests for linkage only under specific assumptions, such as equal admixture in the parental generation and admixture that occurs in a single generation. In practice, these assumptions are unlikely to hold for natural populations, resulting in an inflation of the type I error rate when testing for linkage by this method. In this article, we generalize McKeigue's approach of testing for linkage to allow two different admixture models: (1) intermixture admixture and (2) continuous gene flow. We calculate the sample size required for a genomewide search by this method under different disease models: multiplicative, additive, recessive, and dominant. Our results show that the sample size required to obtain 90% power to detect a putative mutant allele at a genomewide significance level of 5% can usually be achieved in practice if informative markers are available at a density of 2 cM.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Two admixture models used in the calculation of ancestral probability. Left panel, IA model. Right panel, CGF model.
Figure  2
Figure 2
IA model. Number of cases required for 90% power to detect linkage at a significance level of .001 at different population risk ratios and recombination fractions between marker and disease loci under four genetic models: multiplicative, additive, recessive (with the low-risk allele in the low-risk population), and dominant (with the high-risk allele in the high-risk population). Upper panels, Total contributions of the parental populations X and Y are 50%/50% (λ=1.0). Lower panels, Total contributions of parental populations X and Y are 74%/26% (λ=0.52). Samples are drawn from the 10th generation.
Figure  3
Figure 3
CGF model. Number of cases required for 90% power to detect linkage at a significance level of .001 at different population risk ratios and recombination fractions between marker and disease loci under four genetic models: multiplicative, additive, recessive (with the low-risk allele in the low-risk population), and dominant (with the high-risk allele in the high-risk population). Upper panels, Total contributions of parental populations X and Y are 50%/50% (λ=1). Lower panels, total contributions of parental population X and Y are 74%/26% (λ=0.06). Samples are drawn from the 10th generation.
Figure  4
Figure 4
Number of cases required for 90% power to detect linkage at a genomewide significance level of .05, when 3,000 markers are evenly placed along the genome, under four genetic models: multiplicative, additive, recessive (with the low-risk allele in the low-risk population), and dominant (with the high-risk allele in the high-risk population). The population has been admixed for 10 generations, according to the CGF model. Total contributions of parental populations X and Y are 74%/26% (λ=0.06). Left, X has a higher population risk than Y; the curve for the dominant model does not appear because the number is >2,500. Right, Y has a higher population risk than X. Samples are drawn from the 10th generation.
Figure  5
Figure 5
Type I error rate when we test linkage by testing r=1 under four genetic models: multiplicative, additive, recessive (with the low-risk allele in the low-risk population), and dominant (with the high-risk allele in the high-risk population). Total contributions of the parental populations X and Y are 74%/26%. Left, IA model. Right, CGF model. Two hundred fifty cases are drawn from the 10th generation.
Figure  6
Figure 6
Histograms of estimated ancestral proportions (Π) across the markers; 1,000 markers on 1,000 individuals were generated. A, Data simulated using the IA model at a marker density of 1 marker/cM. B, Data simulated using the CGF model at a marker density of 1 marker/cM. C, Data simulated using the IA model at a marker density of 1 marker/5cM. D, Data simulated using the CGF model at a marker density of 1 marker/5 cM.
Figure  7
Figure 7
Comparisons of a person’s estimated and true ancestral proportion; 1,000 markers were generated on 1,000 individuals. A–D, Known ancestral allele frequencies. E–H, Ancestral allele frequencies estimated by STRUCTURE. A and E, Data simulated using the IA model at a marker density of 1 marker/cM. B and F, Data simulated using the CGF model at a marker density of 1 marker/cM. C and G, Data simulated using the IA model at a marker density of 1 marker/5 cM. D and H, Data simulated using the CGF model at a marker density of 1 marker/5 cM.

References

    1. Chakraborty R, Weiss KM (1988) Admixture as a tool for finding linked genes and detecting that difference from allelic association between loci. Proc Natl Acad Sci USA 85:9119–9123 - PMC - PubMed
    1. Cheung KH, Osier MV, Kidd JR, Pakstis AJ, Miller PL, Kidd KK (2000) ALFRED: an allele frequency database for diverse populations and DNA polymorphisms. Nucleic Acids Res 29:361–36310.1093/nar/28.1.361 - DOI - PMC - PubMed
    1. Collins-Schramm HE, Phillips CM, Operario DJ, Lee JS, Weber JL, Hanson RL, Knowler WC, Cooper R, Li H, Seldin MF (2002) Ethnic-difference markers for use in mapping by admixture linkage disequilibrium. Am J Hum Genet 70:737–750 - PMC - PubMed
    1. Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004 - PubMed
    1. Efron B (2004) Large-scale simultaneous hypothesis testing: the choice of a null hypothesis. J Am Stat Assoc 99:96–104

Publication types

Substances