Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2002 Dec;12(12):1846-53.
doi: 10.1101/gr.483802.

Haplotype and linkage disequilibrium architecture for human cancer-associated genes

Affiliations
Comparative Study

Haplotype and linkage disequilibrium architecture for human cancer-associated genes

Penelope E Bonnen et al. Genome Res. 2002 Dec.

Abstract

To facilitate association-based linkage studies we have studied the linkage disequilibrium (LD) and haplotype architecture around five genes of interest for cancer risk: ATM, BRCA1, BRCA2, RAD51, and TP53. Single nucleotide polymorphisms (SNPs) were identified and used to construct haplotypes that span 93-200 kb per locus with an average SNP density of 12 kb. These markers were genotyped in four ethnically defined populations that contained 48 each of African Americans, Asian Americans, Hispanic Americans, and European Americans. Haplotypes were inferred using an expectation maximization (EM) algorithm, and the data were analyzed using D', R(2), Fisher's exact P-values, and the four-gamete test for recombination. LD levels varied widely between loci from continuously high LD across 200 kb to a virtual absence of LD across a similar length of genome. LD structure also varied at each gene and between populations studied. This variation indicates that the success of linkage-based studies will require a precise description of LD at each locus and in each population to be studied. One striking consistency between genes was that at each locus a modest number of haplotypes present in each population accounted for a high fraction of the total number of chromosomes. We conclude that each locus has its own genomic profile with regard to LD, and despite this there is the widespread trend of relatively low haplotype diversity. As a result, a low marker density should be adequate to identify haplotypes that represent the common variation at a locus, thereby decreasing costs and increasing efficacy of association studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Fixation index (FST) of SNP allele frequencies across populations. SNP allele frequencies varied between ethnic groups. The standard deviation of that variation across populations was calculated for each SNP that was used in haplotype and LD analyses. The majority of RAD51 SNPs had an FST > 0.081. Excluding RAD51, 100% of SNPs had an FST < 0.081, and 71% of SNPs had an FST < 0.05.
Figure 2
Figure 2
LD intensity across loci. LD was measured for the five loci in all populations using the statistic ‖D‘‖ in a pairwise manner across markers. GOLD generates these plots through interpolation of the resulting triangular matrices. ‖D‘‖ ranges from 1 to 0, with 1 showing in red and 0 in dark blue.
Figure 3
Figure 3
Comparison of LD and recombination measurements at five loci. Pairwise LD was measured by Fisher’s exact P-values and r2. Each was plotted separately by GOLD with a score of 1 showing in red and 0 in dark blue. Recombination was determined using the four-gamete test with R, potential recombination sites, indicated by white Xs. Blue boxes indicate site pairs having four gametic types, which implies that recombination has occurred between these two sites. Red boxes indicate site pairs having less than four gametic types.
Figure 4
Figure 4
LD versus intermarker distance. ‖D‘‖ values for all pairwise comparisons are plotted against the physical distance between each pair of markers. LD measurements were made for each population for each locus. African American data are plotted in blue, Asian American in red, European American in yellow, and Hispanic American in green.

References

    1. Abecasis GR, Cookson WO. GOLD—Graphical overview of linkage disequilibrium. Bioinformatics. 2000;16:182–183. - PubMed
    1. Abecasis GR, Noguchi E, Heinzmann A, Traherne JA, Bhattacharyya S, Leaves NI, Anderson GG, Zhang Y, Lench NJ, Carey A, et al. Extent and distribution of linkage disequilibrium in three genomic regions. Am J Hum Genet. 2001;68:191–197. - PMC - PubMed
    1. Bonnen PE, Story MD, Ashorn CL, Buchholz TA, Weil MM, Nelson DL. Haplotypes at ATM identify coding-sequence variation and indicate a region of extensive linkage disequilibrium. Am J Hum Genet. 2000;67:1437–1451. - PMC - PubMed
    1. Chakraborty R. Gene admixture in human populations: Models and predictions. Yearbook Phys Anthropol. 1986;29:1–43.
    1. Chakravarti A, Buetow KH, Antonarakis SE, Waber PG, Boehm CD, Kazazian HH. Nonuniform recombination within the human β-globin gene cluster. Am J Hum Genet. 1984;36:1239–1258. - PMC - PubMed

Publication types

Substances