Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Nov;71(5):1227-34.
doi: 10.1086/344398. Epub 2002 Oct 15.

Distribution of recombination crossovers and the origin of haplotype blocks: the interplay of population history, recombination, and mutation

Affiliations

Distribution of recombination crossovers and the origin of haplotype blocks: the interplay of population history, recombination, and mutation

Ning Wang et al. Am J Hum Genet. 2002 Nov.

Abstract

Recent studies suggest that haplotypes are arranged into discrete blocklike structures throughout the human genome. Here, we present an alternative haplotype block definition that assumes no recombination within each block but allows for recombination between blocks, and we use it to study the combined effects of demographic history and various population genetic parameters on haplotype block characteristics. Through extensive coalescent simulations and analysis of published haplotype data on chromosome 21, we find that (1) the combined effects of population demographic history, recombination, and mutation dictate haplotype block characteristics and (2) haplotype blocks can arise in the absence of recombination hot spots. Finally, we provide practical guidelines for designing and interpreting studies investigating haplotype block structure.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Diagram of the FGT and haplotype block identification. A, Consider two loci A and B, each with two alleles (denoted as A1, A2, B1, and B2). On the left, only three gametes (A1B1, A1B2, and A2B1) are observed between loci A and B (which arise from mutations; suppose A1B1 is the wild type). On the right, recombination has occurred between A1B2 and A2B1 haplotypes, which leads to all four gametes being observed in a sample. B, Haplotype block identification. The FGT is conducted between pairwise loci, and 0 and 1 denote the absence and presence, respectively, of all four gametes between locus pairs. In this example with eight loci, three haplotype blocks were identified, with the first block including SNPs 1, 2, and 3; the second block including SNPs 4, 5, and 6; and the last block including SNPs 7 and 8.
Figure  2
Figure 2
The effect of recombination and demographic history on haplotype block characteristics; average haplotype block size ± SE versus recombination level for a sample size of 100. If we assume 1 cM = 1 Mb, the recombination rate r is ∼10−8 per generation per intersite. The average block size decreases with the increasing recombination level. The effective population size was adjusted by fixing the ratio of R/θ and allowing θ to vary from 5, 10, and 25 (which corresponds to effective population sizes of 2,000, 4,000, and 10,000).
Figure  3
Figure 3
The effect of sample size on haplotype block characteristics; average haplotype block size versus sample sizes. Under the same parameters θ=25 and R=10, which corresponds to a general population with mutation rate 10−9 per site per year, and recombination rate 10−8 per generation, given effective population size as 104, each generation time as 25 years, and total sequence length as 25 kb, changing the sample size from 20 to 50, 100, 200, 400, 600, 800, and 1,000, respectively. Small sample size reveals increased haplotype block sizes. Each simulation is based on 1,000 replications.
Figure  4
Figure 4
The effect of SNP density of haplotype block characteristics. A, Average block size versus θ. By fixing R, the average block size increases slightly when θ is very small and then decreases until an equilibrium value of θ is reached. If we assume the mutation rate is constant across a local region, changing θ also corresponds to changing the SNP density. The five arrows denote positions where the SNP density is equal to 0.5, 1, 2, 3, and 5 SNPs/kb, respectively. B, An explanation of why θ influences average block size. Each X represents a recombination position, and each ↓ represents an identified SNP marker. When θ is small, two markers are identified and constitute one block as shown in (a); (b) θ increases, three markers are identified, and they form one block whose size is larger compared with (a); (c) As θ continues to increase, six markers are identified to resolve this region into two blocks, whose average block size has now decreased in comparison with (b). Each simulation is based on 1,000 replications.
Figure  5
Figure 5
Haplotype block size distribution of published chromosome 21 data and simulations identified by FGT method. For the simulated data, each parameter combination is based on 1,000 replications. A, Distribution of haplotype block length in real data, simulated data assuming r=1.0*10-8, and simulated data assuming r=4.0*10-8. B, Distribution of haplotype block length in real data and simulated data assuming a mixture model of recombination (see text for details).

References

Electronic-Database Information

    1. Hudson Lab home page, http://home.uchicago.edu/~rhudson1/source.html (for simulation of the coalescent with recombination and migration [see the program “mksamples”])

References

    1. Bogart KP (2000) Introductory combinatorics, 3rd ed. Harcourt Academic Press, San Diego
    1. Collins A, Lonjou C, Morton NE (1999) Genetic epidemiology of single-nucleotide polymorphisms. Proc Natl Acad Sci USA 96:15173–15177 - PMC - PubMed
    1. Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES (2001) High-resolution haplotype structure in the human genome. Nat Genet 29:229–232 - PubMed
    1. Fu YX, Li WH (1999) Coalescing into the 21st century: an overview and prospects of coalescent theory. Theor Popul Biol 56:1–10 - PubMed
    1. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, Defelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D (2002) The structure of haplotype blocks in the human genome. Science 296:2225–2229 - PubMed

Publication types