Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 May;28(5):441-50.
doi: 10.1002/humu.20463.

Patterns of variation in DNA segments upstream of transcription start sites

Affiliations

Patterns of variation in DNA segments upstream of transcription start sites

Damian Labuda et al. Hum Mutat. 2007 May.

Abstract

It is likely that evolutionary differences among species are driven by sequence changes in regulatory regions. Likewise, polymorphisms in the promoter regions may be responsible for interindividual differences at the level of populations. We present an unbiased survey of genetic variation in 2-kb segments upstream of the transcription start sites of 28 protein-coding genes, characterized in five population groups of different geographic origin. On average, we found 9.1 polymorphisms and 8.8 haplotypes per segment with corresponding nucleotide and haplotype diversities of 0.082% and 58%, respectively. We characterized these segments through different summary statistics, Hardy-Weinberg equilibria fixation index (Fst) estimates, and neutrality tests, as well as by analyzing the distributions of haplotype allelic classes, introduced here to assess the departure from neutrality and examined by coalescent simulations under a simple population model, assuming recombinations or different demography. Our results suggest that genetic diversity in some of these regions could have been shaped by purifying selection and driven by adaptive changes in the other, thus explaining the relatively large variance in the corresponding genetic diversity indices loci. However, some of these effects could be also due to linkage with surrounding sequences, and the neutralists' explanations cannot be ruled out given uncertainty in the underlying demographic histories and the possibility of random effects due to the small size of the studied segments.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Distributions of allelic frequency classes (left panels) of frequencies of haplotypes [Middleton et al.,1993] and haplotype allelic classes (right) inCDC25A,CX3CR1, andGSRM3. Bars represent the observed values; lines represent theoretical distributions. The occupancy of allelic frequency classes corresponds to counts of sites represented by i new alleles in a sample of n chromosomes (i=1, 2, 3,…, n–1). Here, the theoretical curve (solid line) corresponds to the distribution calculated from the equation [Fan et al., 2002; Fu,1997] Si(i)Θ/i,using Θ/π (Table 1) as the estimator of Θ.The theoretical distribution (solid line) of haplotype frequencies expected given k observed haplotypes (Table 1) is according to Ewens [1972]. Haplotype names are arbitrary and correspond to their names in our database. In the case of haplotype allelic classes, regrouping haplotypes sharing the same number of mutations from the ancestral haplotype, their theoretical occupancy was obtained by coalescent simulation under the standard model, assuming constant population size without (solid line) and with (dotted line) recombination, at 10-fold the genomic average in the case of segments where crossovers were detected.
FIGURE 2
FIGURE 2
Frequencies of the ancestral haplotypes (A) and the distribution of major haplotypes among haplotype allelic classes (B) (cf. Fig.1) in 28 studied segments.The data (solid bars) are compared with theoretical expectations fromcoalescent simulations under the standard model in the absence (solid line) and in the presence (dashed line) of recombination at 10-fold the genomic average (10 cm/Mb). Simulations were for a sample size of 80 chromosomes, a mutation rate of 2.13×10−8 per bp per generation,corresponding to the average S density of 9.1, and N =10,000.

Similar articles

Cited by

  • Common variants of the BRCA1 wild-type allele modify the risk of breast cancer in BRCA1 mutation carriers.
    Cox DG, Simard J, Sinnett D, Hamdi Y, Soucy P, Ouimet M, Barjhoux L, Verny-Pierre C, McGuffog L, Healey S, Szabo C, Greene MH, Mai PL, Andrulis IL; Ontario Cancer Genetics Network; Thomassen M, Gerdes AM, Caligo MA, Friedman E, Laitman Y, Kaufman B, Paluch SS, Borg Å, Karlsson P, Askmalm MS, Bustinza GB; SWE-BRCA Collaborators; Nathanson KL, Domchek SM, Rebbeck TR, Benítez J, Hamann U, Rookus MA, van den Ouweland AM, Ausems MG, Aalfs CM, van Asperen CJ, Devilee P, Gille HJ; HEBON; EMBRACE; Peock S, Frost D, Evans DG, Eeles R, Izatt L, Adlard J, Paterson J, Eason J, Godwin AK, Remon MA, Moncoutier V, Gauthier-Villars M, Lasset C, Giraud S, Hardouin A, Berthet P, Sobol H, Eisinger F, Bressac de Paillerets B, Caron O, Delnatte C; GEMO Study Collaborators; Goldgar D, Miron A, Ozcelik H, Buys S, Southey MC, Terry MB; Breast Cancer Family Registry; Singer CF, Dressler AC, Tea MK, Hansen TV, Johannsson O, Piedmonte M, Rodriguez GC, Basil JB, Blank S, Toland AE, Montagna M, Isaacs C, Blanco I, Gayther SA, Moysich KB, Schmutzler RK, Wappenschmidt B, Engel C, Meindl A, Ditsch N, Arnold N, Niederacher D, Sutter C, Gadzicki D, Fiebig B, Caldes T, Laframboise R, Nevanlinna H, Chen X, Beesley J… See abstract for full author list ➔ Cox DG, et al. Hum Mol Genet. 2011 Dec 1;20(23):4732-47. doi: 10.1093/hmg/ddr388. Epub 2011 Sep 2. Hum Mol Genet. 2011. PMID: 21890493 Free PMC article.
  • iHDSel software: The price equation and the population stability index to detect genomic patterns compatible with selective sweeps. An example with SARS-CoV-2.
    Carvajal-Rodríguez A. Carvajal-Rodríguez A. Biol Methods Protoc. 2024 Nov 27;9(1):bpae089. doi: 10.1093/biomethods/bpae089. eCollection 2024. Biol Methods Protoc. 2024. PMID: 39679303 Free PMC article.
  • Haplotype allelic classes for detecting ongoing positive selection.
    Hussin J, Nadeau P, Lefebvre JF, Labuda D. Hussin J, et al. BMC Bioinformatics. 2010 Jan 28;11:65. doi: 10.1186/1471-2105-11-65. BMC Bioinformatics. 2010. PMID: 20109229 Free PMC article.
  • Fraction of informative recombinations: a heuristic approach to analyze recombination rates.
    Lefebvre JF, Labuda D. Lefebvre JF, et al. Genetics. 2008 Apr;178(4):2069-79. doi: 10.1534/genetics.107.082255. Genetics. 2008. PMID: 18430934 Free PMC article.

References

    1. Akey JM, Zhang G, Zhang K, Jin L, Shriver MD. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. 2002;12:1805–1814. - PMC - PubMed
    1. Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DA, Kruglyak L. Population history and natural selection shape patterns of genetic variation in 132 genes. PLoS Biol. 2004;2:e286. - PMC - PubMed
    1. Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16:37–48. - PubMed
    1. Belanger H, Beaulieu P, Moreau C, Labuda D, Hudson TJ, Sinnett D. Functional promoter SNPs in cell cycle checkpoint genes. Hum Mol Genet. 2005;14:2641–2648. - PubMed
    1. Bourgeois S, Labuda D. Dynamic allele-specific oligonucleotide hybridization on solid support (DASO) Anal Biochem. 2004;324:309–311. - PubMed

Publication types