Purifying selection in deeply conserved human enhancers is more consistent than in coding sequences
- PMID: 25062004
- PMCID: PMC4111549
- DOI: 10.1371/journal.pone.0103357
Purifying selection in deeply conserved human enhancers is more consistent than in coding sequences
Abstract
Comparison of polymorphism at synonymous and non-synonymous sites in protein-coding DNA can provide evidence for selective constraint. Non-coding DNA that forms part of the regulatory landscape presents more of a challenge since there is not such a clear-cut distinction between sites under stronger and weaker selective constraint. Here, we consider putative regulatory elements termed Conserved Non-coding Elements (CNEs) defined by their high level of sequence identity across all vertebrates. Some mutations in these regions have been implicated in developmental disorders; we analyse CNE polymorphism data to investigate whether such deleterious effects are widespread in humans. Single nucleotide variants from the HapMap and 1000 Genomes Projects were mapped across nearly 2000 CNEs. In the 1000 Genomes data we find a significant excess of rare derived alleles in CNEs relative to coding sequences; this pattern is absent in HapMap data, apparently obscured by ascertainment bias. The distribution of polymorphism within CNEs is not uniform; we could identify two categories of sites by exploiting deep vertebrate alignments: stretches that are non-variant, and those that have at least one substitution. The conserved category has fewer polymorphic sites and a greater excess of rare derived alleles, which can be explained by a large proportion of sites under strong purifying selection within humans--higher than that for non-synonymous sites in most protein coding regions, and comparable to that at the strongly conserved trans-dev genes. Conversely, the more evolutionarily labile CNE sites have an allele frequency distribution not significantly different from non-synonymous sites. Future studies should exploit genome-wide re-sequencing to obtain better coverage in selected non-coding regions, given the likelihood that mutations in evolutionarily conserved enhancer sequences are deleterious. Discovery pipelines should validate non-coding variants to aid in identifying causal and risk-enhancing variants in complex disorders, in contrast to the current focus on exome sequencing.
Conflict of interest statement
Figures



Similar articles
-
Parallel evolution of conserved non-coding elements that target a common set of developmental regulatory genes from worms to humans.Genome Biol. 2007;8(2):R15. doi: 10.1186/gb-2007-8-2-r15. Genome Biol. 2007. PMID: 17274809 Free PMC article.
-
Prioritizing sequence variants in conserved non-coding elements in the chicken genome using chCADD.PLoS Genet. 2020 Sep 23;16(9):e1009027. doi: 10.1371/journal.pgen.1009027. eCollection 2020 Sep. PLoS Genet. 2020. PMID: 32966296 Free PMC article.
-
Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals.BMC Evol Biol. 2014 Dec 19;14:261. doi: 10.1186/s12862-014-0261-5. BMC Evol Biol. 2014. PMID: 25523630 Free PMC article.
-
Extensive purifying selection acting on synonymous sites in HIV-1 Group M sequences.Virol J. 2008 Dec 23;5:160. doi: 10.1186/1743-422X-5-160. Virol J. 2008. PMID: 19105834 Free PMC article. Review.
-
[Research progress of conserved non-coding elements in metazoan].Yi Chuan. 2013 Jan;35(1):35-44. doi: 10.3724/sp.j.1005.2013.00035. Yi Chuan. 2013. PMID: 23357263 Review. Chinese.
Cited by
-
Conserved non-coding elements: developmental gene regulation meets genome organization.Nucleic Acids Res. 2017 Dec 15;45(22):12611-12624. doi: 10.1093/nar/gkx1074. Nucleic Acids Res. 2017. PMID: 29121339 Free PMC article. Review.
-
A site specific model and analysis of the neutral somatic mutation rate in whole-genome cancer data.BMC Bioinformatics. 2018 Apr 19;19(1):147. doi: 10.1186/s12859-018-2141-2. BMC Bioinformatics. 2018. PMID: 29673314 Free PMC article.
-
Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective.Front Genet. 2021 Jul 2;12:660899. doi: 10.3389/fgene.2021.660899. eCollection 2021. Front Genet. 2021. PMID: 34276769 Free PMC article. Review.
-
Genetic Analysis Based on Mitochondrial nad2 Gene Reveals a Recent Population Expansion of the Invasive Mussel, Mytella strigata, in China.Genes (Basel). 2023 Nov 3;14(11):2038. doi: 10.3390/genes14112038. Genes (Basel). 2023. PMID: 38002981 Free PMC article.
-
Characterization and functional analysis of conserved non-coding sequences among poaceae: insights into gene regulation and phenotypic variation in maize.BMC Genomics. 2025 Jan 20;26(1):46. doi: 10.1186/s12864-025-11221-9. BMC Genomics. 2025. PMID: 39833673 Free PMC article.
References
-
- Lettice LA, Heaney SJ, Purdie LA, Li L, de Beer P, et al. (2003) A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum Mol Genet 12: 1725–1735. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources