Identifying a high fraction of the human genome to be under selective constraint using GERP++
- PMID: 21152010
- PMCID: PMC2996323
- DOI: 10.1371/journal.pcbi.1001025
Identifying a high fraction of the human genome to be under selective constraint using GERP++
Abstract
Computational efforts to identify functional elements within genomes leverage comparative sequence information by looking for regions that exhibit evidence of selective constraint. One way of detecting constrained elements is to follow a bottom-up approach by computing constraint scores for individual positions of a multiple alignment and then defining constrained elements as segments of contiguous, highly scoring nucleotide positions. Here we present GERP++, a new tool that uses maximum likelihood evolutionary rate estimation for position-specific scoring and, in contrast to previous bottom-up methods, a novel dynamic programming approach to subsequently define constrained elements. GERP++ evaluates a richer set of candidate element breakpoints and ranks them based on statistical significance, eliminating the need for biased heuristic extension techniques. Using GERP++ we identify over 1.3 million constrained elements spanning over 7% of the human genome. We predict a higher fraction than earlier estimates largely due to the annotation of longer constrained elements, which improves one to one correspondence between predicted elements with known functional sequences. GERP++ is an efficient and effective tool to provide both nucleotide- and element-level constraint scores within deep multiple sequence alignments.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures







Similar articles
-
Distribution and intensity of constraint in mammalian genomic sequence.Genome Res. 2005 Jul;15(7):901-13. doi: 10.1101/gr.3577405. Epub 2005 Jun 17. Genome Res. 2005. PMID: 15965027 Free PMC article.
-
Population genetic models of GERP scores suggest pervasive turnover of constrained sites across mammalian evolution.PLoS Genet. 2020 May 29;16(5):e1008827. doi: 10.1371/journal.pgen.1008827. eCollection 2020 May. PLoS Genet. 2020. PMID: 32469868 Free PMC article.
-
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.Genome Res. 2007 Jun;17(6):760-74. doi: 10.1101/gr.6034307. Genome Res. 2007. PMID: 17567995 Free PMC article.
-
Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes.Nat Rev Genet. 2008 Apr;9(4):303-13. doi: 10.1038/nrg2185. Nat Rev Genet. 2008. PMID: 18347593 Review.
-
Computation and analysis of genomic multi-sequence alignments.Annu Rev Genomics Hum Genet. 2007;8:193-213. doi: 10.1146/annurev.genom.8.080706.092300. Annu Rev Genomics Hum Genet. 2007. PMID: 17489682 Review.
Cited by
-
wANNOVAR: annotating genetic variants for personal genomes via the web.J Med Genet. 2012 Jul;49(7):433-6. doi: 10.1136/jmedgenet-2012-100918. Epub 2012 Jun 20. J Med Genet. 2012. PMID: 22717648 Free PMC article.
-
dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.Hum Mutat. 2013 Sep;34(9):E2393-402. doi: 10.1002/humu.22376. Epub 2013 Jul 10. Hum Mutat. 2013. PMID: 23843252 Free PMC article.
-
Mutations in the NOTCH pathway regulator MIB1 cause left ventricular noncompaction cardiomyopathy.Nat Med. 2013 Feb;19(2):193-201. doi: 10.1038/nm.3046. Epub 2013 Jan 13. Nat Med. 2013. PMID: 23314057
-
Deleterious Variation in Natural Populations and Implications for Conservation Genetics.Annu Rev Anim Biosci. 2023 Feb 15;11:93-114. doi: 10.1146/annurev-animal-080522-093311. Epub 2022 Nov 4. Annu Rev Anim Biosci. 2023. PMID: 36332644 Free PMC article. Review.
-
Genetic Variation in Cardiomyopathy and Cardiovascular Disorders.Circ J. 2015;79(7):1409-15. doi: 10.1253/circj.CJ-15-0536. Epub 2015 Jun 4. Circ J. 2015. PMID: 26040335 Free PMC article. Review.
References
-
- The ENCODE Project Consortium. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science. 2004;306:636–640. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources