A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters
- PMID: 16432200
- PMCID: PMC1345710
- DOI: 10.1073/pnas.0510310103
A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters
Abstract
A striking feature of the human genome is the dearth of CpG dinucleotides (CpGs) interrupted occasionally by CpG islands (CGIs), regions with relatively high content of the dinucleotide. CGIs are generally associated with promoters; genes, whose promoters are especially rich in CpG sequences, tend to be expressed in most tissues. However, all working definitions of what constitutes a CGI rely on ad hoc thresholds. Here we adopt a direct and comprehensive survey to identify the locations of all CpGs in the human genome and find that promoters segregate naturally into two classes by CpG content. Seventy-two percent of promoters belong to the class with high CpG content (HCG), and 28% are in the class whose CpG content is characteristic of the overall genome (low CpG content). The enrichment of CpGs in the HCG class is symmetric and peaks around the core promoter. The broad-based expression of the HCG promoters is not a consequence of a correlation with CpG content because within the HCG class the breadth of expression is independent of the CpG content. The overall depletion of CpGs throughout the genome is thought to be a consequence of the methylation of some germ-line CpGs and their susceptibility to mutation. A comparison of the frequencies of inferred deamination mutations at CpG and GpC dinucleotides in the two classes of promoters using SNPs in human-chimpanzee sequence alignments shows that CpGs mutate at a lower frequency in the HCG promoters, suggesting that CpGs in the HCG class are hypomethylated in the germ line.
Figures



Similar articles
-
CpGcluster: a distance-based algorithm for CpG-island detection.BMC Bioinformatics. 2006 Oct 12;7:446. doi: 10.1186/1471-2105-7-446. BMC Bioinformatics. 2006. PMID: 17038168 Free PMC article.
-
CpG mutation rates in the human genome are highly dependent on local GC content.Mol Biol Evol. 2005 Mar;22(3):650-8. doi: 10.1093/molbev/msi043. Epub 2004 Nov 10. Mol Biol Evol. 2005. PMID: 15537806
-
Large-scale human promoter mapping using CpG islands.Nat Genet. 2000 Sep;26(1):61-3. doi: 10.1038/79189. Nat Genet. 2000. PMID: 10973249
-
CpG islands--'a rough guide'.FEBS Lett. 2009 Jun 5;583(11):1713-20. doi: 10.1016/j.febslet.2009.04.012. Epub 2009 Apr 18. FEBS Lett. 2009. PMID: 19376112 Review.
-
From the margins of the genome: mobile elements shape primate evolution.Bioessays. 2005 Aug;27(8):785-94. doi: 10.1002/bies.20268. Bioessays. 2005. PMID: 16015599 Review.
Cited by
-
Universal prediction of vertebrate species age at maturity.Commun Biol. 2024 Oct 30;7(1):1414. doi: 10.1038/s42003-024-07046-z. Commun Biol. 2024. PMID: 39478142 Free PMC article.
-
Multi-omics integration strategies for animal epigenetic studies - A review.Anim Biosci. 2021 Aug;34(8):1271-1282. doi: 10.5713/ab.21.0042. Epub 2021 Apr 23. Anim Biosci. 2021. PMID: 33902167 Free PMC article.
-
Lower DNA methylation levels in CpG island shores of CR1, CLU, and PICALM in the blood of Japanese Alzheimer's disease patients.PLoS One. 2020 Sep 29;15(9):e0239196. doi: 10.1371/journal.pone.0239196. eCollection 2020. PLoS One. 2020. PMID: 32991610 Free PMC article.
-
Reconstructing the demographic history of the human lineage using whole-genome sequences from human and three great apes.Genome Biol Evol. 2012;4(11):1133-45. doi: 10.1093/gbe/evs075. Genome Biol Evol. 2012. PMID: 22975719 Free PMC article.
-
Male germline transmits fetal alcohol epigenetic marks for multiple generations: a review.Addict Biol. 2016 Jan;21(1):23-34. doi: 10.1111/adb.12186. Epub 2015 Jan 12. Addict Biol. 2016. PMID: 25581210 Free PMC article. Review.
References
-
- Reik, W., Dean, W. & Walter, J. (2001) Science 293, 1089-1093. - PubMed
-
- Fazzari, M. J. & Greally, J. M. (2004) Nat. Rev. Genet. 5, 446-455. - PubMed
-
- Robertson, K. D. & Wolffe, A. P. (2000) Nat. Rev. Genet. 1, 11-19. - PubMed
-
- Singal, R. & Ginder, G. D. (1999) Blood 93, 4059-4070. - PubMed
-
- Bird, A. (2002) Genes Dev. 16, 6-21. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical