CpG island mapping by epigenome prediction
- PMID: 17559301
- PMCID: PMC1892605
- DOI: 10.1371/journal.pcbi.0030110
CpG island mapping by epigenome prediction
Abstract
CpG islands were originally identified by epigenetic and functional properties, namely, absence of DNA methylation and frequent promoter association. However, this concept was quickly replaced by simple DNA sequence criteria, which allowed for genome-wide annotation of CpG islands in the absence of large-scale epigenetic datasets. Although widely used, the current CpG island criteria incur significant disadvantages: (1) reliance on arbitrary threshold parameters that bear little biological justification, (2) failure to account for widespread heterogeneity among CpG islands, and (3) apparent lack of specificity when applied to the human genome. This study is driven by the idea that a quantitative score of "CpG island strength" that incorporates epigenetic and functional aspects can help resolve these issues. We construct an epigenome prediction pipeline that links the DNA sequence of CpG islands to their epigenetic states, including DNA methylation, histone modifications, and chromatin accessibility. By training support vector machines on epigenetic data for CpG islands on human Chromosomes 21 and 22, we identify informative DNA attributes that correlate with open versus compact chromatin structures. These DNA attributes are used to predict the epigenetic states of all CpG islands genome-wide. Combining predictions for multiple epigenetic features, we estimate the inherent CpG island strength for each CpG island in the human genome, i.e., its inherent tendency to exhibit an open and transcriptionally competent chromatin structure. We extensively validate our results on independent datasets, showing that the CpG island strength predictions are applicable and informative across different tissues and cell types, and we derive improved maps of predicted "bona fide" CpG islands. The mapping of CpG islands by epigenome prediction is conceptually superior to identifying CpG islands by widely used sequence criteria since it links CpG island detection to their characteristic epigenetic and functional states. And it is superior to purely experimental epigenome mapping for CpG island detection since it abstracts from specific properties that are limited to a single cell type or tissue. In addition, using computational epigenetics methods we could identify high correlation between the epigenome and characteristics of the DNA sequence, a finding which emphasizes the need for a better understanding of the mechanistic links between genome and epigenome.
Conflict of interest statement
Figures






Similar articles
-
Predicting methylation status of CpG islands in the human brain.Bioinformatics. 2006 Sep 15;22(18):2204-9. doi: 10.1093/bioinformatics/btl377. Epub 2006 Jul 12. Bioinformatics. 2006. PMID: 16837523
-
The mammalian epigenome.Cell. 2007 Feb 23;128(4):669-81. doi: 10.1016/j.cell.2007.01.033. Cell. 2007. PMID: 17320505 Review.
-
DNA motifs associated with aberrant CpG island methylation.Genomics. 2006 May;87(5):572-9. doi: 10.1016/j.ygeno.2005.12.016. Epub 2006 Feb 17. Genomics. 2006. PMID: 16487676
-
Gene-Ontology analysis reveals association of tissue-specific 5' CpG-island genes with development and embryogenesis.Hum Mol Genet. 2004 Sep 1;13(17):1969-78. doi: 10.1093/hmg/ddh207. Epub 2004 Jul 14. Hum Mol Genet. 2004. PMID: 15254011
-
Action at a distance: epigenetic silencing of large chromosomal regions in carcinogenesis.Hum Mol Genet. 2007 Apr 15;16 Spec No 1:R88-95. doi: 10.1093/hmg/ddm051. Hum Mol Genet. 2007. PMID: 17613553 Review.
Cited by
-
No Evidence for a Parent-of-Origin Specific Differentially Methylated Region Linked to RASGRF1.Front Genet. 2012 Mar 28;3:41. doi: 10.3389/fgene.2012.00041. eCollection 2012. Front Genet. 2012. PMID: 22470404 Free PMC article.
-
Integrated genetic and epigenetic analysis identifies three different subclasses of colon cancer.Proc Natl Acad Sci U S A. 2007 Nov 20;104(47):18654-9. doi: 10.1073/pnas.0704652104. Epub 2007 Nov 14. Proc Natl Acad Sci U S A. 2007. PMID: 18003927 Free PMC article.
-
Predicting DNA-binding specificities of eukaryotic transcription factors.PLoS One. 2010 Nov 30;5(11):e13876. doi: 10.1371/journal.pone.0013876. PLoS One. 2010. PMID: 21152420 Free PMC article.
-
Tight associations between transcription promoter type and epigenetic variation in histone positioning and modification.BMC Genomics. 2011 Aug 17;12:416. doi: 10.1186/1471-2164-12-416. BMC Genomics. 2011. PMID: 21846408 Free PMC article.
-
Orphan CpG islands define a novel class of highly active enhancers.Epigenetics. 2017 Jun 3;12(6):449-464. doi: 10.1080/15592294.2017.1297910. Epub 2017 Apr 27. Epigenetics. 2017. PMID: 28448736 Free PMC article.
References
-
- Bird A. DNA methylation patterns and epigenetic memory. Genes Dev. 2002;16:6–21. - PubMed
-
- Caiafa P, Zampieri M. DNA methylation and chromatin structure: The puzzling CpG islands. J Cell Biochem. 2005;94:257–265. - PubMed
-
- Bird AP. CpG-rich islands and the function of DNA methylation. Nature. 1986;321:209–213. - PubMed
-
- Laird PW. Cancer epigenetics. Hum Mol Genet. 2005;14:R65–R76. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases