Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025:2883:281-297.
doi: 10.1007/978-1-0716-4290-0_12.

LncRNA Recognition-Associated CpG Island Detection and Methylation Analysis

Affiliations

LncRNA Recognition-Associated CpG Island Detection and Methylation Analysis

Yujie You et al. Methods Mol Biol. 2025.

Abstract

CpG islands (CGIs) are rare, interspersed DNA sequences, which possess a significant deviation from background genomic distribution by exhibiting patterns of GC-rich and CpG-rich sequence, the density of which provides a good classification feature for long noncoding RNA (lncRNA) promoters. By reviewing previous CpG-related studies, we consider that the transcription regulation of about half of the human genes, mostly housekeeping (HK) genes, involves CGIs, their methylation states, CpG spacing, and other chromosomal parameters. However, the precise CGI definition and positioning of CGIs within gene structures, as well as specific CGI-associated regulatory mechanisms, all remain to be elucidated at individual gene and gene family levels, together with consideration of species and lineage specificity. Although previous studies have already classified CGIs into high-CpG (HCGI), intermediate-CpG (ICGI), and low-CpG (LCGI) densities based on CpG density variation, the correlation between CGI density and gene expression regulation, such as co-regulation of CGIs and TATA-box on HK genes, is not clear. Here, we introduce such a problem-solving protocol for human genome annotation, which is based on a combination of GTEx, JBLA, and GO analysis. Next, we discuss why CGI-associated genes are most likely regulated by HCGI and tend to be HK genes; The HCGI/TATA± and LCGI/TATA± combinations show different GO enrichment, whereas the ICGI/TATA± combination is less characteristic than LCGI/TATA± based on GO enrichment analysis.

Keywords: CpG Island; Genome analysis; Genome annotation; LncRNA; Statistical genetics.

PubMed Disclaimer

References

    1. Yotsukura S, duVerle D, Hancock T, Natsume-Kitatani Y, Mamitsuka H (2017) Computational recognition for long non-coding RNA (lncRNA): Software and databases. Brief Bioinform 18(1):9–27 - DOI - PubMed
    1. Barquist L, Burge SW, Gardner PP (2012) Building non-coding RNA families [J]. Biomolecules
    1. Deaton AM, Bird A (2011) CpG islands and the regulation of transcription. Genes Dev 25(10):1010–1022 - DOI - PubMed - PMC
    1. Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, Simon I, Yakhini Z, Cedar H (2009) Developmental programming of CpG island methylation profiles in the human genome. Nat Struct Mol Biol 16(5):564–571 - DOI - PubMed
    1. Takahashi Y, Wu J, Suzuki K, Martinez-Redondo P, Li M, Liao H-K, Wu M-Z, Hernández-Benítez R, Hishida T, Shokhirev MN (2017) Integration of CpG-free DNA induces de novo methylation of CpG islands in pluripotent stem cells. Science 356(6337):503–508 - DOI - PubMed - PMC

Substances

LinkOut - more resources