Applications of recursive segmentation to the analysis of DNA sequences
- PMID: 12144178
- DOI: 10.1016/s0097-8485(02)00010-4
Applications of recursive segmentation to the analysis of DNA sequences
Abstract
Recursive segmentation is a procedure that partitions a DNA sequence into domains with a homogeneous composition of the four nucleotides A, C, G and T. This procedure can also be applied to any sequence converted from a DNA sequence, such as to a binary strong(G + C)/weak(A + T) sequence, to a binary sequence indicating the presence or absence of the dinucleotide CpG, or to a sequence indicating both the base and the codon position information. We apply various conversion schemes in order to address the following five DNA sequence analysis problems: isochore mapping, CpG island detection, locating the origin and terminus of replication in bacterial genomes, finding complex repeats in telomere sequences, and delineating coding and noncoding regions. We find that the recursive segmentation procedure can successfully detect isochore borders, CpG islands, and the origin and terminus of replication, but it needs improvement for detecting complex repeats as well as borders between coding and noncoding regions.
Similar articles
-
Characteristic enrichment of DNA repeats in different genomes.Proc Natl Acad Sci U S A. 1997 May 13;94(10):5237-42. doi: 10.1073/pnas.94.10.5237. Proc Natl Acad Sci U S A. 1997. PMID: 9144221 Free PMC article.
-
The over-representation of binary DNA tracts in seven sequenced chromosomes.BMC Genomics. 2004 Mar 3;5(1):19. doi: 10.1186/1471-2164-5-19. BMC Genomics. 2004. PMID: 15113401 Free PMC article.
-
Comprehensive analysis of CpG islands in human chromosomes 21 and 22.Proc Natl Acad Sci U S A. 2002 Mar 19;99(6):3740-5. doi: 10.1073/pnas.052410099. Epub 2002 Mar 12. Proc Natl Acad Sci U S A. 2002. PMID: 11891299 Free PMC article.
-
Methods for identification of epigenetic elements in mammalian long multigenic genome sequences.Biochemistry (Mosc). 2007 Jun;72(6):589-94. doi: 10.1134/s0006297907060016. Biochemistry (Mosc). 2007. PMID: 17630903 Review.
-
A review of computational algorithms for CpG islands detection.J Biosci. 2019 Dec;44(6):143. J Biosci. 2019. PMID: 31894124 Review.
Cited by
-
Interpreting genomic data via entropic dissection.Nucleic Acids Res. 2013 Jan 7;41(1):e23. doi: 10.1093/nar/gks917. Epub 2012 Oct 3. Nucleic Acids Res. 2013. PMID: 23036836 Free PMC article.
-
CpG island mapping by epigenome prediction.PLoS Comput Biol. 2007 Jun;3(6):e110. doi: 10.1371/journal.pcbi.0030110. Epub 2007 May 2. PLoS Comput Biol. 2007. PMID: 17559301 Free PMC article.
-
Detection of genomic islands via segmental genome heterogeneity.Nucleic Acids Res. 2009 Sep;37(16):5255-66. doi: 10.1093/nar/gkp576. Epub 2009 Jul 9. Nucleic Acids Res. 2009. PMID: 19589805 Free PMC article.
-
Calling differentially methylated regions from whole genome bisulphite sequencing with DMRcate.Nucleic Acids Res. 2021 Nov 8;49(19):e109. doi: 10.1093/nar/gkab637. Nucleic Acids Res. 2021. PMID: 34320181 Free PMC article.
-
Characterisation of inactivation domains and evolutionary strata in human X chromosome through Markov segmentation.PLoS One. 2009 Nov 25;4(11):e7885. doi: 10.1371/journal.pone.0007885. PLoS One. 2009. PMID: 19946363 Free PMC article.
Publication types
MeSH terms
Associated data
- Actions
- Actions
- Actions
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases