Base compositional structure of genomes
- PMID: 1505943
- DOI: 10.1016/0888-7543(92)90019-o
Base compositional structure of genomes
Abstract
We model the base compositional structure of the human and Escherichia coli genomes. Three particular properties are first quantified: (1) There is a significant tendency for any region of either genome to have a strand-symmetric base composition. (2) The variation in base composition from region to region, within each genome, is very much larger than expected from common homogeneous stochastic models. (3) A given local base composition tends to persist over a scale of at least kilobases (E. coli) or tens of kilobases (human). Multidomain stochastic models from the literature are reviewed and sharpened. In particular, quantitative measurements of the third property lead us to suggest a significant shift in the style of domain models, in which the variation of A+T content with position is modeled by a random walk with frequent small steps rather than with large quantum jumps. As an application, we suggest a way to reduce the amount of computation in the assembly of large sequences from sequences of randomly chosen fragments.
Similar articles
-
Statistical scales of order in DNA.Biophys Chem. 2009 May;141(2-3):203-13. doi: 10.1016/j.bpc.2009.02.003. Epub 2009 Feb 20. Biophys Chem. 2009. PMID: 19254822
-
Understanding the differences between genome sequences of Escherichia coli B strains REL606 and BL21(DE3) and comparison of the E. coli B and K-12 genomes.J Mol Biol. 2009 Dec 11;394(4):653-80. doi: 10.1016/j.jmb.2009.09.021. Epub 2009 Sep 15. J Mol Biol. 2009. PMID: 19765592
-
Limitations of compositional approach to identifying horizontally transferred genes.J Mol Evol. 2001 Sep;53(3):244-50. doi: 10.1007/s002390010214. J Mol Evol. 2001. PMID: 11523011
-
Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?Front Cell Infect Microbiol. 2016 Nov 18;6:141. doi: 10.3389/fcimb.2016.00141. eCollection 2016. Front Cell Infect Microbiol. 2016. PMID: 27917373 Free PMC article. Review.
-
The vertebrate genome: isochores and evolution.Mol Biol Evol. 1993 Jan;10(1):186-204. doi: 10.1093/oxfordjournals.molbev.a039994. Mol Biol Evol. 1993. PMID: 8450755 Review.
Cited by
-
Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data.Algorithms Mol Biol. 2010 Jan 26;5:15. doi: 10.1186/1748-7188-5-15. Algorithms Mol Biol. 2010. PMID: 20205909 Free PMC article.
-
Copy-number-variation and copy-number-alteration region detection by cumulative plots.BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S67. doi: 10.1186/1471-2105-10-S1-S67. BMC Bioinformatics. 2009. PMID: 19208171 Free PMC article.
-
Simple sequence repeats in prokaryotic genomes.Proc Natl Acad Sci U S A. 2007 May 15;104(20):8472-7. doi: 10.1073/pnas.0702412104. Epub 2007 May 7. Proc Natl Acad Sci U S A. 2007. PMID: 17485665 Free PMC article.
-
Macronuclear genome structure of the ciliate Nyctotherus ovalis: single-gene chromosomes and tiny introns.BMC Genomics. 2008 Dec 5;9:587. doi: 10.1186/1471-2164-9-587. BMC Genomics. 2008. PMID: 19061489 Free PMC article.
-
Global features of sequences of bacterial chromosomes, plasmids and phages revealed by analysis of oligonucleotide usage patterns.BMC Bioinformatics. 2004 Jul 7;5:90. doi: 10.1186/1471-2105-5-90. BMC Bioinformatics. 2004. PMID: 15239845 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Research Materials