Biological origins of long-range correlations and compositional variations in DNA
- PMID: 8255772
- PMCID: PMC310632
- DOI: 10.1093/nar/21.22.5167
Abstract
The occurrence of certain long-range correlations between nucleotides in DNA sequences of living organisms has recently been reported. The biological origin of these correlations was unknown. The correlations were proposed to be related to fractal structure and to differences between intron-containing and intron-less sequences. We and others have reported that no consistent difference exists between intron-containing and intron-less sequences. In agreement with this, we demonstrate here that the long-range correlations are trivially equivalent to the varying ratio R between pyrimidines and purines (or any other nucleotide combinations) in different regions of a DNA sequence. Moreover, we show that this variation of R has simple biological explanations: differences in base composition occur along most DNA sequences and are associated with (i) simple repeats, (ii) differences in codon composition (due to the amino acid composition of the encoded protein), (iii) changes in the direction of transcription (and thus also translation), and (iv) differences between protein- and rRNA-encoding segments. Seven biological examples are given.
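As a rough illustration of the quantity R discussed in the abstract, the following Python sketch computes the pyrimidine/purine ratio in consecutive, non-overlapping windows of a sequence. The function name `windowed_ratio`, the window size, and the toy sequence are assumptions made for this example and are not taken from the paper; the authors' actual procedure may differ.

```python
from typing import List

PYRIMIDINES = set("CT")
PURINES = set("AG")


def windowed_ratio(seq: str, window: int) -> List[float]:
    """Return the pyrimidine/purine ratio R for consecutive,
    non-overlapping windows of length `window` (hypothetical helper)."""
    if window <= 0:
        raise ValueError("window must be positive")
    seq = seq.upper()
    ratios = []
    # A trailing partial window is ignored so that every R uses the same length.
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        n_pyr = sum(base in PYRIMIDINES for base in chunk)
        n_pur = sum(base in PURINES for base in chunk)
        # R is undefined when a window has no purines; report infinity.
        ratios.append(n_pyr / n_pur if n_pur else float("inf"))
    return ratios


# Toy sequence (invented): a pyrimidine-rich stretch followed by a purine-rich one.
toy = "CTCTCTCTCTAG" + "AGAGAGAGAGCT"
print(windowed_ratio(toy, 12))  # [5.0, 0.2]
```

A sequence whose windows show markedly different R values, as in this toy case, has regional compositional variation of the kind the abstract identifies with the reported long-range correlations.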