Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 1999;273(1-2):1-18.
doi: 10.1016/s0378-4371(99)00407-0.

Scaling features of noncoding DNA

Collaborators, Affiliations
Comparative Study

Scaling features of noncoding DNA

H E Stanley et al. Physica A. 1999.

Abstract

We review evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene, and utilize this fact to build a Coding Sequence Finder Algorithm, which uses statistical ideas to locate the coding regions of an unknown DNA sequence. Finally, we describe briefly some recent work adapting to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and reporting that noncoding regions in eukaryotes display a larger redundancy than coding regions. Specifically, we consider the possibility that this result is solely a consequence of nucleotide concentration differences as first noted by Bonhoeffer and his collaborators. We find that cytosine-guanine (CG) concentration does have a strong "background" effect on redundancy. However, we find that for the purine-pyrimidine binary mapping rule, which is not affected by the difference in CG concentration, the Shannon redundancy for the set of analyzed sequences is larger for noncoding regions compared to coding regions.

PubMed Disclaimer

Similar articles

  • Statistical and linguistic features of DNA sequences.
    Havlin S, Buldyrev SV, Goldberger AL, Mantegna RN, Peng CK, Simons M, Stanley HE. Havlin S, et al. Fractals. 1995 Jun;3(2):269-84. doi: 10.1142/s0218348x95000229. Fractals. 1995. PMID: 11539281
  • Statistical properties of DNA sequences.
    Peng CK, Buldyrev SV, Goldberger AL, Havlin S, Mantegna RN, Simons M, Stanley HE. Peng CK, et al. Physica A. 1995;221:180-92. doi: 10.1016/0378-4371(95)00247-5. Physica A. 1995. PMID: 11540495
  • Linguistic features of noncoding DNA sequences.
    Mantegna RN, Buldyrev SV, Goldberger AL, Havlin S, Peng CK, Simons M, Stanley HE. Mantegna RN, et al. Phys Rev Lett. 1994 Dec 5;73(23):3169-72. doi: 10.1103/PhysRevLett.73.3169. Phys Rev Lett. 1994. PMID: 10057305
  • Scaling in nature: from DNA through heartbeats to weather.
    Havlin S, Buldyrev SV, Bunde A, Goldberger AL, Ivanov PCh, Peng CK, Stanley HE. Havlin S, et al. Physica A. 1999 Nov 1;273(1-2):46-69. doi: 10.1016/s0378-4371(99)00340-4. Physica A. 1999. PMID: 11543356 Review.
  • Fractals in biology and medicine.
    Havlin S, Buldyrev SV, Goldberger AL, Mantegna RN, Ossadnik SM, Peng CK, Simons M, Stanley HE. Havlin S, et al. Chaos Solitons Fractals. 1995;6:171-201. doi: 10.1016/0960-0779(95)80025-c. Chaos Solitons Fractals. 1995. PMID: 11539852 Review.

Cited by

Publication types

LinkOut - more resources