Proc Natl Acad Sci U S A. 1988 Dec;85(24):9630-4.
doi: 10.1073/pnas.85.24.9630.

Universal rule for coding sequence construction: TA/CG deficiency-TG/CT excess

S Ohno. Proc Natl Acad Sci U S A. 1988 Dec.

Abstract

Each coding sequence is a finite resource as to the number and composition of four bases. Accordingly, the excessive recurrence of one base oligomer entails the noticeable underrepresentation by the other, so that if the former is the same in most, if not all, of the coding sequences, the latter too must necessarily be the same in all. Indeed, a previous series of studies on 20-odd divergent coding sequences established CTG as one of the most frequently recurring base trimers (if not the most frequent), and this excess was compensated by the underrepresentation by CG and TA dimer-containing base trimers. In this study, I have analyzed three additional coding sequences and reanalyzed one previously studied coding sequence. These four, derived from man, a plant, and a fish, were of variously lopsided base compositions that were not at all conducive to high recurrences of either CT dimer or CT and TG. Yet, the excess of CT and TG dimers accompanied by complementary deficiency of CG and TA dimers emerged as the common rule. Thus, I propose the above as the universal rule of coding sequence construction. The underrepresentation by CG and TA dimers within coding sequences explains why regulatory signals in intergenic spacers are of two kinds: one, TA dimer rich; and the other, CG dimer rich.
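The kind of dimer excess and deficiency described in the abstract can be illustrated with a small sketch. It assumes one common measure, the ratio of observed overlapping dimer counts to the counts expected from the sequence's own base composition; the abstract does not state how the original analysis was computed, so this is not presented as the author's method. The function name `dimer_obs_exp` and the toy sequence are hypothetical and are not data from the paper.

```python
from collections import Counter


def dimer_obs_exp(seq):
    """Observed/expected ratio for each dimer, using overlapping windows.

    Assumption: expected counts come from the product of single-base
    frequencies in the same sequence. Input is assumed to contain only
    the letters A, C, G and T.
    """
    seq = seq.upper()
    n = len(seq)
    # Frequency of each base in this particular sequence.
    base_freq = {b: seq.count(b) / n for b in "ACGT"}
    # Count every overlapping dimer: positions 0-1, 1-2, 2-3, ...
    observed = Counter(seq[i:i + 2] for i in range(n - 1))
    windows = n - 1

    ratios = {}
    for first in "ACGT":
        for second in "ACGT":
            expected = base_freq[first] * base_freq[second] * windows
            # Skip dimers that cannot occur because a base is absent.
            if expected > 0:
                ratios[first + second] = observed[first + second] / expected
    return ratios


if __name__ == "__main__":
    # Made-up toy sequence, chosen only to exercise the function.
    toy = "CTGCTGCTGCTG"
    for dimer, ratio in sorted(dimer_obs_exp(toy).items()):
        print(dimer, round(ratio, 2))
```

Under this measure, a ratio above 1 marks a dimer that recurs more often than base composition alone predicts, and a ratio below 1 marks an underrepresented dimer. Applied to a real coding sequence, the abstract's rule would predict ratios above 1 for CT and TG and below 1 for CG and TA.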
