Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1999 Nov 23;96(24):14055-60.
doi: 10.1073/pnas.96.24.14055.

In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species

Affiliations

In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species

J H Graber et al. Proc Natl Acad Sci U S A. .

Abstract

We have investigated mRNA 3'-end-processing signals in each of six eukaryotic species (yeast, rice, arabidopsis, fruitfly, mouse, and human) through the analysis of more than 20,000 3'-expressed sequence tags. The use and conservation of the canonical AAUAAA element vary widely among the six species and are especially weak in plants and yeast. Even in the animal species, the AAUAAA signal does not appear to be as universal as indicated by previous studies. The abundance of single-base variants of AAUAAA correlates with their measured processing efficiencies. As found previously, the plant polyadenylation signals are more similar to those of yeast than to those of animals, with both common content and arrangement of the signal elements. In all species examined, the complete polyadenylation signal appears to consist of an aggregate of multiple elements. In light of these and previous results, we present a broadened concept of 3'-end-processing signals in which no single exact sequence element is universally required for processing. Rather, the total efficiency is a function of all elements and, importantly, an inefficient word in one element can be compensated for by strong words in other elements. These complex patterns indicate that effective tools to identify 3'-end-processing signals will require more than consensus sequence identification.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Single-nucleotide frequencies preceding the 3′-end-processing sites as determined by alignment of 3′-ESTs generated from yeast (1,352), rice (1,246), arabidopsis (4,069), fruitfly (3,236), mouse (6,029), and human (4,427) cDNAs. Positions are given relative to the putative 3′-end-processing site. (A) Sequences aligned on the 3′-most end of the ESTs. (B) Sequences aligned to a 6-nt profile, via an iterative procedure as described in Methods.
Figure 2
Figure 2
Single-nucleotide frequencies that describe the two principal signal elements of the fruitfly 3′-end-processing signal, as determined by analysis of 882 ESTs that were successfully aligned to contig sequences. By using the iterative profiling procedure defined in Methods, the sequences were successively aligned on (A) the PE (approximate position −20) and (B) DE (approximate position +14).
Figure 3
Figure 3
A comparison of the occurrences (in 327 contig-aligned fruitfly EST sequences that did not contain AAUAAA) with in vitro measured [in human HeLa cells (21)] processing efficiencies of sequences similar to the canonical PE AAUAAA. The processing efficiencies are plotted as a percentage of the efficiency of AAUAAA, whereas the occurrences are plotted as a fraction of the total number of sequences examined.
Figure 4
Figure 4
Graphical representations of the proposed alignment of 3′-end-processing signals for (A) yeast, (B) plants, and (C) animals, including the most common words found for each subelement.

References

    1. Trifonov E N. Comput Appl Biosci. 1996;12:423–429. - PubMed
    1. Graber J H, Cantor C R, Mohr S C, Smith T F. Nucleic Acids Res. 1999;27:888–894. - PMC - PubMed
    1. Zhao J, Kessler M M, Moore C L. J Biol Chem. 1997;272:10831–10838. - PubMed
    1. Stumpf G, Domdey H. Science. 1996;274:1517–1520. - PubMed
    1. Preker P J, Ohnacker M, Minvielle-Sebastia L, Keller W. EMBO J. 1997;16:4727–4737. - PMC - PubMed

Publication types