Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;9(1):e1002866.
doi: 10.1371/journal.pcbi.1002866. Epub 2013 Jan 31.

Ribosome traffic on mRNAs maps to gene ontology: genome-wide quantification of translation initiation rates and polysome size regulation

Affiliations

Ribosome traffic on mRNAs maps to gene ontology: genome-wide quantification of translation initiation rates and polysome size regulation

Luca Ciandrini et al. PLoS Comput Biol. 2013.

Abstract

To understand the complex relationship governing transcript abundance and the level of the encoded protein, we integrate genome-wide experimental data of ribosomal density on mRNAs with a novel stochastic model describing ribosome traffic dynamics during translation elongation. This analysis reveals that codon arrangement, rather than simply codon bias, has a key role in determining translational efficiency. It also reveals that translation output is governed both by initiation efficiency and elongation dynamics. By integrating genome-wide experimental data sets with simulation of ribosome traffic on all Saccharomyces cerevisiae ORFs, mRNA-specific translation initiation rates are for the first time estimated across the entire transcriptome. Our analysis identifies different classes of mRNAs characterised by their initiation rates, their ribosome traffic dynamics, and by their response to ribosome availability. Strikingly, this classification based on translational dynamics maps onto key gene ontological classifications, revealing evolutionary optimisation of translation responses to be strongly influenced by gene function.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. The model.
Particles representing ribosomes move along a unidimensional lattice (the mRNA chain) in which each site represents a codon. For the sake of illustration, in the sketch a particle covers 3 codons, while in the model we considered particles occupying 9 codons . (A) Schematic representation of ribosome dynamics: along the mRNA, ribosomes with the A site on codon formula image capture the cognate tRNA with a rate formula image, then keep it and advance with a rate formula image, provided that the following codon is empty. (B) The entire translation process can be viewed as particles moving on a lattice. Ribosomes attempt to initiate the translation with a rate formula image. Then they move according to the dynamical rules introduced above and at the end of the lattice the ribosomes detach with a termination rate formula image. Particles can queue if the bottlenecks in the lattice cannot support the incoming flow.
Figure 2
Figure 2. Distribution of the estimated initiation rates in S. cerevisiae.
The mean initiation rate is formula image and the median is formula image. Most of the mRNAs have an estimated initiation rate formula image and therefore we show only this range.
Figure 3
Figure 3. Scatter plot of the ORF lengths against the estimated 's.
The log-log scatter plot shows possible signatures of a power-law dependence.
Figure 4
Figure 4. Outcomes for some mRNAs obtained by stochastic simulations of the model.
Panels (A) and (C) show a sketch of the two different behaviours one can obtain for the density of ribosomes formula image and the current formula image, respectively. The genes are divided in two categories, according to the shape of formula image, as shown in (A): abrupt mRNAs (red, colour online) present a steep increase of the polysome size with increasing the initiation rate. On the other and, smooth sequences (blue, colour online) do not show this feature. The current (C) is also affected, with abrupt genes exhibiting a sudden change, or ‘kink’ in the current, while the current of smooth mRNAs does not suddenly saturate. Panels (B) and (D) show the outcome of numerical simulations of real sequences from S. cerevisiae. Genes YGL103W and YBL027W are ribosomal proteins while YHR030C and YBL105C are kinase regulatory proteins. formula image indicates the saturation value of the current (see text).
Figure 5
Figure 5. Scatter plot of the estimated initiation rates versus the slope of the protein production rate evaluated at .
Both distributions of initiation rates and slopes have been subdivided in quartiles (dashed lines), defining 16 regions. Boxed annotations indicate those GO categories that are overrepresented in each quartile sector (P; GO process, F; function, C; component) with the P-value indicated as a power 10 exponent (E). The enrichments in each region are indicated in the square brackets as xx/yy, where ‘xx’ is the number of genes with that specific annotation and ‘yy’ the total number of genes in the region.
Figure 6
Figure 6. Normalised histogram of the simulated protein production rates of the YPL106C randomised ensemble.
We constructed the randomised ensemble by shuffling the YPL106C codon choice at each sequence position, generating 2,000 different variants, each time keeping the amino acid sequence and overall codon composition constant. For example, for the chosen gene, CAI = 0.521. The value of the chosen initiation is formula image.
Figure 7
Figure 7. Scatter plots of different estimators of protein production rates.
(A) formula image versus abundance of proteins. The mRNA abundances are from and the experimentally measured protein levels from . The plot shows a clear correlation between the model prediction of the amount of proteins in the cell and the experimental values. (B) CAI from versus protein abundance. (C) and (D) show different variants of the tRNA adaptation index, tAIc and tAIp from , vs protein abundance. Our approach yields a better correlation between the predicted and measured protein abundance.
Figure 8
Figure 8. Initiation rate: summary of the findings.
(A) For a given ‘physiological’ number of ribosomes formula image we found mRNA-specific initiation rates, distributed over a broad range of values (Figure 2). Different regions of the distributions can be mapped to certain GO annotations. For example, mRNAs with small physiological initiation rate formula image are regulatory proteins while genes involved in translation have a larger initiation rate. (B) Changes in initiation (induced, for instance, by variations in the ribosomal pool, e.g. available ribosomes increase to a value of formula image) are estimated by our modelling and theoretically perceived by the transcript in different ways, according to their current-initiation relationship formula image. In particular, some mRNAs have a large gearing factor formula image, such as regulatory proteins, while other messengers, such as translation associated ones, are less sensitive to changes of the initiation rate. (C) For very large initiation rates the protein production rates reach a maximal elongation-limited value, i.e. only depending on the sequence of codons. We discover that translation associated genes have a larger maximal production rate when compared to other mRNAs, such as regulatory proteins, whose production might need to be capped. (D) In general we find two main groups of sequences classified according to their current-initiation relationship formula image. Abrupt sequences, usually regulatory proteins, present an abrupt ‘kink’ in formula image, meaning that the protein production rate can quickly saturate above specific values (sequence-dependent) of the initiation rate. Genes involved in translation like ribosomal proteins are instead classified as smooth sequences, since their sequences are such that this abrupt crossover does not exist.

Similar articles

Cited by

References

    1. Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, et al. (1998) Comprehensive identification of cell cycle-regulated genes of the yeast saccharomyces cerevisiae by microarray hybridization. Molecular Biology of the Cell 9: 3273–3297. - PMC - PubMed
    1. Shalon D, Smith SJ, Brown PO (1996) A DNA microarray system for analyzing complex DNA samples using two-color uorescent probe hybridization. Genome Research 6: 639–645. - PubMed
    1. Ideker T, Thorsson V, Ranish JA, Christmas R, Buhler J, et al. (2001) Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. Science 292: 929–934. - PubMed
    1. Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, et al. (2011) Global quantification of mammalian gene expression control. Nature 473: 337–342. - PubMed
    1. Ghazalpour A, Bennett B, Petyuk VA, Orozco L, Hagopian R, et al. (2011) Comparative analysis of proteome and transcriptome variation in mouse. PLoS Genet 7: e1001393. - PMC - PubMed

Publication types

LinkOut - more resources