Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Oct;124(1):e67.
doi: 10.1002/cpmb.67. Epub 2018 Sep 4.

RibORF: Identifying Genome-Wide Translated Open Reading Frames Using Ribosome Profiling

Affiliations

RibORF: Identifying Genome-Wide Translated Open Reading Frames Using Ribosome Profiling

Zhe Ji. Curr Protoc Mol Biol. 2018 Oct.

Abstract

Ribosome profiling identifies RNA fragments associated with translating ribosomes. The technology provides an opportunity to examine genome-wide translation events at single-nucleotide resolution and in an unbiased manner. Here I present a computational pipeline named RibORF to systematically identify translated open reading frames (ORFs), based on read distribution features representing active translation, including 3-nt periodicity and uniformness across codons. Analyses using the computational tool revealed pervasive translation in putative 'noncoding' regions, such as lncRNAs, pseudogenes, and 5'UTRs. The computational tool is useful for studying functional roles of non-canonical translation events in various biological processes. © 2018 by John Wiley & Sons, Inc.

Keywords: noncoding RNA; open reading frame; ribosome profiling; translation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
For candidate ORFs with ribosome profiling reads, the read distribution features can separate them into three groups, including actively translated ORFs, off-frame ORFs, and non-ribosomal protein-RNA complex binding.
Figure 2
Figure 2
The outline of the RibORF algorithm.
Figure 3
Figure 3
Types of candidate ORFs, defined based on transcript types and ORF locations in reference transcripts.
Figure 4
Figure 4
Check read distribution around the start and stop codons of canonical ORFs, and correct read locations based on offset distances between 5′ end of fragments and ribosomal A-sites. (A) Ribosome profiling reads were grouped based on fragment length (i.e. 28nt, 29nt and 30nt). The plots show the distribution of 5′ end of read fragments around the start and stop codons of canonical ORFs of mRNAs. The dataset shows high quality and clear 3-nt periodicity. The summary statistics of read fragments were shown in the box. (B) An example of low quality ribosome profiling dataset, which does not show obvious 3-nt periodicity. (C) Distribution of offset corrected read distribution around the start and stop codons of canonical ORFs.
Figure 5
Figure 5
Example outputs of RibORF program and algorithm performance evaluation. (A) Example candidate ORFs with training parameters and predicted translated P-values. These values were shown in the files “pred.pvalue.parameters.txt” and “repre.valid.pred.pvalue.parameters.txt”. (B) The predicted translated P-value cutoffs and associated statistics of true positive, false positive, true negative, false negative, false positive rate and true positive rate, as shown in the “stat.cutoff.txt” file. (C) The ROC curve showing the performance of the RibORF program in identifying translated ORFs. The plot was shown in the “plot.ROC.curve.pdf”. The AUC value was shown in the plot.

Similar articles

Cited by

References

LITERATURE CITED

    1. Anderson DM, Anderson KM, Chang CL, Makarewich CA, Nelson BR, McAnally JR, Kasaragod P, Shelton JM, Liou J, Bassel-Duby R, et al. A micropeptide encoded by a putative long noncoding RNA regulates muscle performance. Cell. 2015;160:595–606. - PMC - PubMed
    1. Barbosa C, Peixeiro I, Romao L. Gene expression regulation by upstream open reading frames and human disease. PLoS genetics. 2013;9:e1003529. - PMC - PubMed
    1. Calviello L, Mukherjee N, Wyler E, Zauber H, Hirsekorn A, Selbach M, Landthaler M, Obermayer B, Ohler U. Detecting actively translated open reading frames in ribosome profiling data. Nature methods. 2016;13:165–170. - PubMed
    1. Chun SY, Rodriguez CM, Todd PK, Mills RE. SPECtre: a spectral coherence--based classifier of actively translated transcripts from ribosome profiling sequence data. BMC Bioinformatics. 2016;17:482. - PMC - PubMed
    1. Clamp M, Fry B, Kamal M, Xie X, Cuff J, Lin MF, Kellis M, Lindblad-Toh K, Lander ES. Distinguishing protein-coding and noncoding genes in the human genome. Proc Natl Acad Sci U S A. 2007;104:19428–19433. - PMC - PubMed

KEY REFERENCE

    1. RibORF program was originally published in: Ji Z, Song R, Regev A, Struhl K. Many lncRNAs, 5′UTRs, and pseudogenes are translated and some are likely to express functional proteins. Elife. 2015;4:e08890.

    1. The Following paper can help to understand ribosome profiling technology: Ji Z, Song R, Huang H, Regev A, Struhl K. Transcriptome-scale RNase-footprinting of RNA-protein complexes. Nat Biotechnol. 2016;34:410–413.

Publication types

MeSH terms

Substances

LinkOut - more resources