Review
Curr Protein Pept Sci. 2011 Sep;12(6):503-7.
doi: 10.2174/138920311796957667.

Small open reading frames: current prediction techniques and future prospect

Haoyu Cheng et al. Curr Protein Pept Sci. 2011 Sep.

Abstract

Evidence is accumulating that small open reading frames (sORFs, <100 codons) play key roles in many important biological processes. Yet they are generally ignored in gene annotation, even though they are far more abundant than genes with more than 100 codons. Here, we demonstrate that popular homology-search and codon-index techniques perform markedly worse for small genes than for larger genes, while a method dedicated to sORF discovery achieves a level of accuracy similar to that of homology search. This result is largely due to the small dataset of experimentally verified sORFs available for homology search and for training ab initio techniques. It highlights the urgent need for both experimental and computational studies to further improve the accuracy of sORF prediction.
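
To make the <100-codon definition concrete, the following is a minimal Python sketch of how candidate sORFs might be enumerated from a nucleotide sequence. It is not the method evaluated in the review. It assumes an ATG start codon, the standard stop codons, and the forward strand only; the function name `find_sorfs` and the `min_codons` cutoff are hypothetical choices made for illustration.

```python
# Illustrative sketch only: enumerate forward-strand ORFs shorter than 100 codons.
# Assumptions (not from the source): ATG start, standard stops, forward strand,
# and a hypothetical lower bound (min_codons) to skip trivially short hits.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, max_codons=100, min_codons=10):
    """Return (start, end, codon_count) tuples for candidate sORFs.

    start/end are 0-based, end-exclusive, and include the stop codon.
    codon_count counts the start codon but not the stop codon.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None  # position of the first ATG seen since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if min_codons <= n_codons < max_codons:
                    hits.append((start, i + 3, n_codons))
                start = None  # resume scanning after this stop codon
    return hits


if __name__ == "__main__":
    toy = "CC" + "ATG" + "GCT" * 11 + "TAA" + "GG"
    for start, end, n in find_sorfs(toy):
        print(start, end, n)
```

A scan of this kind returns a very large number of candidates on a real genome, most of them non-coding, which is why the scoring step (homology search or codon-index methods) discussed in the abstract matters.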

Figures

Fig. (1)
A significant drop in performance is observed for the popular homology-search technique BLAST (in red) and the ab initio predictor codonW (in blue). Receiver operating characteristic curves are shown for the best-performing BLAST search in the NR database for sORFs (excluding sequences that are 50% or more homologous) and for ORFs with 100–150 codons (excluding sequences that are 70% or more homologous), along with the performance of codonW for sORFs and for ORFs with 100–150 codons, respectively.
Fig. (2)
As in Fig. (1), but for sORFs only, with an additional method called sORF finder (in black).
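
The curves in both figures are receiver operating characteristic curves. As a rough illustration of how such a curve can be derived from ranked predictions, the sketch below computes the (false positive rate, true positive rate) points and the area under the curve from a list of scores and true labels. The scores here are hypothetical; how the review's authors derived scores from BLAST or codonW output is not stated in this excerpt.

```python
# Illustrative sketch only: ROC points and AUC from scores and binary labels.
# The scores and labels are hypothetical inputs, not data from the review.


def roc_points(scores, labels):
    """Return ROC points as (false_positive_rate, true_positive_rate) pairs.

    Higher scores are treated as stronger predictions of the positive class.
    Items with tied scores are added together before a point is emitted.
    """
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    n_pos = sum(1 for _, y in pairs if y)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")

    points = [(0.0, 0.0)]
    tp = fp = 0
    for idx, (score, y) in enumerate(pairs):
        if y:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last item of a tied score group.
        if idx + 1 == len(pairs) or pairs[idx + 1][0] != score:
            points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


if __name__ == "__main__":
    pts = roc_points([0.9, 0.8, 0.7, 0.6], [1, 0, 1, 0])
    print(pts, auc(pts))
```

An area near 1.0 indicates that true genes are ranked consistently above non-genes, while an area near 0.5 is no better than random ranking.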
