Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W677-80.
doi: 10.1093/nar/gki394.

OrfPredictor: predicting protein-coding regions in EST-derived sequences


Xiang Jia Min et al. Nucleic Acids Res.

Abstract

OrfPredictor is a web server designed for identifying protein-coding regions in expressed sequence tag (EST)-derived sequences. For query sequences with a hit in BLASTX, the program predicts the coding regions based on the translation reading frames identified in the BLASTX alignments; otherwise, it predicts the most probable coding region based on the intrinsic signals of the query sequences. The output consists of the predicted peptide sequences in FASTA format, each with a definition line that includes the query ID, the translation reading frame and the nucleotide positions where the coding region begins and ends. OrfPredictor facilitates the annotation of EST-derived sequences, particularly for large-scale EST projects. OrfPredictor is available at https://fungalgenome.concordia.ca/tools/OrfPredictor.html.
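The two-branch logic described in the abstract can be illustrated with a minimal Python sketch. It is not the OrfPredictor implementation: the function names (`predict_orf`, `longest_orf_in_frame`), the definition-line layout, and the coordinate convention (1-based positions on the query, with begin greater than end for minus frames) are all assumptions made for illustration. In particular, the abstract does not specify which "intrinsic signals" are used, so the sketch substitutes the longest stop-free stretch across the six frames as a stand-in.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    # Unknown codons (e.g. containing N) are translated as X.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def longest_orf_in_frame(seq, frame):
    """Return (peptide, begin, end) for the longest stop-free stretch.

    frame is one of +1, +2, +3, -1, -2, -3. Coordinates are 1-based
    positions on the original query (hypothetical convention).
    """
    strand = seq if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    protein = translate(strand[offset:])

    best, best_start, pos = "", 0, 0
    for segment in protein.split("*"):
        if len(segment) > len(best):
            best, best_start = segment, pos
        pos += len(segment) + 1  # skip the stop codon as well

    start = offset + 3 * best_start      # 0-based start on the strand read
    stop = start + 3 * len(best)         # 0-based exclusive end
    if frame > 0:
        return best, start + 1, stop
    n = len(seq)
    return best, n - start, n - stop + 1


def predict_orf(query_id, seq, blastx_frame=None):
    """Use the BLASTX frame if one is given; otherwise scan all six frames."""
    seq = seq.upper()
    frames = [blastx_frame] if blastx_frame else [1, 2, 3, -1, -2, -3]
    best_frame, (pep, begin, end) = frames[0], longest_orf_in_frame(seq, frames[0])
    for frame in frames[1:]:
        candidate = longest_orf_in_frame(seq, frame)
        if len(candidate[0]) > len(pep):
            best_frame, (pep, begin, end) = frame, candidate
    return f">{query_id} frame={best_frame:+d} begin={begin} end={end}\n{pep}"
```

Calling `predict_orf("q1", "CCATGGCTAAATAGGG", blastx_frame=3)` restricts the search to frame +3, mirroring the BLASTX-guided branch; omitting `blastx_frame` falls back to the six-frame scan. A production tool would also need to handle start codons, minimum length thresholds and ambiguous bases more carefully.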


Figures

Figure 1
Categories of information derived from the EST sequences. (A) A typical full-length cDNA sequence including one or more stop codons in the 5′-UTR, a start codon and a stop codon. The coding region may contain multiple ATG codons encoding methionine and the 3′-UTR may harbor additional stop codons. (B) A full-length cDNA without a stop codon in the 5′-UTR. (C) A sequence containing a 5′-UTR with a stop codon and a portion of the coding region. (D) A sequence containing a 5′-UTR with a stop codon. (E) A sequence containing a 5′-UTR without a 5′ stop codon, and a portion of the coding region. (F) A sequence containing a portion of 5′-UTR without a 5′ stop codon. (G) A sequence containing the internal portion of a coding region with or without internal ATG codons. (H) A sequence containing a portion of the coding region with an internal ATG codon, a 3′ stop codon and 3′-UTR. (I) A sequence containing a portion of the coding region with no internal ATG codons, a 3′ stop codon and a 3′-UTR. (J) A sequence containing a 3′-UTR without a 3′ stop codon. Red star: stop codon at 5′ end; green circle: start codon; blue circle: internal ATG codon; red hexagon: stop codon; solid line: sequenced portion of the full-length cDNA; and dashed line: unsequenced or truncated portion of the full-length cDNA.
Figure 2
The OrfPredictor server interface for loading data and choosing other parameters.


