Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1996 Sep 15;12(11):1163-78.
doi: 10.1002/(SICI)1097-0061(19960915)12:11%3C1163::AID-YEA6%3E3.0.CO;2-7.

A computer filtering method to drive out tiny genes from the yeast genome

Affiliations

A computer filtering method to drive out tiny genes from the yeast genome

C Barry et al. Yeast. .

Abstract

The authors of the first yeast chromosome sequence defined a minimum threshold requirement of 100 codons, above which an open reading frame (ORF) is retained as a putative coding sequence. However, at least 58 yeast genes shorter than 100 codons have an assigned protein function. Therefore, the yeast genome may contain other tiny but functionally important genes that are discarded from analyses by this simple filtering rule. We have established discriminant functions from the in-phase hexamer frequencies of functional genes and of simulated ORFs derived from a stationary Markov chain model. Fifty-two out of the 58 genes were recognized as coding ORFs by our discriminating method. The test was also applied to all the small ORFs (36 to 100 codons) found in the intergenic regions of published chromosomes. It retained 140 new potential tiny coding sequences, among which we identified seven new genes by similarity searches. Our method, used conjointly with similarity searches, can also highlight sequencing errors resulting from the disruption of the coding frame of longer ORFs. This method, by its ability to detect potential coding ORFs, can be a very useful tool for functional analysis.

PubMed Disclaimer

Publication types

Associated data

LinkOut - more resources