Assessment of protein coding measures

J W Fickett¹, C S Tung

Affiliations

PMID: 1480466
PMCID: PMC334555
DOI: 10.1093/nar/20.24.6441

Free PMC article

Review

Assessment of protein coding measures

J W Fickett et al. Nucleic Acids Res. 1992.

Free PMC article

. 1992 Dec 25;20(24):6441-50.

doi: 10.1093/nar/20.24.6441.

Authors

J W Fickett¹, C S Tung

Affiliation

¹ Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, NM 87545.

PMID: 1480466
PMCID: PMC334555
DOI: 10.1093/nar/20.24.6441

Abstract

A number of methods for recognizing protein coding genes in DNA sequence have been published over the last 13 years, and new, more comprehensive algorithms, drawing on the repertoire of existing techniques, continue to be developed. To optimize continued development, it is valuable to systematically review and evaluate published techniques. At the core of most gene recognition algorithms is one or more coding measures--functions which produce, given any sample window of sequence, a number or vector intended to measure the degree to which a sample sequence resembles a window of 'typical' exonic DNA. In this paper we review and synthesize the underlying coding measures from published algorithms. A standardized benchmark is described, and each of the measures is evaluated according to this benchmark. Our main conclusion is that a very simple and obvious measure--counting oligomers--is more effective than any of the more sophisticated measures. Different measures contain different information. However there is a great deal of redundancy in the current suite of measures. We show that in future development of gene recognition algorithms, attention can probably be limited to six of the twenty or so measures proposed to date.

PubMed Disclaimer

References

1. Proteins. 1988;4(2):99-122 - PubMed
1. DNA. 1987 Oct;6(5):493-5 - PubMed
1. Nucleic Acids Res. 1986 Jan 10;14(1):127-35 - PubMed
1. Nucleic Acids Res. 1985 Jan 11;13(1):185-94 - PubMed
1. Gene. 1984 Oct;30(1-3):157-66 - PubMed

Publication types

Actions
Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions

Grants and funding

GM-37812/GM/NIGMS NIH HHS/United States

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Assessment of protein coding measures

Affiliation

Assessment of protein coding measures

Authors

Affiliation

Abstract

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical