Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008:2008:897019.
doi: 10.1155/2008/897019. Epub 2008 Jun 30.

Automated quantitative assessment of proteins' biological function in protein knowledge bases

Affiliations

Automated quantitative assessment of proteins' biological function in protein knowledge bases

Gabriele Mayr et al. Adv Bioinformatics. 2008.

Abstract

Primary protein sequence data are archived in databases together with information regarding corresponding biological functions. In this respect, UniProt/Swiss-Prot is currently the most comprehensive collection and it is routinely cross-examined when trying to unravel the biological role of hypothetical proteins. Bioscientists frequently extract single entries and further evaluate those on a subjective basis. In lieu of a standardized procedure for scoring the existing knowledge regarding individual proteins, we here report about a computer-assisted method, which we applied to score the present knowledge about any given Swiss-Prot entry. Applying this quantitative score allows the comparison of proteins with respect to their sequence yet highlights the comprehension of functional data. pfs analysis may be also applied for quality control of individual entries or for database management in order to rank entry listings.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Computation of pfs. (A) pfs was calculated with regard to the validity of bibliographic citations as well as a description and comments section. In line 9, the FUNCTION and CATALYTIC ACTIVITY records were evaluated. In step 2, automated entries resulting from large-scale experimental approaches were detached from further analysis. (B) Section evaluation. In every section, every single sentence was evaluated independently. Lines 12–14 are required in case literature citations contain down-weighting phrases, yet conclusions regarding functional properties of a protein have eventually been made by mere resemblance at the primary sequence level (Swiss-Prot term: “BY SIMILARITY”).

Similar articles

References

    1. International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature. 2004;431(7011):931–945. - PubMed
    1. Imanishi T, Itoh T, Suzuki Y, et al. Integrative annotation of 21,037 human genes validated by full-length cDNA clones. PLoS Biology. 2004;2(6):p. e162. - PMC - PubMed
    1. Apweiler R. Functional information in SWISS-PROT: the basis for large-scale characterisation of protein sequences. Briefings in Bioinformatics. 2001;2(1):9–18. - PubMed
    1. Camon E, Magrane M, Barrell D, et al. The Gene Ontology Annotation (GOA) database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Research. 2004;32, database issue:D262–D266. - PMC - PubMed
    1. Peri S, Navarro JD, Kristiansen TZ, et al. Human protein reference database as a discovery resource for proteomics. Nucleic Acids Research. 2004;32, database issue:D497–D501. - PMC - PubMed

LinkOut - more resources