Quality of computationally inferred gene ontology annotations
- PMID: 22693439
- PMCID: PMC3364937
- DOI: 10.1371/journal.pcbi.1002533
Quality of computationally inferred gene ontology annotations
Abstract
Gene Ontology (GO) has established itself as the undisputed standard for protein function annotation. Most annotations are inferred electronically, i.e. without individual curator supervision, but they are widely considered unreliable. At the same time, we crucially depend on those automated annotations, as most newly sequenced genomes are non-model organisms. Here, we introduce a methodology to systematically and quantitatively evaluate electronic annotations. By exploiting changes in successive releases of the UniProt Gene Ontology Annotation database, we assessed the quality of electronic annotations in terms of specificity, reliability, and coverage. Overall, we not only found that electronic annotations have significantly improved in recent years, but also that their reliability now rivals that of annotations inferred by curators when they use evidence other than experiments from primary literature. This work provides the means to identify the subset of electronic annotations that can be relied upon-an important outcome given that >98% of all annotations are inferred without direct curation.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures








References
-
- Dolan ME, Ni L, Camon E, Blake JA. A procedure for assessing GO annotation consistency. Bioinformatics. 2005;21(Suppl 1):i136–143. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources