Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May 9:2015:bav043.
doi: 10.1093/database/bav043. Print 2015.

The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations

Affiliations

The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations

Frederic B Bastian et al. Database (Oxford). .

Abstract

Biocuration has become a cornerstone for analyses in biology, and to meet needs, the amount of annotations has considerably grown in recent years. However, the reliability of these annotations varies; it has thus become necessary to be able to assess the confidence in annotations. Although several resources already provide confidence information about the annotations that they produce, a standard way of providing such information has yet to be defined. This lack of standardization undermines the propagation of knowledge across resources, as well as the credibility of results from high-throughput analyses. Seeded at a workshop during the Biocuration 2012 conference, a working group has been created to address this problem. We present here the elements that were identified as essential for assessing confidence in annotations, as well as a draft ontology--the Confidence Information Ontology--to illustrate how the problems identified could be addressed. We hope that this effort will provide a home for discussing this major issue among the biocuration community. Tracker URL: https://github.com/BgeeDB/confidence-information-ontology Ontology URL: https://raw.githubusercontent.com/BgeeDB/confidence-information-ontology/master/src/ontology/cio-simple.obo

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Partial overview of the CIO. The first branching of the CIO distinguishes annotations supported by a single evidence, or by multiple evidence lines. In the latter case, further subclasses refine the overall confidence in the annotation, yielded from all evidence lines available considered together.
Figure 2.
Figure 2.
Overview of the confidence statement from single evidence branch. The CIO defines three basic confidence statements, corresponding to a simple rating system, that can be modularly used for single evidence annotation, plus a rejected term, used to tag retracted results.
Figure 3.
Figure 3.
Example of conflicting versus congruent terms. This figure presents the branch ‘confidence statement from multiple evidence lines of same type’; the rationale would be the same if applied to evidence lines of multiple types. The term confidence statement from multiple evidence lines of same type has two subclasses: ‘confidence statement from conflicting evidence lines of same types’ and ‘confidence statement from congruent evidence lines of same type’. The ‘congruent evidence lines’ term has three subclasses, to define the overall level of confidence obtained from the set of supporting evidence lines. Similarly, the ‘weakly conflicting evidence lines’ term has three subclasses, defining the overall level of confidence obtained from the set of available evidence lines. The ‘strongly conflicting evidence lines’ term does not have such subclasses, as in that case, the evidence lines do not allow to reach a consensual conclusion.

References

    1. Skunca N., Altenhoff A., Dessimoz C. (2012) Quality of computationally inferred gene ontology annotations. PLoS Comput. Biol., 8, e1002533. - PMC - PubMed
    1. Schnoes A.M., Ream D.C., Thorman A.W., et al. . (2013) Biases in the experimental annotations of protein function and their effect on our understanding of protein function space. PLoS Comput. Biol., 9, e1003063. - PMC - PubMed
    1. Du Plessis L., Skunca N., Dessimoz C. (2011) The what, where, how and why of gene ontology–a primer for bioinformaticians. Brief. Bioinform., 12, 723–735. - PMC - PubMed
    1. Chibucos M.C., Mungall C.J., Balakrishnan R., et al. (2014) Standardized description of scientific evidence using the Evidence Ontology (ECO). Database, 2014, bau075 http://database.oxfordjournals.org/content/2014/bau075.long - PMC - PubMed
    1. Willighagen E.L., Waagmeester A., Spjuth O., et al. (2013) The ChEMBL database as linked open data. J. Cheminform.,5, 23. - PMC - PubMed

Publication types