Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2006 Apr 8:7:196.
doi: 10.1186/1471-2105-7-196.

A case study in pathway knowledgebase verification

Affiliations
Comparative Study

A case study in pathway knowledgebase verification

Stephen A Racunas et al. BMC Bioinformatics. .

Abstract

Background: Biological databases and pathway knowledge-bases are proliferating rapidly. We are developing software tools for computer-aided hypothesis design and evaluation, and we would like our tools to take advantage of the information stored in these repositories. But before we can reliably use a pathway knowledge-base as a data source, we need to proofread it to ensure that it can fully support computer-aided information integration and inference.

Results: We design a series of logical tests to detect potential problems we might encounter using a particular knowledge-base, the Reactome database, with a particular computer-aided hypothesis evaluation tool, HyBrow. We develop an explicit formal language from the language implicit in the Reactome data format and specify a logic to evaluate models expressed using this language. We use the formalism of finite model theory in this work. We then use this logic to formulate tests for desirable properties (such as completeness, consistency, and well-formedness) for pathways stored in Reactome. We apply these tests to the publicly available Reactome releases (releases 10 through 14) and compare the results, which highlight Reactome's steady improvement in terms of decreasing inconsistencies. We also investigate and discuss Reactome's potential for supporting computer-aided inference tools.

Conclusion: The case study described in this work demonstrates that it is possible to use our model theory based approach to identify problems one might encounter using a knowledge-base to support hypothesis evaluation tools. The methodology we use is general and is in no way restricted to the specific knowledge-base employed in this case study. Future application of this methodology will enable us to compare pathway resources with respect to the generic properties such resources will need to possess if they are to support automated reasoning.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Plot of the properties over Reactome releases. The figure shows the change in the percentage of reactions that were responsible for the violation of each particular property from releases 10 to 14.

References

    1. Galperin MY. The Molecular Biology Database Collection: 2005 update. Nucleic Acids Res. 2005;33:D5–24. doi: 10.1093/nar/gki139. - DOI - PMC - PubMed
    1. Joshi-Tope G, Gillespie M, Vastrik I, D'Eustachio P, Schmidt E, de Bono B, Jassal B, Gopinath GR, Wu GR, Matthews L, Lewis S, Birney E, Stein L. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 2005;33 Database Issue:D428–32. - PMC - PubMed
    1. Karp PD, Riley M, Saier M, Paulsen IT, Collado-Vides J, Paley SM, Pellegrini-Toole A, Bonavides C, Gama-Castro S. The EcoCyc Database. Nucleic Acids Res. 2002;30:56–58. doi: 10.1093/nar/30.1.56. - DOI - PMC - PubMed
    1. Cary MP, Bader GD, Sander C. Pathway information for systems biology. FEBS Lett. 2005;579:1815–1820. doi: 10.1016/j.febslet.2005.02.005. - DOI - PubMed
    1. Fedoroff N, Racunas SA, Shrager J. Tools for thought in the age of biological knowledge. The Scientist. 2005;19:20–21.

LinkOut - more resources