Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Feb 15;32(4):563-70.
doi: 10.1093/bioinformatics/btv484. Epub 2015 Oct 21.

An algorithm to detect and communicate the differences in computational models describing biological systems

Affiliations

An algorithm to detect and communicate the differences in computational models describing biological systems

Martin Scharm et al. Bioinformatics. .

Abstract

Motivation: Repositories support the reuse of models and ensure transparency about results in publications linked to those models. With thousands of models available in repositories, such as the BioModels database or the Physiome Model Repository, a framework to track the differences between models and their versions is essential to compare and combine models. Difference detection not only allows users to study the history of models but also helps in the detection of errors and inconsistencies. Existing repositories lack algorithms to track a model's development over time.

Results: Focusing on SBML and CellML, we present an algorithm to accurately detect and describe differences between coexisting versions of a model with respect to (i) the models' encoding, (ii) the structure of biological networks and (iii) mathematical expressions. This algorithm is implemented in a comprehensive and open source library called BiVeS. BiVeS helps to identify and characterize changes in computational models and thereby contributes to the documentation of a model's history. Our work facilitates the reuse and extension of existing models and supports collaborative modelling. Finally, it contributes to better reproducibility of modelling results and to the challenge of model provenance.

Availability and implementation: The workflow described in this article is implemented in BiVeS. BiVeS is freely available as source code and binary from sems.uni-rostock.de. The web interface BudHat demonstrates the capabilities of BiVeS at budhat.sems.uni-rostock.de.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Sketch of a model’s temporal evolution. Changes in a single reaction of Novak and Tyson’s model with ID BIOMD0000000107 in the BioModels database. The differences between versions from June 2007 (release number 8), June 2013 (release number 25) and February 2015 (latest available version) are shown. The branch represents a modification. The boxes visualize the differences between related versions
Fig. 2.
Fig. 2.
Schematic of the mapping procedure. The procedure to communicate the differences between two versions of a model (row one) to the user (row seven) is shown. Nodes AH represent single entities in the model documents. Dashed lines indicate mappings between the nodes. The values of σ and ω represent signatures and weights of nodes. They are calculated during pre-processing. The different colours of the nodes indicate modifications: updates are yellow, inserts are green and deletes are red. In the evaluation step, moves are blue
Fig. 3.
Fig. 3.
Outputs as generated by BiVeS and available from BudHat. All three figures show the differences between versions June 2007 and November 2013 of model BIOMD0000000107 (cf. Fig. 1). The reaction network (a) and the report (b) present the differences in a human readable format. The XML encoded delta (c) allows for further processing by computers. The modifications described in Figure 1 are highlighted in orange. In the highlighted reaction network (a), deletes are coloured in red, while inserts are blue and updates are yellow

Similar articles

Cited by

  • Identifying frequent patterns in biochemical reaction networks: a workflow.
    Lambusch F, Waltemath D, Wolkenhauer O, Sandkuhl K, Rosenke C, Henkel R. Lambusch F, et al. Database (Oxford). 2018 Jan 1;2018:bay051. doi: 10.1093/database/bay051. Database (Oxford). 2018. PMID: 29992320 Free PMC article.
  • Relating simulation studies by provenance-Developing a family of Wnt signaling models.
    Budde K, Smith J, Wilsdorf P, Haack F, Uhrmacher AM. Budde K, et al. PLoS Comput Biol. 2021 Aug 5;17(8):e1009227. doi: 10.1371/journal.pcbi.1009227. eCollection 2021 Aug. PLoS Comput Biol. 2021. PMID: 34351901 Free PMC article.
  • COMODI: an ontology to characterise differences in versions of computational models in biology.
    Scharm M, Waltemath D, Mendes P, Wolkenhauer O. Scharm M, et al. J Biomed Semantics. 2016 Jul 11;7(1):46. doi: 10.1186/s13326-016-0080-2. J Biomed Semantics. 2016. PMID: 27401413 Free PMC article.
  • The Cardiac Electrophysiology Web Lab.
    Cooper J, Scharm M, Mirams GR. Cooper J, et al. Biophys J. 2016 Jan 19;110(2):292-300. doi: 10.1016/j.bpj.2015.12.012. Biophys J. 2016. PMID: 26789753 Free PMC article.
  • SBML Level 3: an extensible format for the exchange and reuse of biological models.
    Keating SM, Waltemath D, König M, Zhang F, Dräger A, Chaouiya C, Bergmann FT, Finney A, Gillespie CS, Helikar T, Hoops S, Malik-Sheriff RS, Moodie SL, Moraru II, Myers CJ, Naldi A, Olivier BG, Sahle S, Schaff JC, Smith LP, Swat MJ, Thieffry D, Watanabe L, Wilkinson DJ, Blinov ML, Begley K, Faeder JR, Gómez HF, Hamm TM, Inagaki Y, Liebermeister W, Lister AL, Lucio D, Mjolsness E, Proctor CJ, Raman K, Rodriguez N, Shaffer CA, Shapiro BE, Stelling J, Swainston N, Tanimura N, Wagner J, Meier-Schellersheim M, Sauro HM, Palsson B, Bolouri H, Kitano H, Funahashi A, Hermjakob H, Doyle JC, Hucka M; SBML Level 3 Community members. Keating SM, et al. Mol Syst Biol. 2020 Aug;16(8):e9110. doi: 10.15252/msb.20199110. Mol Syst Biol. 2020. PMID: 32845085 Free PMC article. Review.

References

    1. Casadevall A., Fang F.C. (2010) Reproducible science. Infect. Immun., 78, 4972–4975. - PMC - PubMed
    1. Cobena G., et al. (2002) Detecting changes in xml documents. In: Agrawal R., et al. (eds), Proceedings 18th International Conference on Data Engineering, IEEE , San Jose, CA, pp. 41–52.
    1. Cooper J., et al. (2011) High-throughput functional curation of cellular electrophysiology models. Prog. Biophys. Mol. Biol., 107, 11–20. - PubMed
    1. Cuellar A.A., et al. (2003) An overview of CellML 1.1, a biological model description language. Simulation, 79, 740–747.
    1. Davidson S.B., Freire J. (2008) Provenance and scientific workflows: challenges and opportunities. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD ’08, ACM, New York, pp. 1345–1350, http://doi.acm.org/10.1145/1376616.1376772. - DOI

Publication types