. 2018 Sep 10;373(1758):20170381.

doi: 10.1098/rstb.2017.0381.

Towards systematic, data-driven validation of a collaborative, multi-scale model of Caenorhabditis elegans

Richard C Gerkin^{1

2}, Russell J Jarvis³, Sharon M Crook^{3

2}

Affiliations

¹ School of Life Sciences, Arizona State University, Tempe, AZ, USA rgerkin@asu.edu.
² School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ, USA.
³ School of Life Sciences, Arizona State University, Tempe, AZ, USA.

PMID: 30201844
PMCID: PMC6158230
DOI: 10.1098/rstb.2017.0381

Towards systematic, data-driven validation of a collaborative, multi-scale model of Caenorhabditis elegans

Richard C Gerkin et al. Philos Trans R Soc Lond B Biol Sci. 2018.

. 2018 Sep 10;373(1758):20170381.

doi: 10.1098/rstb.2017.0381.

Authors

Richard C Gerkin^{1

2}, Russell J Jarvis³, Sharon M Crook^{3

2}

Affiliations

¹ School of Life Sciences, Arizona State University, Tempe, AZ, USA rgerkin@asu.edu.
² School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ, USA.
³ School of Life Sciences, Arizona State University, Tempe, AZ, USA.

PMID: 30201844
PMCID: PMC6158230
DOI: 10.1098/rstb.2017.0381

Abstract

The OpenWorm Project is an international open-source collaboration to create a multi-scale model of the organism Caenorhabditis elegans At each scale, including subcellular, cellular, network and behaviour, this project employs one or more computational models that aim to recapitulate the corresponding biological system at that scale. This requires that the simulated behaviour of each model be compared with experimental data both as the model is continuously refined and as new experimental data become available. Here we report the use of SciUnit, a software framework for model validation, to attempt to achieve these goals. During project development, each model is continuously subjected to data-driven 'unit tests' that quantitatively summarize model-data agreement, identifying modelling progress and highlighting particular aspects of each model that fail to adequately reproduce known features of the biological organism and its components. This workflow is publicly visible via both GitHub and a web application and accepts community contributions to ensure that modelling goals are transparent and well-informed.This article is part of a discussion meeting issue 'Connectome to behaviour: modelling C. elegans at cellular resolution'.

Keywords: Python; informatics; modelling; unit-testing.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Figure 1.**
Testing overview. The *C. elegans* research community, including members of OpenWorm, produce models using standards such as NeuroML and optionally tune their parameters. The community also produces abundant experimental data, which OpenWorm has taken the lead in annotating and curating into easily accessible databases. The testing procedure (green box) invokes *SciUnit* and related packages to query these databases for observations that parameterize tests, to simulate these models and judge their output according to its correspondence with these empirical observations, and to generate test scores summarizing this correspondence.

**Figure 2.**
Layers of testing for the OpenWorm Project. In the first, outer-most layer, the Python unittest module is invoked from the command line or a continuous integration script, where it can run elements in the second, middle layer. This layer consists of simple blocks of code that can test models, or any project components more generally, but also can run Jupyter notebooks that contain the configuration and execution of most of the *SciUnit* tests. The notebooks can be explored locally or interactively or simply viewed remotely on GitHub. *SciUnit* test execution represents the third, inner-most layer and relies on code in the sciunit package or a variety of *SciUnit*-based packages such as neuronunit. Test metadata and test results, including scores, optionally can be uploaded to http://dash.scidash.org to add to a public record of the history of model performance on the tests.

**Figure 3.**
The SciDash web dashboard for test scores. Each time a test is run, the results are optionally programatically uploaded to http://dash.scidash.org, where they can be viewed. In this example, the scores shown represent Z-Scores, the normalized deviation between the model output and the experimental data distribution. Test scores are colour coded from green (good) to red (bad), to visualize comparison of models on a given test (but not necessarily across tests). Additional metadata (not shown) is also viewable by clicking on links associated with a score's entry. This dashboard can be filtered, sorted and searched to allow anyone to check the progress of the various models, or to compare different versions of the model on their test performance.

**Figure 4.**
Data-driven validation testing of a *C. elegans* ion channel model: ChannelWorm. The black and red current-voltage relationships (IV curves) were produced by the real and simulated worm EGL-19 ion channels, respectively. The model is judged to fail this test due to the deviation between the two. This output is also displayed in the corresponding Jupyter notebook for this test.

**Figure 5.**
Data-driven validation testing of a *C. elegans* muscle cell and the motor neuron that drives it: c302. (a) Results on a suite of four tests, two that compare the distributions of inter-spike-intervals (ISIs) and two that compare the shapes of the action potentials. This exact output is also displayed in the corresponding Jupyter notebook for this test suite. (b) A segment of the experimental membrane potential traces. (c) A segment of the simulated model membrane potential trace. (d) Normalized ISI histogram observed in the experiment and a fit to a theoretical gamma distribution. The test computes and uses parameters from such a fit to assess agreement to a similarly fit model ISI histogram.

**Figure 6.**
Data-driven validation testing of a *C. elegans* motor behaviour model: Sibernetic. (a) Scores for each of eight tests representing core movement and posture features are shown. In some cases, test results are unavailable because the corresponding features cannot be computed for simulation output in the current state of the model. In all other cases, scores indicate that model output is within two standard deviations of the experimental data distribution. (b) Graphical depiction of the contents of a WCON file constructed from segmentation of a video of *C. elegans* locomotor behaviour. One frame is shown, with the body orientation in blue, the head location in red and head locations from previous frames shown in grey.

See this image and copyright information in PMC

References

1. Beck K. 2003. Test-driven development: by example. Boston, MA: Addison-Wesley Professional.
1. Omar C, Aldrich J, Gerkin RC. 2014. Collaborative Infrastructure for Test-driven Scientific Model Validation. In Companion Proc. of the 36th International Conf. on Software Engineering, ICSE Companion 2014, pp. 524–527, New York, NY, USA: ACM; http://doi.acm.org/10.1145/2591062.2591129.
1. Sarma GP, Jacobs TW, Watts MD, Ghayoomie SV, Larson SD, Gerkin RC. 2016. Unit testing, model validation, and biological simulation. F1000Research 5, 1946 ( 10.12688/f1000research.9315.1) - DOI - PMC - PubMed
1. Hines ML, Carnevale NT. 1997. The NEURON simulation environment. Neural Comput. 9, 1179–1209. - PubMed
1. Yu Palyanov A, Khayrulin SS. 2015. Sibernetic: a software complex based on the PCI SPH algorithm aimed at simulation problems in biomechanics. Russ. J. Genet Appl. Res. 5, 635–641. ( 10.1134/S2079059715060052) - DOI

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Towards systematic, data-driven validation of a collaborative, multi-scale model of Caenorhabditis elegans

Affiliations

Towards systematic, data-driven validation of a collaborative, multi-scale model of Caenorhabditis elegans

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources