Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May 5;11(1):54.
doi: 10.1186/s12918-017-0428-y.

Parameter identifiability analysis and visualization in large-scale kinetic models of biosystems

Affiliations

Parameter identifiability analysis and visualization in large-scale kinetic models of biosystems

Attila Gábor et al. BMC Syst Biol. .

Abstract

Background: Kinetic models of biochemical systems usually consist of ordinary differential equations that have many unknown parameters. Some of these parameters are often practically unidentifiable, that is, their values cannot be uniquely determined from the available data. Possible causes are lack of influence on the measured outputs, interdependence among parameters, and poor data quality. Uncorrelated parameters can be seen as the key tuning knobs of a predictive model. Therefore, before attempting to perform parameter estimation (model calibration) it is important to characterize the subset(s) of identifiable parameters and their interplay. Once this is achieved, it is still necessary to perform parameter estimation, which poses additional challenges.

Methods: We present a methodology that (i) detects high-order relationships among parameters, and (ii) visualizes the results to facilitate further analysis. We use a collinearity index to quantify the correlation between parameters in a group in a computationally efficient way. Then we apply integer optimization to find the largest groups of uncorrelated parameters. We also use the collinearity index to identify small groups of highly correlated parameters. The results files can be visualized using Cytoscape, showing the identifiable and non-identifiable groups of parameters together with the model structure in the same graph.

Results: Our contributions alleviate the difficulties that appear at different stages of the identifiability analysis and parameter estimation process. We show how to combine global optimization and regularization techniques for calibrating medium and large scale biological models with moderate computation times. Then we evaluate the practical identifiability of the estimated parameters using the proposed methodology. The identifiability analysis techniques are implemented as a MATLAB toolbox called VisId, which is freely available as open source from GitHub ( https://github.com/gabora/visid ).

Conclusions: Our approach is geared towards scalability. It enables the practical identifiability analysis of dynamic models of large size, and accelerates their calibration. The visualization tool allows modellers to detect parts that are problematic and need refinement or reformulation, and provides experimentalists with information that can be helpful in the design of new experiments.

Keywords: Dynamic models; Global optimization; Identifiability; Overfitting; Parameter estimation; Regularization.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
TGF- β model: panel a shows the model sensitivities with respect to the parameters in logarithmic scale. In panel b the maximum number of identifiable parameters is depicted depending on the collinearity threshold. The group-size does not change much by the threshold. The red vertical line indicates our choice (CI =20) for the further analysis. In panel c the mathematical model is depicted. Nodes indicate states (yellow), identifiable (green) and not identifiable (red) parameters, and observables (blue). In panel d the interplay among the collinear parameters are indicated. There are 5 groups of parameters: in the first 4 groups triplets of parameters show large collinearity, while in the fifth group 5 parameters are collinear
Fig. 2
Fig. 2
Circadian clock in Arabidopsis thaliana: panel a shows the model sensitivities with respect to parameters in logarithmic scale. In panel b the maximum number of identifiable parameters is depicted as a function of the collinearity threshold. The red vertical line indicates our choice of threshold (CI =20) used for further analysis. In panel c the interplay among the collinear parameters is indicated up to groups of 3 parameters. In panel d the mathematical model is depicted. Nodes indicate states (yellow), identifiable (green) and not identifiable (red) parameters, and observables (blue). Highly correlated parameter pairs are connected by undirected edges
Fig. 3
Fig. 3
Representation of the connections in the B2 model using the network diagram formalism. Nodes indicate states (yellow), identifiable (green) and not identifiable (red) parameters, observables (blue), and inputs (grey). The source file of this figure is provided with the VisId toolbox; using Cytoscape, the user can navigate through it and zoom on different areas to improve the visibility
Fig. 4
Fig. 4
Representation of the connections in the B4 model using the network diagram formalism. Nodes indicate states (yellow), identifiable (green) and not identifiable (red) parameters, observables (blue), and inputs (grey)
Fig. 5
Fig. 5
Visualization of the relationships among highly collinear parameters in the B4 model. The figure shows small groups, whose sizes range between 2 and 10 parameters. Unidentifiable parameters are shown in red; identifiable parameters in green. Highly correlated pairs are connected by lines

Similar articles

Cited by

References

    1. Wiechert W, Noack S. Mechanistic pathway modeling for industrial biotechnology: challenging but worthwhile. Curr Opinion Biotechnol. 2011;22(5):604–10. doi: 10.1016/j.copbio.2011.01.001. - DOI - PubMed
    1. Menolascina F, Siciliano V, Di Bernardo D. Engineering and control of biological systems: a new way to tackle complex diseases. FEBS Lett. 2012;586(15):2122–8. doi: 10.1016/j.febslet.2012.04.050. - DOI - PubMed
    1. Smallbone K, Mendes P. Large-scale metabolic models: From reconstruction to differential equations. Ind Biotechnol. 2013;9(4):179–84. doi: 10.1089/ind.2013.0003. - DOI
    1. Chakrabarti A, Miskovic L, Soh KC, Hatzimanikatis V. Towards kinetic modeling of genome-scale metabolic networks without sacrificing stoichiometric, thermodynamic and physiological constraints. Biotechnol J. 2013;8(9):1043–57. doi: 10.1002/biot.201300091. - DOI - PubMed
    1. Almquist J, Cvijovic M, Hatzimanikatis V, Nielsen J, Jirstrand M. Kinetic models in industrial biotechnology–improving cell factory performance. Metab Eng. 2014;24:38–60. doi: 10.1016/j.ymben.2014.03.007. - DOI - PubMed

Substances

LinkOut - more resources