Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Mar;2(1):41-52.
doi: 10.1007/s11306-006-0018-2. Epub 2006 May 20.

Evaluation of regression models in metabolic physiology: predicting fluxes from isotopic data without knowledge of the pathway

Affiliations

Evaluation of regression models in metabolic physiology: predicting fluxes from isotopic data without knowledge of the pathway

Maciek R Antoniewicz et al. Metabolomics. 2006 Mar.

Abstract

This study explores the ability of regression models, with no knowledge of the underlying physiology, to estimate physiological parameters relevant for metabolism and endocrinology. Four regression models were compared: multiple linear regression (MLR), principal component regression (PCR), partial least-squares regression (PLS) and regression using artificial neural networks (ANN). The pathway of mammalian gluconeogenesis was analyzed using [U-(13)C]glucose as tracer. A set of data was simulated by randomly selecting physiologically appropriate metabolic fluxes for the 9 steps of this pathway as independent variables. The isotope labeling patterns of key intermediates in the pathway were then calculated for each set of fluxes, yielding 29 dependent variables. Two thousand sets were created, allowing independent training and test data. Regression models were asked to predict the nine fluxes, given only the 29 isotopomers. For large training sets (>50) the artificial neural network model was superior, capturing 95% of the variability in the gluconeogenic flux, whereas the three linear models captured only 75%. This reflects the ability of neural networks to capture the inherent non-linearities of the metabolic system. The effect of error in the variables and the addition of random variables to the data set was considered. Model sensitivities were used to find the isotopomers that most influenced the predicted flux values. These studies provide the first test of multivariate regression models for the analysis of isotopomer flux data. They provide insight for metabolomics and the future of isotopic tracers in metabolic research where the underlying physiology is complex or unknown.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic representation of mammalian glucose metabolism evaluated by constant [U−13C]glucose infusion. Abbreviations of metabolites: G6P, glucose-6-phosphate; Pyr, pyruvate; OAC, oxaloacetate; Fum, fumarate; AcCoA, acetyl coenzyme A; PEP, phosphoenolpyruvate; TP, triose phosphates.
Figure 2.
Figure 2.
Determination of the optimal number of principal components by the leave-one-out cross-validation method. The optimal number of principal components is defined as the fewest number of components yielding a PRESS value within 5% of the minimal observed PRESS value; in this case 10 principal components.
Figure 3.
Figure 3.
Optimal number of principal components in the PCR and PLS models for varying sizes of training data set. Larger training sets allow more principal components to be included in the model to capture the finer details of the system. ANN models typically required fewer principal components than PCR and PLS models.
Figure 4.
Figure 4.
Observed prediction accuracy of the gluconeogenesis flux as a function of the number of samples used in training. For small number of training samples the PCR and PLS models produce the best predictions; for larger number of training samples ANN yields the best predictions.

Similar articles

Cited by

References

    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'PubMed', 'value': '8203653', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/8203653/'}]}
    2. Benigni R., Giuliani A. (1994) Quantitative modeling and biology: the multivariate approach. Am. J. Physiol 266:R1697–R1704 - PubMed
    1. Bishop, C.M. (1996). Neural Networks for Pattern Recognition. Oxford University Press.
    1. None
    2. Causton D.R. (1987). A Biologist’s Advanced Mathematics. London, Allen & Unwin
    1. None
    2. Chatfield C., Collins A.J. (1981). Introduction to Multivariate Analysis. London, Chapman & Hall
    1. None
    2. Dillon W.R., Goldstein M. (1984) Multivariate Analysis Methods and Applications. Wiley, New York

LinkOut - more resources