Review

. 2021 Sep 3:12:717958.

doi: 10.3389/fpls.2021.717958. eCollection 2021.

Data Management and Modeling in Plant Biology

Maria Krantz¹, David Zimmer², Stephan O Adler¹, Anastasia Kitashova³, Edda Klipp¹, Timo Mühlhaus², Thomas Nägele³

Affiliations

¹ Theoretical Biophysics, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany.
² Computational Systems Biology, Technische Universität Kaiserslautern, Kaiserslautern, Germany.
³ Plant Evolutionary Cell Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany.

PMID: 34539712
PMCID: PMC8446634
DOI: 10.3389/fpls.2021.717958

Review

Data Management and Modeling in Plant Biology

Maria Krantz et al. Front Plant Sci. 2021.

. 2021 Sep 3:12:717958.

doi: 10.3389/fpls.2021.717958. eCollection 2021.

Authors

Maria Krantz¹, David Zimmer², Stephan O Adler¹, Anastasia Kitashova³, Edda Klipp¹, Timo Mühlhaus², Thomas Nägele³

Affiliations

¹ Theoretical Biophysics, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany.
² Computational Systems Biology, Technische Universität Kaiserslautern, Kaiserslautern, Germany.
³ Plant Evolutionary Cell Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany.

PMID: 34539712
PMCID: PMC8446634
DOI: 10.3389/fpls.2021.717958

Abstract

The study of plant-environment interactions is a multidisciplinary research field. With the emergence of quantitative large-scale and high-throughput techniques, amount and dimensionality of experimental data have strongly increased. Appropriate strategies for data storage, management, and evaluation are needed to make efficient use of experimental findings. Computational approaches of data mining are essential for deriving statistical trends and signatures contained in data matrices. Although, current biology is challenged by high data dimensionality in general, this is particularly true for plant biology. Plants as sessile organisms have to cope with environmental fluctuations. This typically results in strong dynamics of metabolite and protein concentrations which are often challenging to quantify. Summarizing experimental output results in complex data arrays, which need computational statistics and numerical methods for building quantitative models. Experimental findings need to be combined by computational models to gain a mechanistic understanding of plant metabolism. For this, bioinformatics and mathematics need to be combined with experimental setups in physiology, biochemistry, and molecular biology. This review presents and discusses concepts at the interface of experiment and computation, which are likely to shape current and future plant biology. Finally, this interface is discussed with regard to its capabilities and limitations to develop a quantitative model of plant-environment interactions.

Keywords: differential equations; genome-scale networks; machine learning; mathematical modeling; metabolic regulation; omics analysis; plant-environment interactions.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Number of articles found by article search in the PubMed® library covering 2 decades, i.e., 2000–2020 (https://pubmed.ncbi.nlm.nih.gov). **(A)** Timeline of number of articles on different *omics* disciplines (blue: genomics; orange: transcriptomics; gray: proteomics; and yellow: metabolomics). Articles were searched by single key word search, **(B)** Timeline of number of articles found by search on *omics* data integration (green line; single words were connected by AND-expression) and multi-*omics* (or multiomics, blue line).

**Figure 2**
An artificial neural network. Information, i.e., x₁, x₂, and x₃, enters the network *via* the input layer (layer 1, blue). Weights w_ij determine the quantity by which information is passed to layer 2 (hidden, green). Processed information, here y_1n and y_2n, leaves the network by the output layer (layer n, yellow). Indices refer to neuron number and layer number, respectively. Calculations for neurons are depicted exemplarily for h₁₂ (first neuron in second layer), which is composed of a bias (b₁₂) and summed information of the previous layer, here layer 1. Resulting information, y₁₂, is passed to the next layer and might comprise nonlinearities in f(h₁₂). Deep neural networks typically comprise several and up to numerous hidden layers, indicated in grey.

**Figure 3**
Conceptual workflow for data management and modeling in plant sciences.

See this image and copyright information in PMC

Cited by

Elucidating Plant-Microbe-Environment Interactions Through Omics-Enabled Metabolic Modelling Using Synthetic Communities.
Beck AE, Kleiner M, Garrell AK. Beck AE, et al. Front Plant Sci. 2022 Jun 20;13:910377. doi: 10.3389/fpls.2022.910377. eCollection 2022. Front Plant Sci. 2022. PMID: 35795346 Free PMC article.
NYUS.2: an automated machine learning prediction model for the large-scale real-time simulation of grapevine freezing tolerance in North America.
Wang H, Moghe GD, Kovaleski AP, Keller M, Martinson TE, Wright AH, Franklin JL, Hébert-Haché A, Provost C, Reinke M, Atucha A, North MG, Russo JP, Helwi P, Centinari M, Londo JP. Wang H, et al. Hortic Res. 2023 Dec 29;11(2):uhad286. doi: 10.1093/hr/uhad286. eCollection 2024 Feb. Hortic Res. 2023. PMID: 38487294 Free PMC article.
Advancing plant biology through deep learning-powered natural language processing.
Peng S, Rajjou L. Peng S, et al. Plant Cell Rep. 2024 Aug 5;43(8):208. doi: 10.1007/s00299-024-03294-9. Plant Cell Rep. 2024. PMID: 39102077 Review.
Deciphering Macromolecular Interactions Involved in Abiotic Stress Signaling: A Review of Bioinformatics Analysis.
Gouesbet G. Gouesbet G. Methods Mol Biol. 2023;2642:257-294. doi: 10.1007/978-1-0716-3044-0_15. Methods Mol Biol. 2023. PMID: 36944884 Review.

References

1. Andrews S. S., Arkin A. P. (2006). Simulating cell biology. Curr. Biol. 16, R523–R527. 10.1016/j.cub.2006.06.048, PMID: - DOI - PubMed
1. Ara T., Enomoto M., Arita M., Ikeda C., Kera K., Yamada M., et al. . (2015). Metabolonote: a wiki-based database for managing hierarchical metadata of metabolome analyses. Front. Bioeng. Biotechnol. 3:38. 10.3389/fbioe.2015.00038, PMID: - DOI - PMC - PubMed
1. Babtie A. C., Stumpf M. P. H. (2017). How to deal with parameters for whole-cell modelling. J. R. Soc. Interface 14:20170237. 10.1098/rsif.2017.0237, PMID: - DOI - PMC - PubMed
1. Bayer P. E., Golicz A. A., Scheben A., Batley J., Edwards D. (2020). Plant pan-genomes are the new reference. Nat. Plants 6, 914–920. 10.1038/s41477-020-0733-0, PMID: - DOI - PubMed
1. Berardini T. Z., Reiser L., Li D., Mezheritsky Y., Muller R., Strait E., et al. . (2015). The arabidopsis information resource: making and mining the “gold standard” annotated reference plant genome. Genesis 53, 474–485. 10.1002/dvg.22877, PMID: - DOI - PMC - PubMed

Publication types

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Data Management and Modeling in Plant Biology

Affiliations

Data Management and Modeling in Plant Biology

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

LinkOut - more resources

Full Text Sources