Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Sep 3:12:717958.
doi: 10.3389/fpls.2021.717958. eCollection 2021.

Data Management and Modeling in Plant Biology

Affiliations
Review

Data Management and Modeling in Plant Biology

Maria Krantz et al. Front Plant Sci. .

Abstract

The study of plant-environment interactions is a multidisciplinary research field. With the emergence of quantitative large-scale and high-throughput techniques, amount and dimensionality of experimental data have strongly increased. Appropriate strategies for data storage, management, and evaluation are needed to make efficient use of experimental findings. Computational approaches of data mining are essential for deriving statistical trends and signatures contained in data matrices. Although, current biology is challenged by high data dimensionality in general, this is particularly true for plant biology. Plants as sessile organisms have to cope with environmental fluctuations. This typically results in strong dynamics of metabolite and protein concentrations which are often challenging to quantify. Summarizing experimental output results in complex data arrays, which need computational statistics and numerical methods for building quantitative models. Experimental findings need to be combined by computational models to gain a mechanistic understanding of plant metabolism. For this, bioinformatics and mathematics need to be combined with experimental setups in physiology, biochemistry, and molecular biology. This review presents and discusses concepts at the interface of experiment and computation, which are likely to shape current and future plant biology. Finally, this interface is discussed with regard to its capabilities and limitations to develop a quantitative model of plant-environment interactions.

Keywords: differential equations; genome-scale networks; machine learning; mathematical modeling; metabolic regulation; omics analysis; plant-environment interactions.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Number of articles found by article search in the PubMed® library covering 2 decades, i.e., 2000–2020 (https://pubmed.ncbi.nlm.nih.gov). (A) Timeline of number of articles on different omics disciplines (blue: genomics; orange: transcriptomics; gray: proteomics; and yellow: metabolomics). Articles were searched by single key word search, (B) Timeline of number of articles found by search on omics data integration (green line; single words were connected by AND-expression) and multi-omics (or multiomics, blue line).
Figure 2
Figure 2
An artificial neural network. Information, i.e., x1, x2, and x3, enters the network via the input layer (layer 1, blue). Weights wij determine the quantity by which information is passed to layer 2 (hidden, green). Processed information, here y1n and y2n, leaves the network by the output layer (layer n, yellow). Indices refer to neuron number and layer number, respectively. Calculations for neurons are depicted exemplarily for h12 (first neuron in second layer), which is composed of a bias (b12) and summed information of the previous layer, here layer 1. Resulting information, y12, is passed to the next layer and might comprise nonlinearities in f(h12). Deep neural networks typically comprise several and up to numerous hidden layers, indicated in grey.
Figure 3
Figure 3
Conceptual workflow for data management and modeling in plant sciences.

Similar articles

Cited by

References

    1. Andrews S. S., Arkin A. P. (2006). Simulating cell biology. Curr. Biol. 16, R523–R527. 10.1016/j.cub.2006.06.048, PMID: - DOI - PubMed
    1. Ara T., Enomoto M., Arita M., Ikeda C., Kera K., Yamada M., et al. . (2015). Metabolonote: a wiki-based database for managing hierarchical metadata of metabolome analyses. Front. Bioeng. Biotechnol. 3:38. 10.3389/fbioe.2015.00038, PMID: - DOI - PMC - PubMed
    1. Babtie A. C., Stumpf M. P. H. (2017). How to deal with parameters for whole-cell modelling. J. R. Soc. Interface 14:20170237. 10.1098/rsif.2017.0237, PMID: - DOI - PMC - PubMed
    1. Bayer P. E., Golicz A. A., Scheben A., Batley J., Edwards D. (2020). Plant pan-genomes are the new reference. Nat. Plants 6, 914–920. 10.1038/s41477-020-0733-0, PMID: - DOI - PubMed
    1. Berardini T. Z., Reiser L., Li D., Mezheritsky Y., Muller R., Strait E., et al. . (2015). The arabidopsis information resource: making and mining the “gold standard” annotated reference plant genome. Genesis 53, 474–485. 10.1002/dvg.22877, PMID: - DOI - PMC - PubMed

LinkOut - more resources