Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 20;11(1):24209.
doi: 10.1038/s41598-021-03625-w.

Three topological features of regulatory networks control life-essential and specialized subsystems

Affiliations

Three topological features of regulatory networks control life-essential and specialized subsystems

Ivan Rodrigo Wolf et al. Sci Rep. .

Abstract

Gene regulatory networks (GRNs) play key roles in development, phenotype plasticity, and evolution. Although graph theory has been used to explore GRNs, associations amongst topological features, transcription factors (TFs), and systems essentiality are poorly understood. Here we sought the relationship amongst the main GRN topological features that influence the control of essential and specific subsystems. We found that the Knn, page rank, and degree are the most relevant GRN features: the ones are conserved along the evolution and are also relevant in pluripotent cells. Interestingly, life-essential subsystems are governed mainly by TFs with intermediary Knn and high page rank or degree, whereas specialized subsystems are mainly regulated by TFs with low Knn. Hence, we suggest that the high probability of TFs be toured by a random signal, and the high probability of the signal propagation to target genes ensures the life-essential subsystems' robustness. Gene/genome duplication is the main evolutionary process to rise Knn as the most relevant feature. Herein, we shed light on unexplored topological GRN features to assess how they are related to subsystems and how the duplications shaped the regulatory systems along the evolution. The classification model generated can be found here: https://github.com/ivanrwolf/NoC/ .

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Predictive performances. (a) Predictive performances during supervised learning. CCI correctly classified instances, TPR true positive rate, 1-FPR one minus false positive rate, MCC Matthews correlation coefficient, ROC area receiver operating characteristic area under the curve, PRC area precision-recall curve; (b) predictive score of the consensus models over each test set. Blue boxes and the Y-left axis depict the classification using the normal consensus model (only the scores from CCI were plotted), whereas the red boxes and Y-right axis depict classification using the random model. The “+” indicates the mean. The Mann–Whitney test showed a p-value < 0.001 for all comparisons.
Figure 2
Figure 2
Decision tree, GO, and network simulation analysis. (a) The consensus tree which “A”, “B”, “C”, “D”, “E”, and “F” are the bins from the discretization step. Orange squares are the node’s features, and blue squares are the classified leaves; (b) the biological process (rows) of genes in tree’s leaves in (a) and the feature that leads to the leaves (Knn, degree, or page rank) (columns). The “reg.” means regulators, “tar.” means targets. The black box indicates the presence of a given GO term in genes at that tree leaves. The histogram in the box below the heatmap depicts the percentage of GO terms from genes that lie in each leaf type; (c) representation of hypothetical networks. The Knn was calculated for the regulators (yellow nodes). Blue nodes are genes with just one connection. The red node depicts a blue node duplication. The green nodes represent other regulators or genes regulated by many regulators. “I”, “II”, “III”, and “IV” represents networks in an initial state, after a gene duplication or during pervasive transcription, after duplication of a different regulator, and after duplication of the regulator in which Knn is calculated, respectively; (d) simulation of Knn evolution of regulators from (c). The X-axis is the degree of targets and regulators, and the Y-axis is the regulator’s Knn. The diagonal grey line is the identity line (a line where every point has proximal X and Y coordinates), which by crossing only the second point, indicates divergencies since the beginning of the simulation.

Similar articles

Cited by

References

    1. Barabási AL, Oltvai ZN. Network biology: Understanding the cell’s functional organization. Nat. Rev. Genet. 2004;5:101–113. - PubMed
    1. Serban M. Exploring modularity in biological networks. Philos. Trans. R. Soc. B Biol. Sci. 2020;375:20190316. - PMC - PubMed
    1. Barclay SS, et al. Systems biology analysis of drosophila in vivo screen data elucidates core networks for DNA damage repair in SCA1. Hum. Mol. Genet. 2014;23:1345–1364. - PubMed
    1. Huang S, Eichler G, Bar-Yam Y, Ingber DE. Cell fates as high-dimensional attractor states of a complex gene regulatory network. Phys. Rev. Lett. 2005;94:128701. - PubMed
    1. Forster J. Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res. 2003;13:244–253. - PMC - PubMed

Publication types