PLoS Comput Biol. 2010 Aug 5;6(8):e1000873.

doi: 10.1371/journal.pcbi.1000873.

Evolution of gene regulatory networks by fluctuating selection and intrinsic constraints

Masaki E Tsuda¹, Masakado Kawata

PMID: 20700492
PMCID: PMC2916849
DOI: 10.1371/journal.pcbi.1000873

Masaki E Tsuda et al. PLoS Comput Biol. 2010.

Affiliation

¹ Division of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, Sendai, Japan. teuder@gmail.com

Abstract

Various characteristics of complex gene regulatory networks (GRNs) have been discovered during the last decade, e.g., redundancy, exponential indegree distributions, scale-free outdegree distributions, mutational robustness, and evolvability. Although progress has been made in this field, it is not well understood whether these characteristics are the direct products of selection or those of other evolutionary forces such as mutational biases and biophysical constraints. To elucidate the causal factors that promoted the evolution of complex GRNs, we examined the effect of fluctuating environmental selection and some intrinsic constraining factors on GRN evolution by using an individual-based model. We found that the evolution of complex GRNs is remarkably promoted by fixation of beneficial gene duplications under unpredictably fluctuating environmental conditions and that some internal factors inherent in organisms, such as mutational bias, gene expression costs, and constraints on expression dynamics, are also important for the evolution of GRNs. The results indicate that various biological properties observed in GRNs could evolve as a result of not only adaptation to unpredictable environmental changes but also non-adaptive processes owing to the properties of the organisms themselves. Our study emphasizes that evolutionary models considering such intrinsic constraining factors should be used as null models to analyze the effect of selection on GRN evolution.

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Figure 1. Schematic representation of the model.**
(A) Each gene has a *cis*-regulatory region composed of 100 *cis*-sites (boxes; potential transcription factor binding sites) and a coding region (diamonds) from which the gene products (circles) are created. The products of regulatory genes bind to the corresponding binding sites (represented by the same colors) and control the expression of the target genes. A *cis*-regulatory region is allowed to have multiple binding sites for the same transcription factor; thus, the strength of regulatory interactions, including activation (red arrows) and repression (blue arrows), depends on the numbers and properties of the binding sites. The regulatory cascade starts with an input signal that activates the R₁ gene. The phenotype of an individual is defined as the steady-state expression level of phenotypic genes. Core genes are expressed and actually involved in phenotypic expression, whereas pseudo-expression genes are expressed but not involved in phenotypic expression. (B) The fitness of an individual depends on the cost of gene expression and the phenotypic suitability to the environment. The phenotypic suitability to the environment depends on the Euclidean distance between the individual phenotype and the optimum phenotype. The position of the optimum shifts by a constant distance (d) once every *f⁻¹* generations, either in a random direction (random-walk) or to a fixed position (cyclic).
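The random-walk optimum shift in panel (B) can be illustrated with a minimal sketch. It assumes the optimum is a point in a space with one axis per phenotypic gene and that a shift happens once every round(1/f) generations; the function names, the two-dimensional example, and the way the random direction is drawn are illustrative choices, not details taken from the paper. The fitness function itself is not reproduced here because its form is not given in the caption.

```python
import math
import random

def shift_optimum(optimum, d, rng):
    """Move the optimum a fixed distance d in a uniformly random direction."""
    # A normalized Gaussian vector is uniformly distributed on the unit sphere.
    direction = [rng.gauss(0.0, 1.0) for _ in optimum]
    norm = math.sqrt(sum(x * x for x in direction))
    return [o + d * x / norm for o, x in zip(optimum, direction)]

def random_walk_optimum(start, d, f, generations, seed=0):
    """Return the optimum at each generation (hypothetical helper).

    Assumption: the optimum shifts once every round(1/f) generations.
    """
    rng = random.Random(seed)
    period = max(1, round(1.0 / f))
    optimum = list(start)
    trajectory = []
    for generation in range(1, generations + 1):
        if generation % period == 0:
            optimum = shift_optimum(optimum, d, rng)
        trajectory.append(list(optimum))
    return trajectory

def euclidean_distance(a, b):
    """Euclidean distance between two phenotype vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
```

With d = 0.1 and f = 0.1, for example, the optimum stays fixed for nine generations and then jumps by exactly 0.1 at the tenth, so `euclidean_distance` between consecutive optima is either 0 or d.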

**Figure 2. GRN structures that evolved under various fluctuations of phenotypic selection.**
The number of core genes (#core), pseudo-expression genes (#psdexp), and silent genes (#silent) in GRNs that evolved for 50,000 generations under random-walk optimum shift (RW), and those that evolved under cyclic optimum shift (CY). All parameters were set at standard values (Table 1). Each point connected by solid lines represents the mean number of each type of gene in evolved GRNs under each selective condition. Vertical bars attached to the points represent 95% confidence intervals. d and f represent the amplitude and frequency of the optimum shift, respectively.

**Figure 3. Relationship between the time-averaged fitness of a population and the GRN structures.**
(A) An example of the change in mean population fitness during evolution. The red line indicates the mean fitness of the population at each generation. The horizontal dotted line indicates the time-averaged fitness (F′) of this population over the course of evolution. (B) The time-averaged fitness of GRNs that evolved under various fluctuations of phenotypic selection. (C) The relationship between the time-averaged fitness and the structure of GRNs. The red line indicates a quintic curve fitted by the nonlinear least-squares method.

**Figure 4. Relationship between the number of core genes in GRNs and the phenotypic effects of various types of mutations in core genes.**
Points represent the results of each population evolved under various amplitudes (d) and frequencies (f) of random-walk optimum shift. Horizontal axes indicate the number of core genes in a population. Panels in each column indicate the effect of different types of mutations (basal transcription level mutation, BTL; *cis*-regulatory mutation, CIS; trans-regulatory mutation, TRA; gene deletion, DEL; gene duplication, DUP). *P_L*, *P_N*, and *P_S* show the proportion of mutations that cause *Loss-of-phenotype*, those that have no phenotypic change (*Non-effect*), and those that have a *Significant* phenotypic change, respectively (*P_L* + *P_N* + *P_S* = 1). *D_S* shows the size of phenotypic changes caused by *Significant* mutations (the Euclidean distance between the original and mutant phenotypes). Statistical significance of the correlation was analyzed by Kendall's correlation test.
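As a rough illustration of the rank correlation behind Kendall's test, the sketch below computes Kendall's tau-a from paired observations. It assumes no tie correction and omits the significance (p-value) calculation, so it is not necessarily the exact variant used in the paper; the function name and any input values are hypothetical.

```python
def kendall_tau_a(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs.

    Tied pairs count as neither concordant nor discordant (no tie correction).
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    concordant = 0
    discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
            if sign > 0:
                concordant += 1
            elif sign < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)
```

Here `xs` would hold the number of core genes per population and `ys` one of the plotted quantities (for example *P_N*); a value near +1 or -1 indicates a strong monotonic relationship.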

**Figure 5. Relationship between the intensity of the optimum fluctuation and the fitness effect of various types of mutations during evolution.**
Points represent the results of a population that evolved under various conditions of random-walk optimum shift. Horizontal axes indicate the time-averaged fitness of a population. Panels in each column indicate the effect of different types of mutations (same as Figure 4). Nt indicates the total number of mutations of each type that arose during evolution. P indicates the proportions of mutations that have beneficial (red), neutral (blue), and deleterious (black) effects, respectively.

**Figure 6. Relationship between the intensity of the optimum fluctuation and the fitness effects of gene duplication and gene deletion during evolution.**
Points represent the results of a population that evolved under various conditions of random-walk optimum shift. Nt(x), Nb(x) and Pb(x) indicate the total number of mutations, number of beneficial mutations, and the proportions of beneficial mutations that arose during the evolution for mutation type x, respectively. Vertical axes indicate the difference in the number and the proportions of beneficial mutations between gene duplications and gene deletions. Horizontal axes indicate the time-averaged fitness of a population (F′).
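The quantities on the vertical axes can be written out as a small sketch. It assumes that Pb(x) = Nb(x) / Nt(x) and that each difference is taken as duplication minus deletion; the direction of the difference, the function name, and the dictionary layout are assumptions made for illustration.

```python
def beneficial_differences(counts):
    """Compare beneficial duplications and deletions.

    counts maps a mutation type ("DUP" or "DEL") to a tuple (Nt, Nb):
    the total number of mutations and the number of beneficial ones.
    Assumption: differences are computed as DUP minus DEL.
    """
    nt_dup, nb_dup = counts["DUP"]
    nt_del, nb_del = counts["DEL"]
    pb_dup = nb_dup / nt_dup  # proportion of beneficial duplications
    pb_del = nb_del / nt_del  # proportion of beneficial deletions
    return {"Nb_diff": nb_dup - nb_del, "Pb_diff": pb_dup - pb_del}
```

Under this convention, positive values mean that beneficial duplications outnumber (or are proportionally more frequent than) beneficial deletions in that population.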

**Figure 7. Relationship between the number of core genes after evolution and the number of beneficial gene duplications and gene deletions.**
Points represent the results of a population that evolved under various conditions of random-walk optimum shift. Vertical axes indicate the number of core genes. Nt(x), Nb(x) and Pb(x) indicate the total number of mutations, the number of beneficial mutations, and the proportion of beneficial mutations that arose during evolution for mutation type x, respectively. Horizontal axes indicate the difference in the number and the proportion of beneficial mutations between gene duplications and gene deletions. Statistical significance of the correlation was analyzed by Kendall's correlation test.

**Figure 8. Effect of the strength of steady-state constraints on GRN evolution.**
Greater values of V indicate weaker constraints on steady-state expression (V = 10⁻⁴, standard parameter value). Points connected by solid lines represent the mean number of core genes (#core), pseudo-expression genes (#psdexp), silent genes (#silent) and the time-averaged fitness (F′) in populations that evolved for 50,000 generations under each simulation condition. Vertical bars indicate 95% confidence intervals. Different colors indicate different conditions of phenotypic selection: d = 10⁻¹, f = 10⁻¹ (red); d = 10⁰, f = 10⁻³ (blue); d = 10⁻³, f = 10⁻³ (black) under random-walk optimum shift.

**Figure 9. Effect of gene expression costs on GRN evolution.**
Greater values of c indicate the larger fitness load of a unit of gene expression (c = 10⁻⁵, standard parameter value). Points connected by solid lines represent the mean number of core genes (#core), pseudo-expression genes (#psdexp), silent genes (#silent) and the time-averaged fitness (F′) in populations that evolved for 50,000 generations under each simulation condition. Vertical bars indicate 95% confidence intervals. Different colors indicate different conditions of phenotypic selection: d = 10⁻¹, f = 10⁻¹ (red); d = 10⁰, f = 10⁻³ (blue); d = 10⁻³, f = 10⁻³ (black) under random-walk optimum shift.

**Figure 10. Effects of the probability of binding site formation by regulatory mutations (*C_mut*) on GRN evolution.**
Greater values of *C_mut* indicate larger probabilities of binding site formation by regulatory mutation (*C_mut* = 10⁻², standard parameter value). To control the *C_mut* value, the size of the *cis*-regulatory region of a gene (L) was varied; L = 10, 30, 100, 303, and 1000 for *C_mut* = 10⁻³, 3×10⁻³, 10⁻², 3×10⁻², and 10⁻¹, respectively. Points connected by solid lines represent the mean number of core genes (#core), pseudo-expression genes (#psdexp), silent genes (#silent) and the time-averaged fitness (F′) in populations that evolved for 50,000 generations under each simulation condition. Vertical bars indicate 95% confidence intervals. Different colors indicate different conditions of phenotypic selection: d = 10⁻¹, f = 10⁻¹ (red); d = 10⁰, f = 10⁻³ (blue); d = 10⁻³, f = 10⁻³ (black) under random-walk optimum shift.

**Figure 11. Indegree distribution of the assembled GRNs that evolved under various *C_mut* levels.**
Horizontal and vertical axes in each panel show the indegree (the number of regulatory interactions that arrive at a gene) and the frequency, respectively. Note that the vertical axes are shown logarithmically to demonstrate the exponential character of the distribution. Different rows and columns indicate the different conditions of phenotypic selection and the different values of *C_mut*, respectively. Lines in each panel indicate the regression of the plot to the power-law distribution (red), exponential distribution (blue), and Poisson distribution (green). Regression was estimated by a nonlinear least-squares method. To judge the goodness of the regression, Akaike's information criterion (AIC) was used, and the regression that showed the smallest value of AIC was drawn as a thick line. POW, EXP and POI in each panel indicate the differences between the AIC value of the best regression model and those of the power-law (scale-free), exponential and Poisson distributions, respectively.
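The AIC differences reported as POW, EXP and POI can be sketched as follows. This assumes the common least-squares form AIC = n·ln(RSS/n) + 2k, where RSS is the residual sum of squares and k the number of fitted parameters; the caption does not state which AIC formula was used, and the function names and input layout here are hypothetical. The curve fitting itself is left out: the sketch takes already-fitted predictions as input.

```python
import math

def aic_least_squares(observed, predicted, n_params):
    """AIC for a least-squares fit, assuming Gaussian errors (constants dropped)."""
    n = len(observed)
    rss = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    if rss == 0:
        raise ValueError("perfect fit: this AIC form is undefined when RSS is 0")
    return n * math.log(rss / n) + 2 * n_params

def delta_aic(observed, fits):
    """Return each model's AIC minus the smallest AIC among all models.

    fits maps a model name to (predicted values, number of fitted parameters).
    The best model therefore gets a difference of exactly 0.
    """
    scores = {
        name: aic_least_squares(observed, predicted, k)
        for name, (predicted, k) in fits.items()
    }
    best = min(scores.values())
    return {name: score - best for name, score in scores.items()}
```

A model with a difference of 0 corresponds to the regression drawn as a thick line; larger differences indicate progressively weaker support relative to that best model.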

**Figure 12. Outdegree distribution of assembled GRNs that evolved under various *C_mut* levels.**
Horizontal and vertical axes in each panel show the outdegree (the number of regulatory interactions that depart from a gene) and the frequency, respectively. Note that both the horizontal and vertical axes are shown logarithmically to demonstrate the scale-free character of the distribution. Different rows and columns show the different conditions of phenotypic selection and the different values of *C_mut*, respectively. Lines in each panel indicate the regression of the plot to the power-law distribution (red), exponential distribution (blue) and Poisson distribution (green). Regression was estimated by a nonlinear least-squares method. To judge the goodness of the regression, Akaike's information criterion (AIC) was used, and the regression that showed the smallest value of AIC was drawn as a thick line. POW, EXP and POI in each panel indicate the differences between the AIC value of the best regression model and those of the power-law (scale-free), exponential and Poisson distributions, respectively.

**Figure 13. Relationships between GRN structures and the relative rates of gene duplication and gene deletion (*μ_DEL/μ_DUP*).**
Standard parameter value, *μ_DEL/μ_DUP* = 1. To control the value of *μ_DEL/μ_DUP*, only *μ_DEL* was varied from 10⁻⁷ to 10⁻⁵, while *μ_DUP* was fixed at its standard value (10⁻⁶). Points connected by solid lines represent the mean number of core genes (#core), pseudo-expression genes (#psdexp), silent genes (#silent) and the time-averaged fitness (F′) in populations that evolved for 50,000 generations under each simulation condition. Vertical bars indicate 95% confidence intervals. Different colors indicate different conditions of phenotypic selection: d = 10⁻¹, f = 10⁻¹ (red); d = 10⁰, f = 10⁻³ (blue); d = 10⁻³, f = 10⁻³ (black) under random-walk optimum shift.

**Figure 14. GRN structures that evolved with horizontal transfer of regulatory genes.**
Instead of the duplication of existing regulatory genes, a randomly created new regulatory gene was introduced into a GRN (i.e., *μ_DUP* = 0, *μ_HOR* = 10⁻⁶). All other parameters were set at standard values. Each point connected by solid lines represents the mean number of each type of gene in evolved GRNs under each selective condition. Vertical bars attached to the points represent 95% confidence intervals. d and f represent the amplitude and frequency of the optimum shift, respectively.

Cited by

Bistability in feedback circuits as a byproduct of evolution of evolvability.
Kuwahara H, Soyer OS. Mol Syst Biol. 2012 Jan 17;8:564. doi: 10.1038/msb.2011.98. PMID: 22252387.
Genetic and environmental factors affecting cryptic variations in gene regulatory networks.
Iwasaki WM, Tsuda ME, Kawata M. BMC Evol Biol. 2013 Apr 26;13:91. doi: 10.1186/1471-2148-13-91. PMID: 23622056.
ncDNA and drift drive binding site accumulation.
Ruths T, Nakhleh L. BMC Evol Biol. 2012 Aug 30;12:159. doi: 10.1186/1471-2148-12-159. PMID: 22935101.
Early effects of gene duplication on the robustness and phenotypic variability of gene regulatory networks.
Posadas-García YS, Espinosa-Soto C. BMC Bioinformatics. 2022 Nov 28;23(1):509. doi: 10.1186/s12859-022-05067-1. PMID: 36443677.
Co-expression network analysis of environmental canalization in the ascidian Ciona.
Sato A, Oba GM, Aubert-Kato N, Yura K, Bishop J. BMC Ecol Evol. 2022 Apr 28;22(1):53. doi: 10.1186/s12862-022-02006-9. PMID: 35484499.
