Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May;197(1):237-55.
doi: 10.1534/genetics.113.160325. Epub 2014 Feb 28.

Fisher's geometrical model emerges as a property of complex integrated phenotypic networks

Affiliations

Fisher's geometrical model emerges as a property of complex integrated phenotypic networks

Guillaume Martin. Genetics. 2014 May.

Abstract

Models relating phenotype space to fitness (phenotype-fitness landscapes) have seen important developments recently. They can roughly be divided into mechanistic models (e.g., metabolic networks) and more heuristic models like Fisher's geometrical model. Each has its own drawbacks, but both yield testable predictions on how the context (genomic background or environment) affects the distribution of mutation effects on fitness and thus adaptation. Both have received some empirical validation. This article aims at bridging the gap between these approaches. A derivation of the Fisher model "from first principles" is proposed, where the basic assumptions emerge from a more general model, inspired by mechanistic networks. I start from a general phenotypic network relating unspecified phenotypic traits and fitness. A limited set of qualitative assumptions is then imposed, mostly corresponding to known features of phenotypic networks: a large set of traits is pleiotropically affected by mutations and determines a much smaller set of traits under optimizing selection. Otherwise, the model remains fairly general regarding the phenotypic processes involved or the distribution of mutation effects affecting the network. A statistical treatment and a local approximation close to a fitness optimum yield a landscape that is effectively the isotropic Fisher model or its extension with a single dominant phenotypic direction. The fit of the resulting alternative distributions is illustrated in an empirical data set. These results bear implications on the validity of Fisher's model's assumptions and on which features of mutation fitness effects may vary (or not) across genomic or environmental contexts.

Keywords: Fisher’s geometrical model; mutation fitness effects; network biology; random matrix theory; systems biology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Model of genotype–phenotype–fitness map. This schematic representation shows the different levels of integration assumed in the model, from a single genetic change in the DNA (left) to its effect on the Malthusian fitness of the whole organism. Each mutation pleiotropically affects a large subset of p “mutable traits” (orange ovals), via a complex interaction network among proteins. The parent phenotype at all these traits is represented by the vector x. The effect of a mutation (on the offspring’s phenotype) is a random small perturbation dx with mean zero and arbitrary multivariate distribution (covariance V). These basic mutational changes “percolate” through the network of interactions to induce changes at a much smaller set of n key integrative traits (“optimized traits,” green ovals), which are those under stabilizing selection, represented by the vector y. An arbitrary developmental function y=φ(x) relates mutable to optimized traits (developmental integration). The effect of mutations on y is a perturbation vector dy that is approximately linear in dx (to leading order), with linear coefficients bij=xjφ(yi), arbitrarily distributed. The optimized traits directly determine fitness via a locally quadratic function m(y) around some optimum (set at y=0).
Figure 2
Figure 2
Applicability of the quadratic approximation around the optimum, for various fitness functions. Different fitness functions m(y) (indicated in inset) are illustrated in one dimension (y). Conditional on the existence of a nonvanishing second derivative at the optimum (m(0)0), the quadratic approximation (dashed line) applies (A) or does not apply (B).
Figure 3
Figure 3
Convergence of the Marchenko–Pastur law toward isotropy. The pdf ρ(x) of the Marchenko–Pastur (M-P) law is illustrated for a set of values of the shape parameter β=p/n (see inset). The matrices are scaled to have E(λ)=1 (ζ=1/β). The histograms show the spectral distribution of a simulated Wishart matrix (n=300,β=p/n), and the lines show the limit spectral distribution given by the corresponding Marchenko–Pastur law λMP(β,1/β) (Equation 3). The spectral distribution is well captured by the M-P law and becomes narrower as the ratio index becomes larger (np, high integration).
Figure 4
Figure 4
Spectral distributions under the random phenotypic network model. The observed distribution of eigenvalues of M (spectral distribution) is shown together with the predicted M-P law approximation. In each case, a single random phenotypic network (matrices H, B, and V) was drawn and the resulting spectral distribution of  H.W.H* is shown. The n×p matrix of pathway coefficients B={bij}i[1,n],j[1,p] was set to have mean vector μB={μj}j[1,p] drawn randomly with μj~N(0,σ) and p×p covariance matrix CB. The pathway coefficients bij were drawn from various alternative distributions: discrete bimodal (±1/n), normal, uniform, or a mixture of the two. The covariance matrices CB and V were drawn independently as Wishart deviates: V,CB~Wp+1(Λp)/(p+1), where Λp is a diagonal matrix. The matrix Λp has p gamma-distributed diagonal entries, thus allowing us to set a high coefficient of variation (cvW) (A) of the eigenvalues of W (i.e., of n CB.V). The matrices were scaled so that E(λi1)=1 in the bulk and the dimensions were n=100 and p=2000. A shows the resulting distribution of the bulk eigenvalues λi1 of either H.H* or H.W.H* (see inset) and the corresponding M-P law as solid lines. Each histogram color corresponds to a distinct distribution of the bij. B shows the behavior of the dominant eigenvalue λ1. The coefficient cv (Equation 5) is set in the simulations via the scaling parameter σ. For each value of cv on the x-axis, the full set of eigenvalues is represented by the circles. Red and blue circles correspond to bij drawn from normal or uniform distributions (undistinguishable). The range of the bulk (λi1, orange area) and the dominant eigenvalue λ1 (green line) are well predicted by the M-P law approximation.
Figure 5
Figure 5
Simulated DFE vs. isotropic approximation(s). The distribution of selection coefficients among random mutations is represented for different situations (see File S3). The network was simulated as in Figure 4. The bias μBμB was scaled to obtain a given level of α: α=1 (C) or α=50 (D). The mutable traits dx were drawn as i.i.d. uniform deviates and scaled to obtain V(dx)=V. Then dy=B.dx was computed and scaled to enforce E(s)=0.01. The n parental coordinates yi were drawn as standard normal deviates and scaled to enforce a given y2/2=so. The fitness effect of mutations was then computed from y and dy according to Equation 1. All other parameters are indicated in the insets: n=6, p=200. A illustrates the distribution of pathway coefficients, scaled by their variance (bij/V(bij)) for each situation: below the phase transition [α=1: all bij under the same blue histogram N(0,1)] or beyond the phase transition (α=50: each color gives the  bij for a given index j for the three first mutable traits). The corresponding anisotropy in the spectral distribution of M is illustrated in B: ellipsoids are the 95% domain of the mutant phenotypes on the three first optimized traits yi. Beyond the phase transition (α=50: purple cloud), there is a favored direction in y space. C and D show the corresponding DFE, below (C) or beyond (D) the phase transition, at the optimum (so=0, blue), close to it (so=s¯, red), or farther from it (so=5 s¯, brown). The lines show the corresponding analytic predictions [Equation 9 (C) or Equations 10 and 11 (D)] (see insets).
Figure 6
Figure 6
Fit of the gamma and gamma sum models to empirical DFEs in bacteria. Top, the distribution of selection coefficients among random nucleotide substitutions vs. alternative fitted models: Equation 11 with α=1 (a gamma distribution, blue curve) or α freely fitted (a convolution of two gammas, red curve). Bottom, the corresponding empirical cumulative distribution function vs. fitted models. The values of the best-fit parameters are given in Table 2. The significance of the test for α>1 vs. α=1 is given below the estimated value.

Similar articles

Cited by

References

    1. Bai Z., Silverstein J. W., 2010. Spectral Analysis of Large Dimensional Random Matrices, Ed. 2. now Publishers Inc., Hanover, MA.
    1. Barabasi A. L., Oltvai Z. N., 2004. Network biology: understanding the cell’s functional organization. Nat. Rev. Genet. 5: 101–113. - PubMed
    1. Barton N. H., Coe J. B., 2009. On the application of statistical physics to evolutionary biology. J. Theor. Biol. 259: 317–324. - PubMed
    1. Bataillon T., 2000. Estimation of spontaneous genome-wide mutation rate parameters: Whither beneficial mutations? Heredity 84: 497–501. - PubMed
    1. Baxter G. J., Blythe R. A., McKane A. J., 2007. Exact solution of the multi-allelic diffusion model. Math. Biosci. 209: 124–170. - PubMed

Publication types