Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 6;113(49):14085-14090.
doi: 10.1073/pnas.1612676113. Epub 2016 Nov 18.

On the (un)predictability of a large intragenic fitness landscape

Affiliations

On the (un)predictability of a large intragenic fitness landscape

Claudia Bank et al. Proc Natl Acad Sci U S A. .

Abstract

The study of fitness landscapes, which aims at mapping genotypes to fitness, is receiving ever-increasing attention. Novel experimental approaches combined with next-generation sequencing (NGS) methods enable accurate and extensive studies of the fitness effects of mutations, allowing us to test theoretical predictions and improve our understanding of the shape of the true underlying fitness landscape and its implications for the predictability and repeatability of evolution. Here, we present a uniquely large multiallelic fitness landscape comprising 640 engineered mutants that represent all possible combinations of 13 amino acid-changing mutations at 6 sites in the heat-shock protein Hsp90 in Saccharomyces cerevisiae under elevated salinity. Despite a prevalent pattern of negative epistasis in the landscape, we find that the global fitness peak is reached via four positively epistatic mutations. Combining traditional and extending recently proposed theoretical and statistical approaches, we quantify features of the global multiallelic fitness landscape. Using subsets of the data, we demonstrate that extrapolation beyond a known part of the landscape is difficult owing to both local ruggedness and amino acid-specific epistatic hotspots and that inference is additionally confounded by the nonrandom choice of mutations for experimental fitness landscapes.

Keywords: adaptation; epistasis; evolution; fitness landscape; mutagenesis.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Individual amino acid substitutions and their effect on the parental background in elevated salinity, obtained from 1,000 samples from the posterior distribution of growth rates. Boxes represent the interquartile range [i.e., the 50% confidence interval (CI)], whiskers extend to the highest/lowest data point within the box ±1.5 times the interquartile range, and circles represent outliers; gray and white background shading alternates by amino acid position. The box below indicates with colored dots which mutations are involved in the focal landscapes discussed throughout the main text: the four mutations leading from the parental type to the global optimum (“opt”), the four individually most beneficial mutations on the parental background (“best”), and the four mutations with the individually lowest growth rates on the parental background (“worst”). (Inset) Parental sequence at positions 582–590 and assessed amino acids by position.
Fig. 2.
Fig. 2.
(A) Empirical fitness landscape by mutational distance to parental type. Each line represents a single substitution. Vertical lines appear when multiple alleles have been screened at the same position. There is a global pattern of negative epistasis. The three focal landscapes are highlighted. (B) Close-up on the beneficial portion of the landscape. (C) Expected (“Exp.”) versus observed (“Obs.”) fitness for the focal landscapes, obtained from 1,000 posterior samples. We observe strong positive epistasis in the landscape that contains the global optimum, whereas the other two are dominated by negative epistasis. In AC, the y axis depicts growth rate as a proxy for fitness.
Fig. 3.
Fig. 3.
(A) Distribution of average lengths of adaptive walks starting from any type in the full landscape (i.e., absorbing times of the Markov chain). The red line indicates mean path length for adaptive walks from the parental genotype. (B and C) Distribution of absorbing probabilities, that is, the probability to reach a specific optimum starting from a given genotype computed for all genotypes in the dataset. The red line corresponds to the respective probability when starting from the parental sequence. The global optimum (B) is, in general, reached with a very high probability, but there are starting points from which it is poorly accessible.
Fig. 4.
Fig. 4.
(A) Expected pattern of landscape-wide epistasis measure γd (SI Appendix, Eq. S1_15) with mutational distance for theoretical fitness landscapes with 6 loci from ref. . (B) Observed decay of γd with mutational distance under the drop-one approach is quite homogenous, except when hot spot mutation 588P is removed; 95% CIs are contained within the lines for the global landscape. (C) Observed decay of γd for all diallelic six-locus sublandscapes. Depending on the underlying mutations, γd is vastly different, suggesting qualitative differences in the topography of the underlying fitness landscape and in the extent of additivity in the landscape. Three focal landscapes representative of different types and γd for the full landscape have been highlighted. (D) Observed decay of γd for all diallelic four-locus sublandscapes containing the parental type, indicative of locally different landscape topographies. Highlighted are the three focal landscapes. (Insets) Histograms of the roughness-to-slope ratio r/s for the respective subset, with horizontal lines indicating values for highlighted landscapes [r/s for global landscape (blue) is significantly different from HoC expectation, p=0.01]. Similar to γd, there is a huge variation in r/s for subparts of the fitness landscape, especially when considering the four-locus subsets.
Fig. 5.
Fig. 5.
(A) Epistasis measure γ(Ai,Bi)(Aj,Bj) (SI Appendix, Eq. S1_8; (Ai,Bi) on the x axis) between any two substitutions, averaged across the entire landscape. The majority of interactions are small to moderate (blue). Parts of the fitness landscape show highly localized and mutation-specific epistasis, ranging from strong magnitude epistasis (white) to sign and reciprocal sign epistasis (yellow). (B) The average epistatic effect γAi (SI Appendix, Eq. S1_10) of a mutation occurring on any background is always small. (C) The average epistatic effect γAj (SI Appendix, Eq. S1_13) of a background on any new mutation is usually small, except for mutations to *588P, which shows a strong magnitude effect. (D) Locus-specific gamma for the four mutations leading to the global optimum (Top), the four largest single-effect mutations (Middle), and the single-effect mutations with the lowest fitness (Bottom). The opt landscape exhibits strong sign epistasis between loci 587 and 588 and between loci 588 and 589. Also, the best landscape exhibits pervasive epistasis with sign epistasis between locus 588 and loci 585 and 586, respectively. We observe almost no epistasis in the worst landscape.

References

    1. Wright S. The roles of mutation, inbreeding, crossbreeding, and selection in evolution. In: Jones DF, editor. Proceedings of the Sixth International Congress of Genetics. Brooklyn Botanic Garden; Menasha, WI: 1932. pp. 356–366.
    1. Coyne JA, Barton NH, Turelli M. Perspective: A critique of Sewall Wright’s shifting balance theory of evolution. Evolution. 1997;51(3):643–671. - PubMed
    1. Gavrilets S. 2004. Fitness Landscapes and the Origin of Species. (Princeton Univ Press, Princeton)
    1. de Visser JAGM, Krug J. Empirical fitness landscapes and the predictability of evolution. Nat Rev Genet. 2014;15(7):480–490. - PubMed
    1. Costanzo M, et al. The genetic landscape of a cell. Science. 2010;327(5964):425–431. - PMC - PubMed

Publication types

Substances

LinkOut - more resources