Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 1;17(2):e1009353.
doi: 10.1371/journal.pgen.1009353. eCollection 2021 Feb.

Fitness landscape of a dynamic RNA structure

Affiliations

Fitness landscape of a dynamic RNA structure

Valerie W C Soo et al. PLoS Genet. .

Abstract

RNA structures are dynamic. As a consequence, mutational effects can be hard to rationalize with reference to a single static native structure. We reasoned that deep mutational scanning experiments, which couple molecular function to fitness, should capture mutational effects across multiple conformational states simultaneously. Here, we provide a proof-of-principle that this is indeed the case, using the self-splicing group I intron from Tetrahymena thermophila as a model system. We comprehensively mutagenized two 4-bp segments of the intron. These segments first come together to form the P1 extension (P1ex) helix at the 5' splice site. Following cleavage at the 5' splice site, the two halves of the helix dissociate to allow formation of an alternative helix (P10) at the 3' splice site. Using an in vivo reporter system that couples splicing activity to fitness in E. coli, we demonstrate that fitness is driven jointly by constraints on P1ex and P10 formation. We further show that patterns of epistasis can be used to infer the presence of intramolecular pleiotropy. Using a machine learning approach that allows quantification of mutational effects in a genotype-specific manner, we demonstrate that the fitness landscape can be deconvoluted to implicate P1ex or P10 as the effective genetic background in which molecular fitness is compromised or enhanced. Our results highlight deep mutational scanning as a tool to study alternative conformational states, with the capacity to provide critical insights into the structure, evolution and evolvability of RNAs as dynamic ensembles. Our findings also suggest that, in the future, deep mutational scanning approaches might help reverse-engineer multiple alternative or successive conformations from a single fitness landscape.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Determining the fitness landscape of a dynamic RNA structure.
(A) The sequence and secondary structure of the Tet-119(C20A) group I intron with its 5’ and 3’ exonic context. Secondary structure conformations during sequential formation of P1ex and P10 are highlighted in the blow-ups. The two sub-regions that were subjected to mutagenesis (N2..N5 and N18..N21) are shaded grey. (B) Schematic representation of the knt-intron construct, library generation, and selection protocol. (C) In the presence of kanamycin, self-splicing activity (molecular fitness) of the group I is coupled to organismal fitness as intron removal is required for reconstitution of the knt open reading frame.
Fig 2
Fig 2. Fitness across intron genotypes.
(A) Distribution of fitness effects at 30°C and 37°C. (B) Shannon diversity of intron genotype pools under different conditions. (C) Similarity in genotype pool composition across all replicates and conditions measured as Bray-Curtis (BC) dissimilarity, where BC = 1 indicates maximum dissimilarity between samples. (D) Fitness of intron genotypes at 30°C as a function of Hamming distance (i.e. the number of mutational steps away from the master sequence).
Fig 3
Fig 3. Causes and correlates of variable fitness across intron genotypes.
(A) Fitness weakly correlates with predicted minimum free energy of the intron. For orientation, note that the predicted minimum free energy (ΔG) of the master sequence is -362.8 (B) Fitness varies according to the number of guanosines or cytosines (#GC) in the N2..N5 and N18..N21 regions. (C) Fitness varies as a function of the number of strong or weak base-pairs that could be formed in P1ex assuming that base-pairing follows the established master/wildtype pattern (see Fig 1A). (D) Distribution of pairwise epistasis values for genotypes that are two mutations away from the master sequence (Hamming distance = 2). ε>0 indicates positive epistasis, ε<0 indicates negative epistasis. (E) Pairwise epistasis for genotypes in (D) by position and mutation. Diagrams on the left highlight the N3/N20 couple, where mutations that are predicted to lead to base-pairing are associated with positive epistasis.
Fig 4
Fig 4. Assessing the contribution of individual nucleotide identities to fitness across multiple structural conformations.
(A) Contribution to XGBoost-predicted relative fitness across all intron genotypes, as measured by Shapley’s additive explanation (SHAP) scores, of three example site/nucleotide features. More positive SHAP scores are associated with higher fitness. (B) The average contribution across all genotypes of all individual site/nucleotide features, measured as ΔSHAP = SHAPpresent—SHAPabsent, where SHAPpresent and SHAPabsent correspond to the mean SHAP score of all genotypes where a given nucleotide at a given site is present and absent, respectively. (C) Fitness landscape at 30°C as a function of RNA stability of P1ex and P10 across all genotypes assuming bases are aligned to pair as in the master/wildtype structure (see Fig 1A, Materials and methods). There are 211 unique energy values across all 48 P1ex genotypes. These were consolidated into ten bins of increasing stability for visualization purposes. The 21 unique energy values across 44 P10 genotypes are shown in full as 21 bins of increasing stability. Bar heights correspond to the median fitness in each bin. (D) Fitness as a function of N2/N21 genotype, with a focus on cytosines. (E) Minor groove width associated with different N2/N21 genotypes as determined using molecular dynamics simulations (see Materials and methods). (F) Three overlaid representative conformations of the P1/P1ex helix (randomly sampled from the final 50 ns of each simulation) for the master sequence and the C2/C21 genotype.
Fig 5
Fig 5. Asymmetric fitness effects across the N2..N5 and N18..N21 sub-regions.
(A) Proportion of gains in the model (see main text) contributed by site/nucleotide identity features at N2..N5 and N18..N21. The solid line corresponds to the mean contribution made by a sub-region across 100 random samples, where individual gains are randomly shuffled across site/nucleotide identity features. Dashed lines correspond to 95% confidence intervals. (B) Pairwise epistasis for double mutants where both mutations are located in N18..N21 (orange), both mutations are located in N2..N5 (brown), or N2..N5 and N18..N21 carry one mutation each (grey). (C) The correlation of fitness effects (γ) of intron mutants at various mutational distances from the master sequence.

References

    1. Chen Y, Carlini DB, Baines JF, Parsch J, Braverman JM, Tanda S, et al. RNA secondary structure and compensatory evolution. Genes Genet Syst. 1999;74: 271–286. 10.1266/ggs.74.271 - DOI - PubMed
    1. Umu SU, Poole AM, Dobson RC, Gardner PP, Adelman K. Avoidance of stochastic RNA interactions can be harnessed to control protein expression levels in bacteria and archaea. eLife. eLife Sciences Publications Limited; 2016;5: e13479 10.7554/eLife.13479 - DOI - PMC - PubMed
    1. Pitt JN, Ferré-D'Amaré AR. Rapid Construction of Empirical RNA Fitness Landscapes. Science. American Association for the Advancement of Science; 2010;330: 376–379. 10.1126/science.1192001 - DOI - PMC - PubMed
    1. Petrie KL, Joyce GF. Limits of Neutral Drift: Lessons From the In Vitro Evolution of Two Ribozymes. J Mol Evol. Springer US; 2014;79: 75–90. 10.1007/s00239-014-9642-z - DOI - PMC - PubMed
    1. Hayden EJ, Ferrada E, Wagner A. Cryptic genetic variation promotes rapid evolutionary adaptation in an RNA enzyme. Nature. Nature Publishing Group; 2011;474: 92–95. 10.1038/nature10083 - DOI - PubMed

Publication types