Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Aug 17:2023.05.28.542639.
doi: 10.1101/2023.05.28.542639.

The Genetic Landscape of a Metabolic Interaction

Affiliations

The Genetic Landscape of a Metabolic Interaction

Thuy N Nguyen et al. bioRxiv. .

Update in

Abstract

Enzyme abundance, catalytic activity, and ultimately sequence are all shaped by the need of growing cells to maintain metabolic flux while minimizing accumulation of deleterious intermediates. While much prior work has explored the constraints on protein sequence and evolution induced by physical protein-protein interactions, the sequence-level constraints emerging from non-binding functional interactions in metabolism remain unclear. To quantify how variation in the activity of one enzyme constrains the biochemical parameters and sequence of another, we focused on dihydrofolate reductase (DHFR) and thymidylate synthase (TYMS), a pair of enzymes catalyzing consecutive reactions in folate metabolism. We used deep mutational scanning to quantify the growth rate effect of 2,696 DHFR single mutations in 3 TYMS backgrounds under conditions selected to emphasize biochemical epistasis. Our data are well-described by a relatively simple enzyme velocity to growth rate model that quantifies how metabolic context tunes enzyme mutational tolerance. Together our results reveal the structural distribution of epistasis in a metabolic enzyme and establish a foundation for the design of multi-enzyme systems.

Keywords: DHFR; TYMS; deep mutational scan; dihydrofolate reductase; epistasis; genetic interaction; genotype-to-phenotype; thymidylate synthase.

PubMed Disclaimer

Conflict of interest statement

COMPETING INTERESTS The authors have no competing interests to declare.

Figures

Figure 1.
Figure 1.. Constructing a biochemistry-to-growth model for DHFR and TYMS.
a) Schematic describing the relationship between metabolic pathway flux and enzyme velocity. Many enzymes show a hyperbolic relationship between velocity and flux; the enzyme control coefficient describes the fractional change in flux given a fractional change in velocity. Control coefficients vary with the starting enzyme velocity (purple and dark blue arrows, background “A”) and can change with genetic background (violet and light blue arrows, background “B”). Consequently mutations can have a strong effect on flux and growth in one context but not in another (compare violet and purple arrows). b) The role of DHFR and TYMS in folate metabolism. Metabolites are labeled in grey or black italic text. Dotted lines indicate multiple intermediate reactions that are summarized with a single line. c) The relationship between the experimentally measured relative abundance of [10-formyl-THF] and E. coli growth rate. Red points indicate five DHFR variants in the background of TYMS R166Q (a near catalytically inactive variant) and black indicates the same DHFR variants in the context of WT TYMS. Error bars indicate the standard deviation across N=3 replicates for both growth rate (y-axis) and 10-formyl-THF abundance (x-axis). The blue dotted line indicates the best fit for a hyperbolic model (Equation 1) relating THF abundance to growth. d) A simplified, abstracted version of the DHFR and TYMS system. Again dotted lines indicate multiple intermediate reactions that are summarized with a single line. e) The correlation between experimentally measured log10[10-formyl-THF] relative abundance and the model prediction (as computed with Equation 3). The grey dotted line indicates x=y. Color coding is identical to c. Error bars in x indicate the standard deviation across N=3 replicate experimental measurements, error bars in y describe the standard deviation across ten fits obtained by jackknife (leave-one-out) sub-sampling the data and refitting the model. f) The correlation between experimentally measured and predicted growth rates for five DHFR point mutations in two different TYMS backgrounds (same mutants as in c,e). The grey dotted line indicates x=y. Color coding is identical to c. Error bars in x indicate the standard deviation across N=3 replicate experimental measurements, error bars in y describe the standard deviation across ten fits obtained by jackknife (leave-one-out) sub-sampling the data and refitting the model.
Figure 2.
Figure 2.. TYMS catalytic activity modulates the epistatic constraints on DHFR sequence and catalytic activity.
a) Location of the TYMS point mutations (PDBID: 1BID). TYMS functions as an obligate domain-swapped homodimer; active sites include residues from both monomers (white and grey cartoon). Positions mutated in this study are in colored spheres, and indicated with arrows (Q33 – cyan, R127 – navy, R166 – red). The dUMP substrate is in sticks, labeled and colored green. b) Michaelis Menten enzyme kinetics for WT TYMS (black), TYMS Q33S (cyan), and TYMS R127A (navy). Experimental replicates (3 total) are plotted individually. Points indicate experimental data and lines the best fit steady state model. c) Correlation between experimentally measured and model-predicted relative growth rates for seven DHFR variants in four TYMS backgrounds. Each point represents one DHFR/TYMS combination. Error bars in the x direction are SEM across triplicate growth rate measurements, error bars in y are the SEM estimated from jackknife (leave-one-out) sub-sampling the data and refitting the model. d) Heatmaps of simulated growth rates (top row) and epistasis (bottom row) as computed over a range of DHFR kinetic parameters in four TYMS backgrounds. In the left-most column of heatmaps a red star marks the highest activity enzyme (low Km, high kcat), while a yellow star marks the lowest activity enzyme. Growth rates are indicated with a blue-white-red color map, where a relative growth rate of one (white) is equivalent to WT. Epistasis values are indicated with a green-white-pink color map, where zero epistasis is shown in white.
Figure 3.
Figure 3.. The effects of DHFR mutation on growth rate in three TYMS backgrounds.
a) Sequencing-based growth rate measurements for DHFR F31V in three TYMS backgrounds: R166Q (red), Q33S (cyan), and WT (black). Each point represents one triplicate experimental measurement. Dotted lines indicate linear regression fits to each replicate, the slope of each line is the inferred growth rate (relative to WT) for that DHFR/TYMS mutant combination. b) Heatmaps of the growth rate effect for all DHFR single mutations. DHFR positions are along the horizontal axis; amino acid residues (along the vertical axis) are organized by physiochemical similarity. The displayed relative growth rate is an average across three replicates, and is normalized such that the WT DHFR is equal to one. Red indicates mutations that increase growth rate, white indicates mutations with wild-type like growth, and blue indicates mutations that decrease growth rate. Null mutations (black squares) were not observed by sequencing after the first two time points, and thus there was insufficient data for growth rate inference. Small dots mark the WT residue identity in each column. c) The distribution of DHFR mutational effects in the WT TYMS background. The red line indicates a best-fit double gaussian, grey bars are the data. The red, dashed “inactive” line marks the average relative growth rate for nonsense mutations (stop codons) in the first 120 positions of DHFR. The WT DHFR growth rate is equal to one. d) The distribution of DHFR mutational effects in the TYMS Q33S background, color coding identical to (c) e) The distribution of DHFR mutational effects in the TYMS R166Q background, color coding identical to (c). Note that the y-axis for (e) is distinct from (c) and (d).
Figure 4.
Figure 4.. Epistatic coupling of DHFR to two TYMS backgrounds.
a) Volcano plot examining the statistical significance of epistasis across all DHFR point mutations in the Q33S background. P-values were calculated by unequal variance t-test under the null hypothesis that the mutations have equal mean growth rates in both TYMS backgrounds (across triplicate measurements). The red horizontal dashed line marks the standard significance cutoff of P=0.05, the black horizontal dashed line indicates a multiple-hypothesis testing adjusted p-value (P=0.035). The grey vertical dashed lines indicate an empirical threshold for epistasis. Pink and green indicate statistically significant positive and negative epistasis respectively. b) Volcano plot examining the statistical significance of epistasis across all DHFR point mutations in the R166Q background. P-values were calculated as in (a); the multiple-hypothesis testing adjusted p-value for the R166Q background was (P=0.029). Color coding is identical to (a). c) Comparison of the relative growth rate effects for DHFR single mutants in the WT and TYMS Q33S backgrounds. The marginal distribution of growth rate effects is shown along each axis. Mutations with statistically significant positive and negative epistasis are indicated in pink and green respectively. The WT relative growth rate equals one, and is indicated with a dashed grey line across each axis. The dashed red line marks x=y. Error bars (in x and y) indicate standard deviation across three experimental measurements. d) Comparison of the relative growth rate effects for DHFR single mutants in the WT and TYMS R166Q backgrounds. Plot layout and color coding is identical to (c). Error bars (in x and y) indicate standard deviation across three experimental measurements.
Figure 5.
Figure 5.. Global comparison of the biochemistry-to-growth model and deep mutational scanning data set.
a) Correlation between the experimentally measured and predicted growth rates of 114 DHFR/TYMS mutant combinations (circles colored according to TYMS background). Horizontal error bars indicate standard deviation in experimentally measured growth rates across three replicate measurements, vertical error bars are the standard deviation in the predicted growth rates estimated by performing 1000 bootstrap re-samplings (and model fits) of the data. The dashed grey line indicates y=x. b) Correlation between the experimentally measured and model-predicted epistasis, as computed from the growth rate data in (a). Again, color coding indicates TYMS background (identical to a). The dashed grey line indicates y=x. Horizontal error bars indicate standard deviation propagated from the experimentally measured growth rates across three replicate measurements, vertical error bars are the standard deviation in the predicted epistasis values estimated by performing 1000 bootstrap re-samplings (and model fits) of the data. c) Correlation between the experimentally measured and computationally inferred log10(kcat/Km) values for 38 mutants of DHFR. Horizontal error bars describe the standard deviation across triplicate experimental measurements, vertical error bars indicate the standard deviation across 50 iterations of stochastic (Monte-Carlo based) model inference. d) Correlation between experimentally measured and predicted growth rates across the entire deep mutational scanning dataset. The marginal distribution of growth rate effects is shown along each axis. e) Correlation between experimentally measured and predicted epistasis across the entire deep mutational scanning dataset. The marginal distribution of epistatic effects is shown along each axis. f) Mutational tolerance of DHFR as a function of TYMS background. The heatmap shows the fraction of DHFR mutations with growth rates of 0.9 or better as TYMS kcat and Km are discretely varied. Both axes are natural log spaced, TYMS kcat was sampled at 50 points between ln(−2) and ln(2), while TYMS Km was sampled at 50 points between ln(−1) and ln(3). The values for TYMS R166Q, Q33S and WT are marked with red, cyan and black circles respectively. g) A zoomed-in version of (f), focusing on the mutational tolerance of DHFR for TYMS backgrounds similar in velocity to WT and Q33S TYMS.
Figure 6.
Figure 6.. DHFR positions clustered by epistatic mutational effect.
a) Clusters of DHFR positions organized by predominant epistasis type. In each heat map DHFR positions are ordered along the vertical axis; amino acid residues are organized by physiochemical similarity along the horizontal axis. As in earlier plots, green indicates negative epistasis, and pink indicates positive epistasis. Grey pixels mark mutations with statistically insignificant epistasis. b) Structural location of epistatic clusters for DHFR to TYMS Q33S. The DHFR backbone is in grey cartoon (PDBID: 1RX2). Folate, the DHFR substrate is indicated with yellow sticks. The NADP+ cofactor is in green sticks. c) Structural location of epistatic clusters for DHFR to TYMS R166Q. Color coding is identical to panel (b).
Figure 7.
Figure 7.. The structural organization of epistasis in DHFR.
a) Epistasis of individual DHFR positions to TYMS Q33S. The DHFR structure is shown in space filling spheres (PDBID: 1RX2), with the NADP co-factor in green sticks, and folate in yellow sticks. A slice through the structure shows the interior arrangement of epistasis. Positions in the negative epistasis cluster are colored green, positions in the positive epistasis cluster are colored pink. Grey spheres indicate positions in the insignificant epistasis cluster. b) Cumulative distribution of positions in each epistatic cluster by distance to the DHFR active site for the TYMS Q33S background. In this case, active site was defined as the C6 atom of the folate substrate. Color coding follows from (a) c) Epistasis of individual DHFR positions to TYMS R166Q. The DHFR structure is shown in space filling spheres (PDBID: 1RX2), with the NADP co-factor in green sticks, and folate in yellow sticks. A slice through the structure shows the interior arrangement of epistasis. Positions with strong positive epistasis are colored magenta, positions in the positive epistasis cluster are colored pink. Grey spheres indicate positions in the insignificant epistasis cluster. d) Cumulative distribution of positions in each epistatic cluster by distance to the DHFR active site for the TYMS R166Q background. In this case, active site was defined as the C6 atom of the folate substrate. Color coding follows from (c) e) The position-averaged log10 catalytic power across measured mutations. All residues are indicated in space filling and color coded by the average mutational effect. Blue indicates positions where mutations have a deleterious effect on catalytic power (on average), while white indicates mutations that have little to no effect on catalytic power. Again, the NADP co-factor is shown in green sticks, and folate in yellow sticks. A slice through the structure shows the interior distribution of mutational effects on catalysis. f) The structural overlap between positions associated to catalysis and evolutionary conservation. The DHFR backbone is shown in grey cartoon, the NADP co-factor in green sticks, and folate in yellow sticks. Positions where mutations have (on average) a deleterious effect on catalysis at least half a standard deviation below the mean are shown in blue space filling (color coding identical to c). Evolutionarily conserved positions (as computed by the Kullback-Leibler relative entropy in a large alignment of DHFR sequences) are outlined in red mesh.

References

    1. Schober A. F. et al. A two-enzyme adaptive unit within bacterial folate metabolism. Cell Reports 27, 3359–3370.e7 (2019). - PMC - PubMed
    1. Kemble H. E. et al. Flux, toxicity and protein expression costs shape genetic interaction in a metabolic pathway. http://biorxiv.org/lookup/doi/10.1101/362327 (2018) doi:10.1101/362327. - DOI - DOI - PMC - PubMed
    1. Lilja E. E. & Johnson D. R. Metabolite toxicity determines the pace of molecular evolution within microbial populations. BMC Evol Biol 17, 52 (2017). - PMC - PubMed
    1. Noor E. et al. The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization. PLOS Comput. Biol. 12, e1005167 (2016). - PMC - PubMed
    1. Dekel E. & Alon U. Optimality and evolutionary tuning of the expression level of a protein. Nature 436, 588–592 (2005). - PubMed

Publication types