Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan;33(1):268-80.
doi: 10.1093/molbev/msv211. Epub 2015 Oct 6.

Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1

Affiliations

Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1

Matteo Figliuzzi et al. Mol Biol Evol. 2016 Jan.

Abstract

The quantitative characterization of mutational landscapes is a task of outstanding importance in evolutionary and medical biology: It is, for example, of central importance for our understanding of the phenotypic effect of mutations related to disease and antibiotic drug resistance. Here we develop a novel inference scheme for mutational landscapes, which is based on the statistical analysis of large alignments of homologs of the protein of interest. Our method is able to capture epistatic couplings between residues, and therefore to assess the dependence of mutational effects on the sequence context where they appear. Compared with recent large-scale mutagenesis data of the beta-lactamase TEM-1, a protein providing resistance against beta-lactam antibiotics, our method leads to an increase of about 40% in explicative power as compared with approaches neglecting epistasis. We find that the informative sequence context extends to residues at native distances of about 20 Å from the mutated site, reaching thus far beyond residues in direct physical contact.

Keywords: coevolution; epistasis; genotype–phenotype mapping; mutational landscape; statistical inference.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.
Fig. 1.
Pipeline of the mutational-landscape prediction: The homologous Pfam family containing the protein of interest (the Beta-lactamase2 family PF13354 in the case of TEM-1) is used to construct a global statistical model using the DCA. This model allows to score mutations by differences in the inferred genotype-to-phenotype mapping between the mutant and the wild-type amino acid sequence. This score, which is expected to incorporate (co-)evolutionary constraints acting across the entire family, is used as a predictor of the phenotypic effects of single (or few) amino acid substitutions in the protein of interest.
F<sc>ig</sc>. 2.
Fig. 2.
Context dependence of mutational effects: (A) Procedure of including all residues within a maximal native distance dmax into the prediction of the mutational effects of the residue of interest (labeled i in the figure). This leads to residue-specific subalignments, which consist of columns, which are not necessarily consecutive, but connected in 3D. The results are given in (B). The main figure shows the correlation R2 between MIC data and our predictions, as a function of the cutoff distance dmax. The inset shows the average fraction of residues included into the reduced MSAs, again in dependence of dmax.
F<sc>ig</sc>. 3.
Fig. 3.
R2 between experimental fitness and predicted fitness for the following features: Independent-residue model (IND), DCA, SIFT (SIFT), Polyphen-2 (Poly), PoPMuSiC (PoP), I-Mutant2.0(sequence+structure) (Imut+), MUpro (MUpro), I-Mutant2.0 (Imut), molecular simulations (SIM), relative solvent accessibility (RSA), and Blosum62 substitution matrix (BLO).
F<sc>ig</sc>. 4.
Fig. 4.
Statistical scores and thermodynamic stabilities. (A) Scatter plot of the log odd ratio ΔϕDCA versus the experimental fitness μexp, stabilizing mutations mentioned in the text are highlighted in red. The highest scoring mutations are M182T, G92D, H153R, L201P and E147G, all reported as stabilizing. (B) ΔϕDCA for a smaller set of single mutations is now plotted versus the change in Gibbs Free Energy relative to wild type ΔΔGΔGmutΔGwt, as measured by four independent studies (Ref1, Ref2, Ref3 and Ref4 are Kather et al. 2008, Raquet et al. 1995, Wang et al. 2002, and Deng et al. 2012, respectively).
F<sc>ig</sc>. 5.
Fig. 5.
Protein-specific amino acid substitution effects in TEM-1: Amino acid substitution effects, averaged over experimental measurements (A), DCA predictions (B), and extracted from BLOSUM62 (C). Blue squares correspond to nearly neutral mutations (log MIC > 5.3), whereas yellow squares correspond to highly deleterious mutations (log MIC < 2.6). White squares are used for unobserved substitutions. The histogram in (D) shows R2 between averaged computational and experimental amino acid substitution effects.

References

    1. Abriata LA, Palzkill T, Dal Peraro M. 2015. How structural and physicochemical determinants shape sequence constraints in a functional enzyme. PLoS One 10(2):e0118684 - PMC - PubMed
    1. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. 2010. A method and server for predicting damaging missense mutations. Nat Methods. 7(4):248–249. - PMC - PubMed
    1. Ambler R, Coulson A, Frère J-M, Ghuysen J-M, Joris B, Forsman M, Levesque R, Tiraby G, Waley S. 1991. A standard numbering scheme for the class a beta-lactamases. Biochem J. 276(Pt 1):269. - PMC - PubMed
    1. Ashenberg O, Gong LI, Bloom JD. 2013. Mutational effects on stability are largely conserved during protein evolution. Proc Natl Acad Sci U S A. 110(52):21071–21076. - PMC - PubMed
    1. Baldassi C, Zamparo M, Feinauer C, Procaccini A, Zecchina R, Weigt M, Pagnani A. 2014. Fast and accurate multivariate Gaussian modeling of protein families: predicting residue contacts and protein-interaction partners. PLoS One 9(3):e92721. - PMC - PubMed

Publication types