Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 3:11:e75751.
doi: 10.7554/eLife.75751.

Sampling alternative conformational states of transporters and receptors with AlphaFold2

Affiliations

Sampling alternative conformational states of transporters and receptors with AlphaFold2

Diego Del Alamo et al. Elife. .

Abstract

Equilibrium fluctuations and triggered conformational changes often underlie the functional cycles of membrane proteins. For example, transporters mediate the passage of molecules across cell membranes by alternating between inward- and outward-facing states, while receptors undergo intracellular structural rearrangements that initiate signaling cascades. Although the conformational plasticity of these proteins has historically posed a challenge for traditional de novo protein structure prediction pipelines, the recent success of AlphaFold2 (AF2) in CASP14 culminated in the modeling of a transporter in multiple conformations to high accuracy. Given that AF2 was designed to predict static structures of proteins, it remains unclear if this result represents an underexplored capability to accurately predict multiple conformations and/or structural heterogeneity. Here, we present an approach to drive AF2 to sample alternative conformations of topologically diverse transporters and G-protein-coupled receptors that are absent from the AF2 training set. Whereas models of most proteins generated using the default AF2 pipeline are conformationally homogeneous and nearly identical to one another, reducing the depth of the input multiple sequence alignments by stochastic subsampling led to the generation of accurate models in multiple conformations. In our benchmark, these conformations spanned the range between two experimental structures of interest, with models at the extremes of these conformational distributions observed to be among the most accurate (average template modeling score of 0.94). These results suggest a straightforward approach to identifying native-like alternative states, while also highlighting the need for the next generation of deep learning algorithms to be designed to predict ensembles of biophysically relevant states.

Keywords: G-protein-coupled receptors; conformational dynamics; molecular biophysics; none; protein structure prediction; structural biology; transporters.

PubMed Disclaimer

Conflict of interest statement

Dd, DS, HM, JM No competing interests declared

Figures

Figure 1.
Figure 1.. Alternative conformations of transporters and G-protein-coupled receptors (GPCRs) can be predicted by AlphaFold2 (AF2).
(A) Representative models of the transporter LAT1 in inward-facing (IF) and outward-facing (OF) conformations. Experimental structures shown in gray and models shown in colors. (B) Comparison of AF2 models with inactive/active or IF/OF experimental structures as a function of multiple sequence alignment (MSA) depth for GPCRs (top) and transporters (bottom), respectively. All models shown here were generated without templates. Dashed lines indicate the template modeling (TM) score between experimental structures and are shown for reference. (C) Supplementing shallow MSAs with OF templates allows AF2 to predict the OF conformation of MCT1. (D) Experimental structures superimposed over models with the greatest TM scores. Inactive/IF and active/OF cartoons shown on the top and bottom in teal and orange, respectively.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Example principal component analysis (PCA) of ASCT2 models generated by AlphaFold2 (AF2) containing outlier models.
Colors correspond to multiple sequence alignment (MSA) depth in sequences, while orange and teal diamonds refer to the outward- and inward-facing experimental structures, respectively. Left: outlier models are indicated by gray arrows and could be clearly delineated along PC2. Right: removal of these models reveal collective variables along PC2.
Figure 1—figure supplement 2.
Figure 1—figure supplement 2.. Conformational homogeneity as a function of multiple sequence alignment (MSA) depth.
Conformational homogeneity was defined by classifying models as more similar to one of the two experimental structures based on their template modeling (TM) scores and calculating the fraction of models in the larger group. MCT1 and proteins in the training set (CCR5, MurJ, PfMATE, and SERT) were biased toward one specific conformation (uniformity = 1) even when using shallow MSAs. X-Axis shown on log-2 scale for clarity. Models generated using MSAs with 16 sequences were omitted from this figure.
Figure 1—figure supplement 3.
Figure 1—figure supplement 3.. Templates contribute to conformational sampling only when shallow multiple sequence alignments (MSAs) are provided.
(A) PTH1R. (B) LAT1.
Figure 1—figure supplement 4.
Figure 1—figure supplement 4.. Protein targets with one conformation in the training set cannot be predicted in the alternative conformation.
The following experimental structures served as references for the training and the alternative conformations, respectively: 5UIWr and 7F1Qr (CCR5), 5T77a and 6NC9a (MurJ), 3VVNa and 6FHZa (PfMATE), and 5I6Xa and 6DZZa (SERT).
Figure 2.
Figure 2.. Comparison between the movement undergone by each Cα atom between the two superimposed experimental structures and their root mean square fluctuation (RMSF) values among AlphaFold2 (AF2) models.
Residues with low confidence (pLDDT ≤75) were omitted from this plot for all proteins except PTH1R. Multiple sequence alignment (MSA) sizes of 128 sequences were used for all predictions, except for MCT1 with templates, which instead used 32 sequences to capture the outward-facing (OF) conformation. pLDDT refers to each residue’s predicted accuracy, with a value of 100 indicating maximum confidence.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Comparison between the movement undergone by each Cα atom between the two superimposed experimental structures of PTH1R and LAT1 and their root mean square fluctuation (RMSF) values among AlphaFold2 (AF2) models generated with templates.
Figure 3.
Figure 3.. Distinct conformations can be delineated using principal component analysis (PCA).
(A) Conformational heterogeneity in AlphaFold2 (AF2) models of LAT1. Experimental inward-facing (IF) and outward-facing (OF) conformations shown in teal and orange, respectively, while the gallery of AF2 models generated using 128 sequences are shown in gray. (B) Distribution of AF2 models generated using multiple sequence alignments (MSAs) with 32 or more sequences across the first principal component (PC1) following PCA (gray traces). Scatter plots comparing each model’s position along PC1 and its structural similarity to experimentally determined structures. Teal: similarity to IF (transporters) or inactive (G-protein-coupled receptors, GPCRs) conformation. Orange: similarity to OF (transporters) or active (GPCRs) conformation. Each model is shown twice, once in teal and once in orange. Native structures are shown as black dots.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Models sampled by AlphaFold2 (AF2) in multiple conformations cannot be fully explained by linear interpolation of two end structures.
Principal component analysis (PCA) of AF2 models, with colors corresponding to the depth of the multiple sequence alignments (MSAs) used for prediction. Experimental structures in an inward-facing or inactive state are shown as teal diamonds, while structures in an outward-facing or active state are shown as orange diamonds.
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. Example predictions of the adhesion G-protein-coupled receptor (aGPCR) GPR114/ADGRG5.
Top: kernel density estimate of the first principal component (PC1) following principal component analysis (PCA on all AlphaFold2 (AF2)) models. Bottom: comparison of PC1 and template modeling (TM) score values; alignments were measured from the AF2 database model, which is indicated by the green dot. Right: models extracted from the cluster centers appear to adopt three distinct conformations.

Comment in

  • doi: 10.7554/eLife.78549

References

    1. Akdel M, Pires DEV, Porta Pardo E, Jänes J, Zalevsky AO, Mészáros B, Bryant P, Good LL, Laskowski RA, Pozzati G, Shenoy A, Zhu W, Kundrotas P, Ruiz Serra V, Rodrigues CHM, Dunham AS, Burke D, Borkakoti N, Velankar S, Frost A, Lindorff-Larsen K, Valencia A, Ovchinnikov S, Durairaj J, Ascher DB, Thornton JM, Davey NE, Stein A, Elofsson A, Croll TI, Beltrao P. A structural biology community assessment of AlphaFold 2 applications. bioRxiv. 2021 doi: 10.1101/2021.09.26.461876. - DOI - PMC - PubMed
    1. Bavnhøj L, Paulsen PA, Flores-Canales JC, Schiøtt B, Pedersen BP. Molecular mechanism of sugar transport in plants unveiled by structures of glucose/H symporter STP10. Nature Plants. 2021;7:1409–1419. doi: 10.1038/s41477-021-00992-0. - DOI - PubMed
    1. Boehr DD, Nussinov R, Wright PE. The role of dynamic conformational ensembles in biomolecular recognition. Nature Chemical Biology. 2009;5:789–796. doi: 10.1038/nchembio.232. - DOI - PMC - PubMed
    1. Campbell E, Kaltenbach M, Correy GJ, Carr PD, Porebski BT, Livingstone EK, Afriat-Jurnou L, Buckle AM, Weik M, Hollfelder F, Tokuriki N, Jackson CJ. The role of protein dynamics in the evolution of new enzyme function. Nature Chemical Biology. 2016;12:944–950. doi: 10.1038/nchembio.2175. - DOI - PubMed
    1. Claxton DP, Jagessar KL, Mchaourab HS. Principles of Alternating Access in Multidrug and Toxin Extrusion (MATE) Transporters. Journal of Molecular Biology. 2021;433:166959. doi: 10.1016/j.jmb.2021.166959. - DOI - PMC - PubMed

Publication types

LinkOut - more resources