Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep;45(9):e2300057.
doi: 10.1002/bies.202300057. Epub 2023 Jul 11.

Fluid protein fold space and its implications

Affiliations

Fluid protein fold space and its implications

Lauren L Porter. Bioessays. 2023 Sep.

Abstract

Fold-switching proteins, which remodel their secondary and tertiary structures in response to cellular stimuli, suggest a new view of protein fold space. For decades, experimental evidence has indicated that protein fold space is discrete: dissimilar folds are encoded by dissimilar amino acid sequences. Challenging this assumption, fold-switching proteins interconnect discrete groups of dissimilar protein folds, making protein fold space fluid. Three recent observations support the concept of fluid fold space: (1) some amino acid sequences interconvert between folds with distinct secondary structures, (2) some naturally occurring sequences have switched folds by stepwise mutation, and (3) fold switching is evolutionarily selected and likely confers advantage. These observations indicate that minor amino acid sequence modifications can transform protein structure and function. Consequently, proteomic structural and functional diversity may be expanded by alternative splicing, small nucleotide polymorphisms, post-translational modifications, and modified translation rates.

Keywords: alternative splicing; metamorphic proteins; protein evolution; protein fold switching; protein folding.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The single-fold hypothesis and fold-switching observations suggest different models of protein fold space. a) The classical single-fold hypothesis suggests a discrete fold space in which homologous sequences encode very similar fold architectures, and different folds are encoded by disparate protein sequences. Consistent with this assumption, structural basins (contours) encoding distinct fold architectures (green: α+β, yellow: mainly β, blue: mainly α, and red: α/β) contain homologous sequences that each assume the same architecture (dots within the contours). Architectures are represented by XCL1 (chemokine fold, green, PDB ID: 1J8I), the C-terminal domain of RfaH (β-roll fold, yellow; its N-terminal domain is gray, PDB ID: 6C6S, chain D), the human-serum-albumin-binding streptococcal protein GA (3-α-helix bundle, blue, PDB ID: 2FS1, chain A), and the immunoglobulin G (IgG)-binding streptococcal protein GB (α/β-grasp fold, red, PDB ID: 1GB1, chain A). b) Fold switching suggests that fold space is more fluid than previously realized because homologous amino acid sequences can assume multiple folds with different secondary structure arrangements. Contours from the all β-sheet alternative fold of XCL1 (green, PDB ID: 2N54) overlap with the all β-sheet fold of RfaH (yellow, PDB ID: 6C6S, chain D). These overlapping contours are shaded light green since they contain folds with the same major architecture. Furthermore, contours from the all α-helical alternative fold of RfaH (yellow, PDB ID: 5OND, chain A) overlap with contours of the 3-α-helical bundle fold, shaded purple to represent overlap between all-α and α/β folds resulting from engineered GA and GB variants with extremely high levels of sequence identity but different folds. Accordingly, the α/β-grasp fold is shaded magenta also to reflect this overlap. Both plots were calculated by performing multidimensional scaling [22] on pairwise structural differences [23] between all folds and plotting the resulting contour maps and datapoints (Supplementary Text). Structures in all figures made with PyMOL [24].
Figure 2.
Figure 2.
Fold-switching proteins can have different mechanisms and energy landscapes. (a) The mycobacterial protein PimA populates two distinct folds in nearly equal proportions under physiological conditions. Fold-switching regions of PimA are colored dark and light green (PDB IDs: 4N9W, chain A and 4NC9, chain A), respectively. Regions undergoing the largest conformational changes are enlarged below. In the dark green conformation (left), α2 and β3 switch to β2 and α3 in the light green conformation (right), respectively, and the orientations of the β-sheets change from parallel (dark green) to antiparallel (light green). (b) The bacterial plant pathogen PopP2 switches folds in response to the small molecule trigger InsP6. The fold-switching regions of its apo structure (cyan, left, PDB ID: 7F3N, chain A) switch to the dark blue structures upon binding Insp6 (right, PDB ID: 5W3T, chain A). Most notably, its long helix in the apo form switches to a β-sheet upon binding InsP6 (orange). This structural change allows its regulatory domain (pink) to assume a more ordered conformation and carry out its acetyltransferase function. (c) Most single-fold proteins, such as ubiquitin (Red, PDB ID: 1D3Z), have one deep energy well that allows for minor conformational fluctuations while the same fold is maintained. The energy landscapes of equilibrium fold switchers, such as PimA, have wells with nearly equal depths separated by a larger energy barrier. Finally, the energy landscapes of triggered fold switchers change in response to environment. Without Insp6, PopP2’s Apo conformation was calculated to be ~3 kcal/mol more favorable than its Insp6-bound fold [56]. Thus, its two energy wells have unequal depths. Their relative depths shift, however, upon binding Insp6, and the alternative fold becomes more favorable. All energy wells are for illustrative purposes only and do not quantitatively represent the true energy landscapes of these proteins.
Figure 3.
Figure 3.
Biological processes that could trigger fold switching. Alternative splicing. A β-strand (yellow) of the short C2A Piccolo domain is displaced with an alternatively spliced exon (red) in the long form (PDB ID: 1RH8), changing its calcium binding mechanism and affinity. The short form is represented by synaptotagmin (PDB ID: 1RHY), a homolog with calcium binding properties similar to the short form of C2A Piccolo. Single nucleotide polymorphisms. The C-terminal helix of human MEF2b is switched to a b-strand by D83V, the most common non-Hodgkin-lymphoma-associated SNP in this protein. N- and C-termini are labeled for clarity. Post-translational modifications. Could hypothetically induce fold switching. A potential modification (yellow circle) is shown on the right figure. Altered cotranslational folding rates could also lead to changes in protein folds, as shown with the ω subunit of RNA polymerase, which is disordered when translated quickly but folds when translated slowly (black, PDB ID: 3EQL, chain O). N- and C-termini are labeled for clarity. Proposed triggers of fold switching with gray background are indirectly supported and hypothetical.
Figure 4.
Figure 4.
Examples of experimental methods that have identified or could identify fold switching. (a). Circular dichroism spectroscopy was recently used to distinguish between single-folding (salmon) and fold-switching (teal) NusG proteins with different ground state folds [32]. This approach was successful because the secondary structures of their C-terminal domains have very different ground states: all β-sheet (single fold, salmon) and all α-helix (fold switch, teal). Structurally conserved N-terminal domains are gray. (b) Hydrogen exchange monitored by mass spectrometry was recently used to distinguish between two distinct conformations of the SARS-CoV-2 spike protein [106]. It has the potential to also distinguish between fold-switching proteins such as a recently engineered protein that switches from an α/β-plait (blue) to an all α-helical (red) fold as temperature decreases [76].

References

    1. Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL… Bateman A. 2021. Pfam: The protein families database in 2021. Nucleic Acids Res 49: D412–D9. - PMC - PubMed
    1. Anfinsen CB. 1973. Principles that govern the folding of protein chains. Science 181: 223–30. - PubMed
    1. Rost B. 1999. Twilight zone of protein sequence alignments. Protein Eng 12: 85–94. - PubMed
    1. Berlow RB, Dyson HJ, Wright PE. 2015. Functional advantages of dynamic protein disorder. FEBS Lett 589: 2433–40. - PMC - PubMed
    1. van der Lee R, Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M…Babu MM. 2014. Classification of intrinsically disordered regions and proteins. Chem Rev 114: 6589–631. - PMC - PubMed

Publication types

LinkOut - more resources