Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Aug 10:9:928534.
doi: 10.3389/fmolb.2022.928534. eCollection 2022.

Deep learning approaches for conformational flexibility and switching properties in protein design

Affiliations
Review

Deep learning approaches for conformational flexibility and switching properties in protein design

Lucas S P Rudden et al. Front Mol Biosci. .

Abstract

Following the hugely successful application of deep learning methods to protein structure prediction, an increasing number of design methods seek to leverage generative models to design proteins with improved functionality over native proteins or novel structure and function. The inherent flexibility of proteins, from side-chain motion to larger conformational reshuffling, poses a challenge to design methods, where the ideal approach must consider both the spatial and temporal evolution of proteins in the context of their functional capacity. In this review, we highlight existing methods for protein design before discussing how methods at the forefront of deep learning-based design accommodate flexibility and where the field could evolve in the future.

Keywords: deep learning; generative models; protein design; protein flexibility; protein switches.

PubMed Disclaimer

Conflict of interest statement

PB holds patents and provisional patent applications in the field of engineered T cell therapies and protein design. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Three types of generative models are generally applied in DL-based protein design: (A) autoencoders/variational autoencoders (AE/VAEs), (B) generative adversarial networks (GANs), and (C) autoregressive models.
FIGURE 2
FIGURE 2
Most DL protein design methods tackle design as either a (A) sequence generation or (B) structure generation problem, each accompanied by the general process outlined here. Examples of both methods used to assess the quality of generated samples and specific DL protein design examples are also indicated.
FIGURE 3
FIGURE 3
(A) Current design methods either: (i) Produce new sequences corresponding to some structure with limited design objective conditioning that could be leveraged for conformational flexibility design. (ii) Produce novel folds that confer some function that must be stabilised through sequence design. Both these approaches are inherently negligent of conformational flexibility. (B) (i) The general goal of DL-based protein switch design is to connect multiple structures to one sequence, with conformational perturbation triggered by some controlled signal. I.e., Given some stimuli (e.g., palatinate peptide), the contacts of a designed sequence in one state (red) shift given some new fold (blue), providing novel functional capacity. This could be achieved through (ii) Conformational landscape optimisation of multiple states given some design objective, similar to Norn et al., (iii) Harnessing of implicit relationships between sequence and multiple structures contained within MSA data, as demonstrated by del Alamo et al. (2022). Here, co-evolving residues (denoted in the coloured blocks) in two different low-depth MSAs make distinct contacts (shown as C1, C2, etc.,) that change the overall fold state.

References

    1. Adeniran A., Stainbrook S., Bostick J. W., Tyo K. E. J. (2018). Detection of a peptide biomarker by engineered yeast receptors. ACS Synth. Biol. 7, 696–705. 10.1021/ACSSYNBIO.7B00410/ASSET/IMAGES/SB-2017-004103_M007 - DOI - PMC - PubMed
    1. Alberstein R. G., Guo A. B., Kortemme T. (2022). Design principles of protein switches. Curr. Opin. Struct. Biol. 72, 71–78. 10.1016/j.sbi.2021.08.004 - DOI - PMC - PubMed
    1. Alford R. F., Leaver-Fay A., Jeliazkov J. R., O’Meara M. J., DiMaio F. P., Park H., et al. (2017). The Rosetta all-atom energy function for macromolecular modeling and design. J. Chem. Theory Comput. 13, 3031–3048. 10.1021/acs.jctc.7b00125 - DOI - PMC - PubMed
    1. Amimeur T., Shaver J. M., Ketchem R. R., Taylor J. A., Clark R. H., Smith J., et al. (2020). Designing feature-controlled humanoid antibody discovery libraries using generative adversarial networks. bioRxiv. 10.1101/2020.04.12.024844 - DOI
    1. Anand N., Eguchi R., Huang P. S. (2019). “Fully differentiable full-atom protein backbone generation,” in Deep generative models for highly structured data, ICLR 2019 Workshop, May 6–9, 2019 (New Orleans, LA: ICLR 2019; ).

LinkOut - more resources