Deep learning approaches for conformational flexibility and switching properties in protein design

Lucas S P Rudden¹, Mahdi Hijazi¹, Patrick Barth¹

PMID: 36032687
PMCID: PMC9399439
DOI: 10.3389/fmolb.2022.928534

Review

Lucas S P Rudden et al. Front Mol Biosci. 2022 Aug 10:9:928534. doi: 10.3389/fmolb.2022.928534. eCollection 2022.

Affiliation

¹ Institute of Bioengineering, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.

Abstract

Following the hugely successful application of deep learning methods to protein structure prediction, an increasing number of design methods seek to leverage generative models to design proteins with improved functionality over native proteins or novel structure and function. The inherent flexibility of proteins, from side-chain motion to larger conformational reshuffling, poses a challenge to design methods, where the ideal approach must consider both the spatial and temporal evolution of proteins in the context of their functional capacity. In this review, we highlight existing methods for protein design before discussing how methods at the forefront of deep learning-based design accommodate flexibility and where the field could evolve in the future.

Keywords: deep learning; generative models; protein design; protein flexibility; protein switches.

Conflict of interest statement

PB holds patents and provisional patent applications in the field of engineered T cell therapies and protein design. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**FIGURE 1**
Three types of generative models are generally applied in DL-based protein design: **(A)** autoencoders/variational autoencoders (AE/VAEs), **(B)** generative adversarial networks (GANs), and **(C)** autoregressive models.
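
The three model families in Figure 1 are usually distinguished by their training objectives. As a rough reminder, the standard textbook forms are sketched below. The notation is an assumption made for illustration and is not taken from the review: $x$ is a protein sequence or structure representation, $z$ a latent variable, $q_\phi$ an encoder, $p_\theta$ a decoder or sequence model, $G$ a generator, $D$ a discriminator, and $L$ the sequence length.

```latex
% (A) VAE: maximise the evidence lower bound (ELBO) on \log p_\theta(x)
\mathcal{L}_{\mathrm{VAE}}(\theta,\phi;x)
  = \mathbb{E}_{q_\phi(z\mid x)}\!\left[\log p_\theta(x\mid z)\right]
  - D_{\mathrm{KL}}\!\left(q_\phi(z\mid x)\,\|\,p(z)\right)

% (B) GAN: generator G and discriminator D play a minimax game
\min_G \max_D \;
  \mathbb{E}_{x\sim p_{\mathrm{data}}}\!\left[\log D(x)\right]
  + \mathbb{E}_{z\sim p(z)}\!\left[\log\bigl(1 - D(G(z))\bigr)\right]

% (C) Autoregressive: factorise the sequence likelihood residue by residue
p_\theta(x_1,\dots,x_L) = \prod_{i=1}^{L} p_\theta(x_i \mid x_1,\dots,x_{i-1})
```

Individual design methods may modify these objectives, for example by conditioning on a backbone structure or on a design objective, so the forms above should be read as generic starting points rather than as the formulation used by any specific method.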

**FIGURE 2**
Most DL protein design methods tackle design as either a **(A)** sequence generation or **(B)** structure generation problem, each accompanied by the general process outlined here. Also indicated are examples of methods used to assess the quality of generated samples and examples of specific DL protein design methods.

**FIGURE 3**
**(A)** Current design methods either (i) produce new sequences for a given structure, with limited design-objective conditioning that could be leveraged for conformational flexibility design, or (ii) produce novel folds that confer some function and must then be stabilised through sequence design. Both approaches inherently neglect conformational flexibility. **(B)** (i) The general goal of DL-based protein switch design is to connect multiple structures to one sequence, with the conformational change triggered by a controlled signal; i.e., given some stimulus (e.g., palatinate peptide), the contacts of a designed sequence in one state (red) shift to those of a new fold (blue), providing novel functional capacity. This could be achieved through (ii) conformational landscape optimisation of multiple states given some design objective, similar to Norn et al., or (iii) harnessing the implicit relationships between sequence and multiple structures contained within MSA data, as demonstrated by del Alamo et al. (2022). Here, co-evolving residues (denoted by the coloured blocks) in two different low-depth MSAs make distinct contacts (shown as C1, C2, etc.) that change the overall fold state.

References

1. Adeniran A., Stainbrook S., Bostick J. W., Tyo K. E. J. (2018). Detection of a peptide biomarker by engineered yeast receptors. ACS Synth. Biol. 7, 696–705. 10.1021/acssynbio.7b00410
2. Alberstein R. G., Guo A. B., Kortemme T. (2022). Design principles of protein switches. Curr. Opin. Struct. Biol. 72, 71–78. 10.1016/j.sbi.2021.08.004
3. Alford R. F., Leaver-Fay A., Jeliazkov J. R., O’Meara M. J., DiMaio F. P., Park H., et al. (2017). The Rosetta all-atom energy function for macromolecular modeling and design. J. Chem. Theory Comput. 13, 3031–3048. 10.1021/acs.jctc.7b00125
4. Amimeur T., Shaver J. M., Ketchem R. R., Taylor J. A., Clark R. H., Smith J., et al. (2020). Designing feature-controlled humanoid antibody discovery libraries using generative adversarial networks. bioRxiv. 10.1101/2020.04.12.024844
5. Anand N., Eguchi R., Huang P. S. (2019). “Fully differentiable full-atom protein backbone generation,” in Deep generative models for highly structured data, ICLR 2019 Workshop, May 6–9, 2019 (New Orleans, LA: ICLR 2019).
