Data-driven computational protein design
- PMID: 33910104
- PMCID: PMC8405559
- DOI: 10.1016/j.sbi.2021.03.009
Data-driven computational protein design
Abstract
Computational protein design can generate proteins not found in nature that adopt desired structures and perform novel functions. Although proteins could, in theory, be designed with ab initio methods, practical success has come from using large amounts of data that describe the sequences, structures, and functions of existing proteins and their variants. We present recent creative uses of multiple-sequence alignments, protein structures, and high-throughput functional assays in computational protein design. Approaches range from enhancing structure-based design with experimental data to building regression models to training deep neural nets that generate novel sequences. Looking ahead, deep learning will be increasingly important for maximizing the value of data for protein design.
Copyright © 2021 Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declaration of Interests
V. Frappier is employed by Generate Biomedicines, a protein design company.
Figures
References
-
- Dahiyat BI, Mayo SL: De Novo Protein Design: Fully Automated Sequence Selection. Science (80-) 1997, 278:82–87. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
