Multistate and functional protein design using RoseTTAFold sequence space diffusion
- PMID: 39322764
- PMCID: PMC12339374
- DOI: 10.1038/s41587-024-02395-w
Multistate and functional protein design using RoseTTAFold sequence space diffusion
Erratum in
-
Publisher Correction: Multistate and functional protein design using RoseTTAFold sequence space diffusion.Nat Biotechnol. 2025 Aug;43(8):1384. doi: 10.1038/s41587-024-02456-0. Nat Biotechnol. 2025. PMID: 39375454 Free PMC article. No abstract available.
Abstract
Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent-child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence-activity data, providing a general approach for integrated computational and experimental optimization of protein function.
© 2024. The Author(s).
Conflict of interest statement
Competing interests: The authors declare no competing interests.
Figures
References
-
- Winnifrith, A., Outeiral, C. & Hie, B. Generative artificial intelligence for de novo protein design. Preprint at arXiv10.48550/arXiv.2310.09685 (2023). - PubMed
-
- Notin, P., Rollins, N., Gal, Y., Sander, C. & Marks, D. Machine learning for functional protein design. Nat. Biotechnol.42, 216–228 (2024). - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
