Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Jul;17(7):665-680.
doi: 10.1038/s41592-020-0848-2. Epub 2020 Jun 1.

Macromolecular modeling and design in Rosetta: recent methods and frameworks

Julia Koehler Leman #  1   2 Brian D Weitzner #  3   4   5   6 Steven M Lewis #  7   8   9 Jared Adolf-Bryfogle  10 Nawsad Alam  11 Rebecca F Alford  3 Melanie Aprahamian  12 David Baker  4   5 Kyle A Barlow  13 Patrick Barth  14   15 Benjamin Basanta  4   16 Brian J Bender  17 Kristin Blacklock  18 Jaume Bonet  14   19 Scott E Boyken  5   6 Phil Bradley  20 Chris Bystroff  21 Patrick Conway  4 Seth Cooper  22 Bruno E Correia  14   19 Brian Coventry  4 Rhiju Das  23 René M De Jong  24 Frank DiMaio  4   5 Lorna Dsilva  22 Roland Dunbrack  25 Alexander S Ford  4 Brandon Frenz  5   9 Darwin Y Fu  26 Caleb Geniesse  23 Lukasz Goldschmidt  4 Ragul Gowthaman  27   28 Jeffrey J Gray  3   29 Dominik Gront  30 Sharon Guffy  7 Scott Horowitz  31   32 Po-Ssu Huang  4 Thomas Huber  33 Tim M Jacobs  34 Jeliazko R Jeliazkov  29 David K Johnson  35 Kalli Kappel  36 John Karanicolas  25 Hamed Khakzad  19   37   38 Karen R Khar  9   35 Sagar D Khare  18   39   40   41   42 Firas Khatib  43 Alisa Khramushin  11 Indigo C King  4   9 Robert Kleffner  22 Brian Koepnick  4 Tanja Kortemme  44 Georg Kuenze  26   45 Brian Kuhlman  7 Daisuke Kuroda  46   47 Jason W Labonte  3   48 Jason K Lai  15 Gideon Lapidoth  49 Andrew Leaver-Fay  7 Steffen Lindert  12 Thomas Linsky  4   5 Nir London  11 Joseph H Lubin  3 Sergey Lyskov  3 Jack Maguire  34 Lars Malmström  19   37   38   50 Enrique Marcos  4   51 Orly Marcu  11 Nicholas A Marze  3 Jens Meiler  45   52   53 Rocco Moretti  26 Vikram Khipple Mulligan  54   4   5 Santrupti Nerli  55 Christoffer Norn  49 Shane Ó'Conchúir  44 Noah Ollikainen  44 Sergey Ovchinnikov  4   5   56 Michael S Pacella  3 Xingjie Pan  44 Hahnbeom Park  4 Ryan E Pavlovicz  4   5   9 Manasi Pethe  40   41 Brian G Pierce  27   28 Kala Bharath Pilla  33 Barak Raveh  11 P Douglas Renfrew  54 Shourya S Roy Burman  3 Aliza Rubenstein  18   42 Marion F Sauer  57 Andreas Scheck  14   19 William Schief  10 Ora Schueler-Furman  11 Yuval Sedan  11 Alexander M Sevy  57 Nikolaos G Sgourakis  58 Lei Shi  4   5 Justin B Siegel  59   60   61 Daniel-Adriano Silva  4 Shannon Smith  26 Yifan Song  4   5   9 Amelie Stein  44 Maria Szegedy  39 Frank D Teets  7 Summer B Thyme  4 Ray Yu-Ruei Wang  4 Andrew Watkins  23 Lior Zimmerman  11 Richard Bonneau  62   63   64   65
Affiliations
Review

Macromolecular modeling and design in Rosetta: recent methods and frameworks

Julia Koehler Leman et al. Nat Methods. 2020 Jul.

Abstract

The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:. Capabilities of the Rosetta macromolecular modeling suite
Some popular tasks that can be addressed in Rosetta (blue) and major systems that can be modeled (red). Note this is an incomplete list of Rosetta’s broad modeling capabilities.
Figure 2:
Figure 2:. Main elements of Rosetta are scoring and sampling
(A) Three main elements are required in a Rosetta protocol. The Pose is the biomolecule, such as a protein, RNA, DNA, small molecule, or glycan, in a specific conformation. Residues in the Pose can be selected via ResidueSelectors and the behavior for side-chain optimization or mutation can be defined by TaskOperations. Specific Movers then control how the conformation of the Pose is changed, and the new conformation is subsequently evaluated by a ScoreFunction. The Metropolis criterion decides whether the new conformation is accepted during sampling. Many independent sampling trajectories are generated, and the final models are evaluated based on the purpose of the protocol. (B) The score function consists of a weighted linear combination of various score terms, highlighted in the figure and described above.
Figure 3:
Figure 3:. Rosetta can successfully address diverse biological questions
(A) Curved β-sheet design: overlay of the designed homo-dimeric curved β-sheet (dcs-E_4_dim_cav3) in rainbow and the crystal structure in gray (PDBID 5u35). The protein is designed de novo and features a curved β-sheet, a large pocket, and a homodimer interface. (B) Parametric design: overlay of the de novo designed macrocycle 3H1 in blue and the NMR structure in gray (PDBID 5v2g). This “CovCore” (covalent core) miniprotein is held together covalently by a hydrophobic cross-linker at its core (in red for the design and gray for the NMR structure). (C) PyTXMS: the interactome of M1 protein (virulence factor of Group A streptococcus) and 15 human plasma proteins on the surface of bacteria (peptidoglycan layer (dark green), and the membrane (brown)). This 1.8MDa structure contains over 200 chemical cross-links and is measured in a complex mixture of intact bacteria and human plasma. All models are provided by Rosetta: M1 protein (gray), IgG (red), four fibrinogens (dark to light blue), six albumins (dark to light pink), coagulation factor XIII A [F13A] (purple), C4bPa (cyan), haptoglobin [HP] (brown), and alpha-1-antitrypsin [SerpinA1] (plum). (D) RosettaSurface: model of an LK-α peptide (LKKLLKLLKKLLKL with a periodicity of 3.5 assuming a helical conformation) on a hydrophilic self-assembled monolayer surface. The peptide is unstructured in solution and assumes helical structure when on the surface, as experiments show. (E) RosettaCarbohydrate: flexible docking of a carbohydrate antigen to an antibody. The crystal structure is in gray (PDBID 1mfa) and the model in blue, with the carbohydrate in green. Antibody coordinates were taken from the PDB and glycan coordinates started from a randomized backbone conformation and rigid-body orientation. (F) PIPER-FlexPepDock: high-resolution model of a peptide-protein complex (model: blue; solved structure in gray, PDBID 1mfg). The model was generated from a peptide sequence (LDVPV, derived from the C-terminal tail of ErbB2R) and the unbound structure of the receptor (Erbin PDZ domain, PDBID 2h3l, colored in red).
Figure 4:
Figure 4:. User interfaces to the codebase
(A) Rosetta can be run from a terminal and offers three interfaces to the codebase. The top panel outlines the task to be accomplished: making two mutations in a protein and then refining the structure. The panels underneath show how this task can be accomplished in the different interfaces. The command line panel shows the executable, input files and options to run two specific applications. RosettaScripts is an XML-based scripting language that offers more flexibility by combining Movers and ScoreFunctions into a custom Protocol. PyRosetta offers direct access to the underlying code objects but requires knowledge of the codebase. (B) Point-and-click interfaces to the codebase. InteractiveRosetta is a graphical user-interface (GUI) to PyRosetta. It offers controls to the most popular protocols, file formats and options. Foldit is a videogame primarily used to crowd-source real-world scientific puzzles but can also be used on custom proteins of interest. It can run some popular applications via a game interface. ROSIE hosts a multitude of servers each executing a particular protocol. It currently includes servers for 21 Rosetta methods. [The InteractiveRosetta and Foldit panels were originally published in and under Creative Commons licenses that allows reproduction as is.]
Figure 5:
Figure 5:. Main external documentation page
In 2015, our community performed a complete overhaul of our documentation. Documentation is now hosted on a Gollum wiki, which is version controlled and easily editable by members of our community. Accessibility and ability to edit the documentation has drastically improved the user-experience of the software.

References

    1. Schrodinger - Biologics Design. at <https://www.schrodinger.com/science-articles/biologics-design>
    1. Molecular Operating Environment (MOE) | MOEsaic | PSILO. at <https://www.chemcomp.com/Products.htm>
    1. Ref. Dassault Systèmes BIOVIA, Discovery Studio Modeling Environment, Release 2017, San Diego: Dassault Systèmes, 2016. at <https://www.3dsbiovia.com/products/collaborative-science/biovia-discover...>
    1. Steinegger M, Meier M, Mirdita M, Vöhringer H, Haunsberger SJ & Söding J HH-suite3 for fast remote homology detection and deep protein annotation. BMC Bioinformatics 20, 473 (2019). - PMC - PubMed
    1. Vu O, Mendenhall J, Altarawy D & Meiler J BCL::Mol2D—a robust atom environment descriptor for QSAR modeling and lead optimization. J. Comput. Aided. Mol. Des. 33, 477–486 (2019). - PMC - PubMed

Publication types

Grants and funding