Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 4;27(6):1041-1051.e8.
doi: 10.1016/j.str.2019.03.014. Epub 2019 Apr 18.

Flexible Backbone Assembly and Refinement of Symmetrical Homomeric Complexes

Affiliations

Flexible Backbone Assembly and Refinement of Symmetrical Homomeric Complexes

Shourya S Roy Burman et al. Structure. .

Abstract

Symmetrical homomeric proteins are ubiquitous in every domain of life, and information about their structure is essential to decipher function. The size of these complexes often makes them intractable to high-resolution structure determination experiments. Computational docking algorithms offer a promising alternative for modeling large complexes with arbitrary symmetry. Accuracy of existing algorithms, however, is limited by backbone inaccuracies when using homology-modeled monomers. Here, we present Rosetta SymDock2 with a broad search of symmetrical conformational space using a six-dimensional coarse-grained score function followed by an all-atom flexible-backbone refinement, which we demonstrate to be essential for physically realistic modeling of tightly packed complexes. In global docking of a benchmark set of complexes of different point symmetries-starting from homology-modeled monomers-we successfully dock (defined as predicting three near-native structures in the five top-scoring models) 17 out of 31 cyclic complexes and 3 out of 12 dihedral complexes.

Keywords: Rosetta; protein docking; symmetry.

PubMed Disclaimer

Conflict of interest statement

Declaration of Interests

J.J.G. is an unpaid board member of the Rosetta Commons. Under institutional participation agreements between the University of Washington, acting on behalf of the Rosetta Commons, Johns Hopkins University may be entitled to a portion of revenue received on licensing Rosetta software, which may include methods described in this paper. As a member of the Scientific Advisory Board of Cyrus Biotechnology, J.J.G. is granted stock options. Cyrus Biotechnology distributes the Rosetta software, which may include methods described in this paper.

Figures

Figure 1.
Figure 1.. Comparison of energy landscapes in all-atom phase and coarse-grained phase.
Score versus RMSD plots for two representative complexes, Rhamnulose-1-phosphate aldolase (A, B and C) and snRNP Sm-like protein (C, D and E) for 5,000 models generated from global docking of homology-modeled monomers (black, grey circles) and 100 models generated by re-docking of bound subunits (triangles). Coarse-grained energy landscapes with Motif Dock Score (MDS) (C and F) resemble the all-atom energy landscapes (A and D), but those with Centroid Score (B and E) do not. Starting from the homology-modeled monomers, none of the 50 top-scoring models generated using Centroid Score are within 5 Å RMSD. All the 100 topscoring models generated using MDS are under 3 Å RMSD. When re-docking bound subunits, closest models generated using Centroid Score (B and E) have 1.9 Å RMSD and high relative scores in both cases. Bound re-docking with MDS (C and F) produces over 80% of the models docked to within 1 Å RMSD in both cases. These sub-angstrom re-docked models also score more favorably than all docking models made using homology-modeled monomers. Hence, Centroid Score does not recognize the energy well near the native conformation, whereas MDS does.
Figure 2.
Figure 2.. Fixed-backbone refinement is insufficient to enter narrow binding funnel.
(A) Coarse-grained score versus RMSD (after coarse-grained phase) plots for Xenopus Nucleophosmin for 5,000 models. Models are colored by their interface score after fixed-backbone refinement. Almost all models under 5 Å RMSD have a positive interface after fixed-backbone refinement arising from minor clashes due to the introduction of side chains, despite repacking. Consequently, these models are discarded. (B) Interface score versus RMSD (after full-protocol) plots for Xenopus Nucleophosmin. A rapid drop in interface score between 0.6 and 0.4 Å RMSD leads to an energy funnel with steep slope (dashed line) of 423 Å−1 and a radius of 0.25 Å. (C) Conceptual representation of the energy landscape near the binding funnel for heterodimers. The funnel is comparatively shallow with local minima near it. (D) Conceptual representation of the energy landscape near the binding funnel for homodimers as seen by symmetrical docking protocols. The funnel is narrow and steep with no comparable local minima.
Figure 3.
Figure 3.. Flexible-backbone refinement improves docking performance.
Comparison of bootstrapped averages of the number of near-native structures in the set of 2,500 docking models using: [white] the homology models (HM) and fixed-backbone refinement, [light grey] homology models supplemented with an ensemble of 150 pregenerated backbone conformations (HM+Ens) and [dark grey] fixed-backbone refinement, and the homology models and flexible-backbone refinement after the coarse-grained phase (A) and after the full protocol (B). Starting with 150 additional backbone conformations generated without the context of the complex improves docking performance for 6 out of 8 complexes, but makes it worse for 1 complex. Starting with just the homology models and performing flexible-backbone refinement leads to improvements in 7 out of 8 complexes after the coarse-grained phase and after the full protocol. After flexible-backbone refinement, more than 75% of the top-scoring models were near-native for 6 out of the 8 complexes.
Figure 4.
Figure 4.. Rosetta SymDock2 compares favorably with Rosetta SymDock on various assessment metrics.
Comparison of bootstrapped-averaged metrics for 43 individual complexes (31 cyclic complexes [triangle] and 12 dihedral complexes [diamond]) both after the coarse-grained phase (A and B) and after the full protocol (C) shows significant performance gains. All complexes (points) above the diagonal line are improved in SymDock2. (A) Comparison of fold-enrichment of near-native models in the 1% top-scoring models, 〈E1%〉 on a log-log scale shows a higher enrichment in 15 cyclics and 4 dihedrals and a lower value in 5 cyclics and 0 dihedrals. Complexes to the right of the vertical dashed line are enriched in SymDock, and complexes above the horizontal dashed line are enriched in SymDock2. (B and C) Comparison of number of near-native models in the five top-scoring models, 〈N5〉, shows marked improvements both after the coarse-grained phase (B) and after the full protocol (C). Areas above and below the dashed lines indicate cases where the two methods differ significantly, i.e. by more than 1 model on average. SymDock2 has significant improvements in 13 cyclics and 1 dihedral complex after the coarse-grained phase, and most importantly, in 15 cyclics and 4 dihedrals after the full protocol. 2 cyclic complexes were modeled significantly worse with SymDock2.
Figure 5.
Figure 5.. On average, Rosetta SymDock and SymDock2 have similar per-decoy runtimes in the benchmark.
Comparison of average time per decoy on a log-log plot demonstrates similar scaling with complex size and symmetry for SymDock (×) and SymDock2 (+). Despite having a slower all-atom refinement phase, no complex had a more than a 60% overhead with SymDock2. For the two methods, run times were within ± 20% for 31 out of 43 complexes.

Similar articles

Cited by

  • Novel sampling strategies and a coarse-grained score function for docking homomers, flexible heteromers, and oligosaccharides using Rosetta in CAPRI rounds 37-45.
    Roy Burman SS, Nance ML, Jeliazkov JR, Labonte JW, Lubin JH, Biswas N, Gray JJ. Roy Burman SS, et al. Proteins. 2020 Aug;88(8):973-985. doi: 10.1002/prot.25855. Epub 2019 Dec 3. Proteins. 2020. PMID: 31742764 Free PMC article.
  • Macromolecular modeling and design in Rosetta: recent methods and frameworks.
    Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RY, Watkins A, Zimmerman L, Bonneau R. Leman JK, et al. Nat Methods. 2020 Jul;17(7):665-680. doi: 10.1038/s41592-020-0848-2. Epub 2020 Jun 1. Nat Methods. 2020. PMID: 32483333 Free PMC article. Review.
  • Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment.
    Lensink MF, Brysbaert G, Mauri T, Nadzirin N, Velankar S, Chaleil RAG, Clarence T, Bates PA, Kong R, Liu B, Yang G, Liu M, Shi H, Lu X, Chang S, Roy RS, Quadir F, Liu J, Cheng J, Antoniak A, Czaplewski C, Giełdoń A, Kogut M, Lipska AG, Liwo A, Lubecka EA, Maszota-Zieleniak M, Sieradzan AK, Ślusarz R, Wesołowski PA, Zięba K, Del Carpio Muñoz CA, Ichiishi E, Harmalkar A, Gray JJ, Bonvin AMJJ, Ambrosetti F, Vargas Honorato R, Jandova Z, Jiménez-García B, Koukos PI, Van Keulen S, Van Noort CW, Réau M, Roel-Touris J, Kotelnikov S, Padhorny D, Porter KA, Alekseenko A, Ignatov M, Desta I, Ashizawa R, Sun Z, Ghani U, Hashemi N, Vajda S, Kozakov D, Rosell M, Rodríguez-Lumbreras LA, Fernandez-Recio J, Karczynska A, Grudinin S, Yan Y, Li H, Lin P, Huang SY, Christoffer C, Terashi G, Verburgt J, Sarkar D, Aderinwale T, Wang X, Kihara D, Nakamura T, Hanazono Y, Gowthaman R, Guest JD, Yin R, Taherzadeh G, Pierce BG, Barradas-Bautista D, Cao Z, Cavallo L, Oliva R, Sun Y, Zhu S, Shen Y, Park T, Woo H, Yang J, Kwon S, Won J, Seok C, Kiyota Y, Kobayashi S, Harada Y, Takeda-Shitaka M, Kundrotas PJ, Singh A, Vakser IA, Dapkūnas J, Olechnovič K, Venclovas Č, Duan R, Qiu L, Xu X, Zhang S, Zou X, Wodak … See abstract for full author list ➔ Lensink MF, et al. Proteins. 2021 Dec;89(12):1800-1823. doi: 10.1002/prot.26222. Epub 2021 Sep 13. Proteins. 2021. PMID: 34453465 Free PMC article.
  • Protein oligomer modeling guided by predicted interchain contacts in CASP14.
    Baek M, Anishchenko I, Park H, Humphreys IR, Baker D. Baek M, et al. Proteins. 2021 Dec;89(12):1824-1833. doi: 10.1002/prot.26197. Epub 2021 Aug 23. Proteins. 2021. PMID: 34324224 Free PMC article.
  • Multiple allostery in the regulation of PDGFR beta kinase activities.
    Zhang Y, Wang M, Shao G, Shang Q, Dong M, Qin X, Mi LZ. Zhang Y, et al. Acta Biochim Biophys Sin (Shanghai). 2024 Nov 29;57(3):344-355. doi: 10.3724/abbs.2024205. Acta Biochim Biophys Sin (Shanghai). 2024. PMID: 39623946 Free PMC article.

References

    1. Alford RF, Leaver-Fay A, Jeliazkov JR, O’Meara MJ, DiMaio FP, Park H, Shapovalov MV, Renfrew PD, Mulligan VK, Kappel K, et al. (2017). The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design. J. Chem. Theory Comput. 13, 3031–3048. - PMC - PubMed
    1. Aloy P, Ceulemans H, Stark A, and Russell RB (2003). The Relationship Between Sequence and Interaction Divergence in Proteins. J. Mol. Biol. 332, 989–998. - PubMed
    1. André I, Bradley P, Wang C, and Baker D (2007). Prediction of the structure of symmetrical protein assemblies. Proc. Natl. Acad. Sci. 104, 17656–17661. - PMC - PubMed
    1. Baek M, Park T, Heo L, Park C, and Seok C (2017). GalaxyHomomer: a web server for protein homo-oligomer structure prediction from a monomer sequence or structure. Nucleic Acids Res. 45, W320–W324. - PMC - PubMed
    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, and Bourne PE (2000). The Protein Data Bank. Nucleic Acids Res. 28, 235–242. - PMC - PubMed

Publication types

LinkOut - more resources