BWM*: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design
- PMID: 26744898
- PMCID: PMC4904165
- DOI: 10.1089/cmb.2015.0194
BWM*: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design
Abstract
Sparse energy functions that ignore long range interactions between residue pairs are frequently used by protein design algorithms to reduce computational cost. Current dynamic programming algorithms that fully exploit the optimal substructure produced by these energy functions only compute the GMEC. This disproportionately favors the sequence of a single, static conformation and overlooks better binding sequences with multiple low-energy conformations. Provable, ensemble-based algorithms such as A* avoid this problem, but A* cannot guarantee better performance than exhaustive enumeration. We propose a novel, provable, dynamic programming algorithm called Branch-Width Minimization* (BWM*) to enumerate a gap-free ensemble of conformations in order of increasing energy. Given a branch-decomposition of branch-width w for an n-residue protein design with at most q discrete side-chain conformations per residue, BWM* returns the sparse GMEC in O([Formula: see text]) time and enumerates each additional conformation in merely O([Formula: see text]) time. We define a new measure, Total Effective Search Space (TESS), which can be computed efficiently a priori before BWM* or A* is run. We ran BWM* on 67 protein design problems and found that TESS discriminated between BWM*-efficient and A*-efficient cases with 100% accuracy. As predicted by TESS and validated experimentally, BWM* outperforms A* in 73% of the cases and computes the full ensemble or a close approximation faster than A*, enumerating each additional conformation in milliseconds. Unlike A*, the performance of BWM* can be predicted in polynomial time before running the algorithm, which gives protein designers the power to choose the most efficient algorithm for their particular design problem.
Keywords: OSPREY; branch-decomposition; dynamic programming; ensemble-based algorithms; protein design; provable algorithms; sparse residue interaction graphs.
Figures






Similar articles
-
A critical analysis of computational protein design with sparse residue interaction graphs.PLoS Comput Biol. 2017 Mar 30;13(3):e1005346. doi: 10.1371/journal.pcbi.1005346. eCollection 2017 Mar. PLoS Comput Biol. 2017. PMID: 28358804 Free PMC article.
-
BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces.J Comput Biol. 2018 Jul;25(7):726-739. doi: 10.1089/cmb.2017.0267. Epub 2018 Mar 13. J Comput Biol. 2018. PMID: 29641249 Free PMC article.
-
Minimization-Aware Recursive K*: A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape.J Comput Biol. 2020 Apr;27(4):550-564. doi: 10.1089/cmb.2019.0315. Epub 2019 Dec 6. J Comput Biol. 2020. PMID: 31855059 Free PMC article.
-
Molecular flexibility in computational protein design: an algorithmic perspective.Protein Eng Des Sel. 2021 Feb 15;34:gzab011. doi: 10.1093/protein/gzab011. Protein Eng Des Sel. 2021. PMID: 33959778 Review.
-
Sparse RNA folding revisited: space-efficient minimum free energy structure prediction.Algorithms Mol Biol. 2016 Apr 23;11:7. doi: 10.1186/s13015-016-0071-y. eCollection 2016. Algorithms Mol Biol. 2016. PMID: 27110275 Free PMC article. Review.
Cited by
-
Fast gap-free enumeration of conformations and sequences for protein design.Proteins. 2015 Oct;83(10):1859-1877. doi: 10.1002/prot.24870. Epub 2015 Aug 24. Proteins. 2015. PMID: 26235965 Free PMC article.
-
Algorithms for protein design.Curr Opin Struct Biol. 2016 Aug;39:16-26. doi: 10.1016/j.sbi.2016.03.006. Epub 2016 Apr 14. Curr Opin Struct Biol. 2016. PMID: 27086078 Free PMC article. Review.
-
RESISTOR: A New OSPREY Module to Predict Resistance Mutations.J Comput Biol. 2022 Dec;29(12):1346-1352. doi: 10.1089/cmb.2022.0254. Epub 2022 Sep 13. J Comput Biol. 2022. PMID: 36099194 Free PMC article.
-
LUTE (Local Unpruned Tuple Expansion): Accurate Continuously Flexible Protein Design with General Energy Functions and Rigid Rotamer-Like Efficiency.J Comput Biol. 2017 Jun;24(6):536-546. doi: 10.1089/cmb.2016.0136. Epub 2016 Sep 28. J Comput Biol. 2017. PMID: 27681371 Free PMC article.
-
Protocol for Designing De Novo Noncanonical Peptide Binders in OSPREY.J Comput Biol. 2024 Oct;31(10):965-974. doi: 10.1089/cmb.2024.0669. Epub 2024 Oct 4. J Comput Biol. 2024. PMID: 39364612
References
-
- Dechter R., and Mateescu R. 2007. AND/OR search spaces for graphical models. Artif. Intell. 171, 73–106
-
- Desmet J., Spriet J., and Lasters I. 2002. Fast and accurate side-chain topology and energy refinement (FASTER) as a new method for protein structure optimization. Proteins 48, 31–43 - PubMed
-
- Donald B.R. 2011. Algorithms in Structural Molecular Biology. MIT Press, New York
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous