End-to-End Differentiable Learning of Protein Structure
- PMID: 31005579
- PMCID: PMC6513320
- DOI: 10.1016/j.cels.2019.03.006
End-to-End Differentiable Learning of Protein Structure
Abstract
Predicting protein structure from sequence is a central challenge of biochemistry. Co-evolution methods show promise, but an explicit sequence-to-structure map remains elusive. Advances in deep learning that replace complex, human-designed pipelines with differentiable models optimized end to end suggest the potential benefits of similarly reformulating structure prediction. Here, we introduce an end-to-end differentiable model for protein structure learning. The model couples local and global protein structure via geometric units that optimize global geometry without violating local covalent chemistry. We test our model using two challenging tasks: predicting novel folds without co-evolutionary data and predicting known folds without structural templates. In the first task, the model achieves state-of-the-art accuracy, and in the second, it comes within 1-2 Å; competing methods using co-evolution and experimental templates have been refined over many years, and it is likely that the differentiable approach has substantial room for further improvement, with applications ranging from drug discovery to protein design.
Keywords: biophysics; co-evolution; deep learning; geometric deep learning; homology modeling; machine learning; protein design; protein folding; protein structure prediction; structural biology.
Copyright © 2019 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of Interests:
The author declares no competing interests.
Figures





Similar articles
-
Protein sequence-to-structure learning: Is this the end(-to-end revolution)?Proteins. 2021 Dec;89(12):1770-1786. doi: 10.1002/prot.26235. Epub 2021 Sep 22. Proteins. 2021. PMID: 34519095 Review.
-
Toward the solution of the protein structure prediction problem.J Biol Chem. 2021 Jul;297(1):100870. doi: 10.1016/j.jbc.2021.100870. Epub 2021 Jun 11. J Biol Chem. 2021. PMID: 34119522 Free PMC article. Review.
-
State-of-the-art web services for de novo protein structure prediction.Brief Bioinform. 2021 May 20;22(3):bbaa139. doi: 10.1093/bib/bbaa139. Brief Bioinform. 2021. PMID: 34020540 Review.
-
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan. PLoS Comput Biol. 2017. PMID: 28056090 Free PMC article.
-
Machine learning in protein structure prediction.Curr Opin Chem Biol. 2021 Dec;65:1-8. doi: 10.1016/j.cbpa.2021.04.005. Epub 2021 May 18. Curr Opin Chem Biol. 2021. PMID: 34015749 Review.
Cited by
-
Recent advancements in enzyme-mediated crosslinkable hydrogels: In vivo-mimicking strategies.APL Bioeng. 2021 Apr 1;5(2):021502. doi: 10.1063/5.0037793. eCollection 2021 Jun. APL Bioeng. 2021. PMID: 33834154 Free PMC article. Review.
-
NanoNet: Rapid and accurate end-to-end nanobody modeling by deep learning.Front Immunol. 2022 Aug 12;13:958584. doi: 10.3389/fimmu.2022.958584. eCollection 2022. Front Immunol. 2022. PMID: 36032123 Free PMC article.
-
Metabolomics and modelling approaches for systems metabolic engineering.Metab Eng Commun. 2022 Oct 15;15:e00209. doi: 10.1016/j.mec.2022.e00209. eCollection 2022 Dec. Metab Eng Commun. 2022. PMID: 36281261 Free PMC article. Review.
-
Learning Correlations between Internal Coordinates to Improve 3D Cartesian Coordinates for Proteins.J Chem Theory Comput. 2023 Jul 25;19(14):4689-4700. doi: 10.1021/acs.jctc.2c01270. Epub 2023 Feb 7. J Chem Theory Comput. 2023. PMID: 36749957 Free PMC article.
-
Inverse design of 3d molecular structures with conditional generative neural networks.Nat Commun. 2022 Feb 21;13(1):973. doi: 10.1038/s41467-022-28526-y. Nat Commun. 2022. PMID: 35190542 Free PMC article.
References
-
- Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, et al. (2016). TensorFlow: A system for large-scale machine learning In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 265–283.
-
- Alain G, and Bengio Y (2016). Understanding intermediate layers using linear classifier probes. ArXiv:1610.01644 [Cs, Stat].
-
- AlQuraishi M (2019a). Parallelized Natural Extension Reference Frame: Parallelized Conversion from Internal to Cartesian Coordinates. Journal of Computational Chemistry 40, 885–892. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources