Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
- PMID: 30985027
- PMCID: PMC6800999
- DOI: 10.1002/prot.25697
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13
Abstract
Predicting residue-residue distance relationships (eg, contacts) has become the key direction to advance protein structure prediction since 2014 CASP11 experiment, while deep learning has revolutionized the technology for contact and distance distribution prediction since its debut in 2012 CASP10 experiment. During 2018 CASP13 experiment, we enhanced our MULTICOM protein structure prediction system with three major components: contact distance prediction based on deep convolutional neural networks, distance-driven template-free (ab initio) modeling, and protein model ranking empowered by deep learning and contact prediction. Our experiment demonstrates that contact distance prediction and deep learning methods are the key reasons that MULTICOM was ranked 3rd out of all 98 predictors in both template-free and template-based structure modeling in CASP13. Deep convolutional neural network can utilize global information in pairwise residue-residue features such as coevolution scores to substantially improve contact distance prediction, which played a decisive role in correctly folding some free modeling and hard template-based modeling targets. Deep learning also successfully integrated one-dimensional structural features, two-dimensional contact information, and three-dimensional structural quality scores to improve protein model quality assessment, where the contact prediction was demonstrated to consistently enhance ranking of protein models for the first time. The success of MULTICOM system clearly shows that protein contact distance prediction and model selection driven by deep learning holds the key of solving protein structure prediction problem. However, there are still challenges in accurately predicting protein contact distance when there are few homologous sequences, folding proteins from noisy contact distances, and ranking models of hard targets.
Keywords: contact prediction; deep learning; distance prediction; protein model quality assessment; protein structure prediction; template-based modeling; template-free modeling.
© 2019 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals, Inc.
Figures










Similar articles
-
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27. Proteins. 2022. PMID: 34291486 Free PMC article.
-
Analysis of distance-based protein structure prediction by deep learning in CASP13.Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13. Proteins. 2019. PMID: 31471916
-
The MULTICOM Protein Structure Prediction Server Empowered by Deep Learning and Contact Distance Prediction.Methods Mol Biol. 2020;2165:13-26. doi: 10.1007/978-1-0716-0708-4_2. Methods Mol Biol. 2020. PMID: 32621217
-
Recent Progress of Protein Tertiary Structure Prediction.Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832. Molecules. 2024. PMID: 38398585 Free PMC article. Review.
-
Protein Structure Prediction: Conventional and Deep Learning Perspectives.Protein J. 2021 Aug;40(4):522-544. doi: 10.1007/s10930-021-10003-y. Epub 2021 May 28. Protein J. 2021. PMID: 34050498 Review.
Cited by
-
Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14.Sci Rep. 2021 May 25;11(1):10943. doi: 10.1038/s41598-021-90303-6. Sci Rep. 2021. PMID: 34035363 Free PMC article.
-
Deep Learning in Proteomics.Proteomics. 2020 Nov;20(21-22):e1900335. doi: 10.1002/pmic.201900335. Epub 2020 Oct 30. Proteomics. 2020. PMID: 32939979 Free PMC article. Review.
-
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.bioRxiv [Preprint]. 2023 Mar 12:2023.03.08.531814. doi: 10.1101/2023.03.08.531814. bioRxiv. 2023. Update in: Proteins. 2023 Dec;91(12):1889-1902. doi: 10.1002/prot.26542. PMID: 36945536 Free PMC article. Updated. Preprint.
-
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27. Proteins. 2022. PMID: 34291486 Free PMC article.
-
Decoy selection for protein structure prediction via extreme gradient boosting and ranking.BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):189. doi: 10.1186/s12859-020-3523-9. BMC Bioinformatics. 2020. PMID: 33297949 Free PMC article.
References
-
- Abriata LA, Tamò GE, Monastyrskyy B, Kryshtafovych A, Dal Peraro M. Assessment of hard target modeling in CASP12 reveals an emerging role of alignment‐based contact prediction methods. Proteins. 2018;86:97‐112. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous