Deep convolutional networks for quality assessment of protein folds
- PMID: 29931128
- DOI: 10.1093/bioinformatics/bty494
Deep convolutional networks for quality assessment of protein folds
Abstract
Motivation: The computational prediction of a protein structure from its sequence generally relies on a method to assess the quality of protein models. Most assessment methods rank candidate models using heavily engineered structural features, defined as complex functions of the atomic coordinates. However, very few methods have attempted to learn these features directly from the data.
Results: We show that deep convolutional networks can be used to predict the ranking of model structures solely on the basis of their raw three-dimensional atomic densities, without any feature tuning. We develop a deep neural network that performs on par with state-of-the-art algorithms from the literature. The network is trained on decoys from the CASP7 to CASP10 datasets and its performance is tested on the CASP11 dataset. Additional testing on decoys from the CASP12, CAMEO and 3DRobot datasets confirms that the network performs consistently well across a variety of protein structures. While the network learns to assess structural decoys globally and does not rely on any predefined features, it can be analyzed to show that it implicitly identifies regions that deviate from the native structure.
Availability and implementation: The code and the datasets are available at https://github.com/lamoureux-lab/3DCNN_MQA.
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
Protein model accuracy estimation based on local structure quality assessment using 3D convolutional neural network.PLoS One. 2019 Sep 5;14(9):e0221347. doi: 10.1371/journal.pone.0221347. eCollection 2019. PLoS One. 2019. PMID: 31487288 Free PMC article.
-
DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781. Bioinformatics. 2018. PMID: 29228185 Free PMC article.
-
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan. PLoS Comput Biol. 2017. PMID: 28056090 Free PMC article.
-
GalaxyWater-CNN: Prediction of Water Positions on the Protein Structure by a 3D-Convolutional Neural Network.J Chem Inf Model. 2022 Jul 11;62(13):3157-3168. doi: 10.1021/acs.jcim.2c00306. Epub 2022 Jun 24. J Chem Inf Model. 2022. PMID: 35749367 Review.
-
MASS: predict the global qualities of individual protein models using random forests and novel statistical potentials.BMC Bioinformatics. 2020 Jul 6;21(Suppl 4):246. doi: 10.1186/s12859-020-3383-3. BMC Bioinformatics. 2020. PMID: 32631256 Free PMC article. Review.
Cited by
-
Protein model accuracy estimation based on local structure quality assessment using 3D convolutional neural network.PLoS One. 2019 Sep 5;14(9):e0221347. doi: 10.1371/journal.pone.0221347. eCollection 2019. PLoS One. 2019. PMID: 31487288 Free PMC article.
-
Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning.BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):700. doi: 10.1186/s12859-019-3275-6. BMC Bioinformatics. 2019. PMID: 31874615 Free PMC article.
-
KekuleScope: prediction of cancer cell line sensitivity and compound potency using convolutional neural networks trained on compound images.J Cheminform. 2019 Jun 19;11(1):41. doi: 10.1186/s13321-019-0364-5. J Cheminform. 2019. PMID: 31218493 Free PMC article.
-
Recent advances and challenges in protein complex model accuracy estimation.Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec. Comput Struct Biotechnol J. 2024. PMID: 38707538 Free PMC article. Review.
-
Improved protein structure refinement guided by deep learning based accuracy estimation.Nat Commun. 2021 Feb 26;12(1):1340. doi: 10.1038/s41467-021-21511-x. Nat Commun. 2021. PMID: 33637700 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous