This is a preprint.
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
- PMID: 36945536
- PMCID: PMC10028888
- DOI: 10.1101/2023.03.08.531814
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
Update in
-
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.Proteins. 2023 Dec;91(12):1889-1902. doi: 10.1002/prot.26542. Epub 2023 Jun 26. Proteins. 2023. PMID: 37357816 Free PMC article.
Abstract
Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the quality of protein tertiary structural models, but it has been rarely applied to predicting the quality of quaternary structural models. Moreover, the pairwise similarity approach often fails when many structural models are of low quality and similar to each other. To address the gap, we developed a hybrid method (MULTICOM_qa) combining a pairwise similarity score (PSS) and an interface contact probability score (ICPS) based on the deep learning inter-chain contact prediction for estimating protein complex model accuracy. It blindly participated in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 and ranked first out of 24 predictors in estimating the global accuracy of assembly models. The average per-target correlation coefficient between the model quality scores predicted by MULTICOM_qa and the true quality scores of the models of CASP15 assembly targets is 0.66. The average per-target ranking loss in using the predicted quality scores to rank the models is 0.14. It was able to select good models for most targets. Moreover, several key factors (i.e., target difficulty, model sampling difficulty, skewness of model quality, and similarity between good/bad models) for EMA are identified and analayzed. The results demonstrate that combining the multi-model method (PSS) with the complementary single-model method (ICPS) is a promising approach to EMA. The source code of MULTICOM_qa is available at https://github.com/BioinfoMachineLearning/MULTICOM_qa .
Figures









Similar articles
-
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15.Proteins. 2023 Dec;91(12):1889-1902. doi: 10.1002/prot.26542. Epub 2023 Jun 26. Proteins. 2023. PMID: 37357816 Free PMC article.
-
Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15.bioRxiv [Preprint]. 2023 May 18:2023.05.16.541055. doi: 10.1101/2023.05.16.541055. bioRxiv. 2023. Update in: Commun Chem. 2023 Sep 7;6(1):188. doi: 10.1038/s42004-023-00991-6. PMID: 37293073 Free PMC article. Updated. Preprint.
-
Improving AlphaFold2-based protein tertiary structure prediction with MULTICOM in CASP15.Commun Chem. 2023 Sep 7;6(1):188. doi: 10.1038/s42004-023-00991-6. Commun Chem. 2023. PMID: 37679431 Free PMC article.
-
Enhancing alphafold-multimer-based protein complex structure prediction with MULTICOM in CASP15.Commun Biol. 2023 Nov 10;6(1):1140. doi: 10.1038/s42003-023-05525-3. Commun Biol. 2023. PMID: 37949999 Free PMC article.
-
A Survey of Deep Learning Methods for Estimating the Accuracy of Protein Quaternary Structure Models.Biomolecules. 2024 May 13;14(5):574. doi: 10.3390/biom14050574. Biomolecules. 2024. PMID: 38785981 Free PMC article. Review.
References
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous