DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

doi:10.1093/nar/gkac340

. 2022 Jul 5;50(W1):W235-W245.

doi: 10.1093/nar/gkac340.

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Xiaogen Zhou^{1

2}, Chunxiang Peng², Wei Zheng¹, Yang Li¹, Guijun Zhang², Yang Zhang^{1

3}

Affiliations

¹ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
² College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China.
³ Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.

PMID: 35536281
PMCID: PMC9252800
DOI: 10.1093/nar/gkac340

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Xiaogen Zhou et al. Nucleic Acids Res. 2022.

. 2022 Jul 5;50(W1):W235-W245.

doi: 10.1093/nar/gkac340.

Authors

Xiaogen Zhou^{1

2}, Chunxiang Peng², Wei Zheng¹, Yang Li¹, Guijun Zhang², Yang Zhang^{1

3}

Affiliations

¹ Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
² College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China.
³ Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.

PMID: 35536281
PMCID: PMC9252800
DOI: 10.1093/nar/gkac340

Abstract

Most proteins in nature contain multiple folding units (or domains). The revolutionary success of AlphaFold2 in single-domain structure prediction showed potential to extend deep-learning techniques for multi-domain structure modeling. This work presents a significantly improved method, DEMO2, which integrates analogous template structural alignments with deep-learning techniques for high-accuracy domain structure assembly. Starting from individual domain models, inter-domain spatial restraints are first predicted with deep residual convolutional networks, where full-length structure models are assembled using L-BFGS simulations under the guidance of a hybrid energy function combining deep-learning restraints and analogous multi-domain template alignments searched from the PDB. The output of DEMO2 contains deep-learning inter-domain restraints, top-ranked multi-domain structure templates, and up to five full-length structure models. DEMO2 was tested on a large-scale benchmark and the blind CASP14 experiment, where DEMO2 was shown to significantly outperform its predecessor and the state-of-the-art protein structure prediction methods. By integrating with new deep-learning techniques, DEMO2 should help fill the rapidly increasing gap between the improved ability of tertiary structure determination and the high demand for the high-quality multi-domain protein structures. The DEMO2 server is available at https://zhanggroup.org/DEMO/.

PubMed Disclaimer

Figures

**Graphical Abstract**
DEMO2 is an significantly improved version for automated assembly of full-length structural models of multi-domain proteins by integrating analogous template alignments with deep-learning predicted inter-domain spatial restraints.

**Figure 1.**
Flowchart of the DEMO2 pipeline. The procedure mainly includes global and local templates identification, inter-domain spatial restraints prediction by DeepPotential, domain model assembly through fast L-BFGS simulation, and side-chain repacking and domain-domain linker reconstruction.

**Figure 2.**
Comparison of DEMO2 with AIDA and DEMO. (A) Head-to-head TM-score comparison of models assembled by DEMO2 and that created by DEMO. (B) Head-to-head TM-score comparison of models generated by DEMO2 and that built by AIDA. (C and D) representative examples are showing DEMO2 builds better full-length models than DEMO and AIDA. Gray and color cartoons are native structures and DEMO assembled models, respectively, and different domains in the assembled models are represented by different colors. (C) 1vz6A. (D) 4ewtA.

**Figure 3.**
Comparison of DEMO2 with DMPfold and trRosetta. (A) Violin plot plus box plot for the TM-score of the final full-length model, where IQR means the interquartile range of the TM-score. (B) Histogram of the rTM-score of the final full-length model, where the vertical line indicates the outlier of the TM-scores. (C and D) representative examples are showing DEMO2 creates more accurate models than DMPfold and trRosetta. Gray and color cartoons are native structures and DEMO2 assembled models, respectively, and different domains in the assembled models are represented by different colors. (C) 3arbA. (D) 1g87B.

**Figure 4.**
Example of the DEMO2 results page. (A) Title of the results page, link to download all results, FASTA sequence, and domain boundaries of the target. (B) The user provided domain models for the assembly. (C) Predicted residue-residue distance maps and contact maps for domain model assembly. (D) The top ten analogous templates identified by the analogous structural alignment. (E) Top five final full-length models assemble by the server and the estimated accuracy of the model, where different domains are represented by different colors.

See this image and copyright information in PMC

Cited by

Structural modelling of human complement FHR1 and two of its synthetic derivatives provides insight into their in-vivo functions.
Ruiz-Molina N, Parsons J, Decker EL, Reski R. Ruiz-Molina N, et al. Comput Struct Biotechnol J. 2023 Feb 3;21:1473-1486. doi: 10.1016/j.csbj.2023.02.002. eCollection 2023. Comput Struct Biotechnol J. 2023. PMID: 36851916 Free PMC article.
Narrow funnel-like interaction energy distribution is an indicator of specific protein interaction partner.
Choi J. Choi J. iScience. 2023 May 20;26(6):106911. doi: 10.1016/j.isci.2023.106911. eCollection 2023 Jun 16. iScience. 2023. PMID: 37305691 Free PMC article.
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.
Zheng W, Wuyun Q, Freddolino L, Zhang Y. Zheng W, et al. Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31. Proteins. 2023. PMID: 37650367 Free PMC article.
Recent Progress of Protein Tertiary Structure Prediction.
Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W, Gao J, Zheng W. Wuyun Q, et al. Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832. Molecules. 2024. PMID: 38398585 Free PMC article. Review.
Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.
Zheng W, Wuyun Q, Li Y, Liu Q, Zhou X, Peng C, Zhu Y, Freddolino L, Zhang Y. Zheng W, et al. Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4. Online ahead of print. Nat Biotechnol. 2025. PMID: 40410405

See all "Cited by" articles

References

1. Wang S., Sun S., Li Z., Zhang R., Xu J.. Accurate De Novo prediction of protein contact map by ultra-deep learning model. PLoS Comput. Biol. 2017; 13:e1005324. - PMC - PubMed
1. Mortuza S.M., Zheng W., Zhang C., Li Y., Pearce R., Zhang Y.. Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions. Nat. Commun. 2021; 12:5011. - PMC - PubMed
1. Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G.R., Wang J., Cong Q., Kinch L.N., Schaeffer R.D.et al. .. Accurate prediction of protein structures and interactions using a three-track neural network. Science (New York, N.Y.). 2021; 373:871–876. - PMC - PubMed
1. Zheng W., Zhang C., Li Y., Pearce R., Bell E.W., Zhang Y.. Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods. 2021; 1:100014. - PMC - PubMed
1. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

[1] Wang S., Sun S., Li Z., Zhang R., Xu J.. Accurate De Novo prediction of protein contact map by ultra-deep learning model. PLoS Comput. Biol. 2017; 13:e1005324. - PMC - PubMed

[2] Wang S., Sun S., Li Z., Zhang R., Xu J.. Accurate De Novo prediction of protein contact map by ultra-deep learning model. PLoS Comput. Biol. 2017; 13:e1005324. - PMC - PubMed

[3] Mortuza S.M., Zheng W., Zhang C., Li Y., Pearce R., Zhang Y.. Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions. Nat. Commun. 2021; 12:5011. - PMC - PubMed

[4] Mortuza S.M., Zheng W., Zhang C., Li Y., Pearce R., Zhang Y.. Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions. Nat. Commun. 2021; 12:5011. - PMC - PubMed

[5] Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G.R., Wang J., Cong Q., Kinch L.N., Schaeffer R.D.et al. .. Accurate prediction of protein structures and interactions using a three-track neural network. Science (New York, N.Y.). 2021; 373:871–876. - PMC - PubMed

[6] Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G.R., Wang J., Cong Q., Kinch L.N., Schaeffer R.D.et al. .. Accurate prediction of protein structures and interactions using a three-track neural network. Science (New York, N.Y.). 2021; 373:871–876. - PMC - PubMed

[7] Zheng W., Zhang C., Li Y., Pearce R., Bell E.W., Zhang Y.. Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods. 2021; 1:100014. - PMC - PubMed

[8] Zheng W., Zhang C., Li Y., Pearce R., Bell E.W., Zhang Y.. Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods. 2021; 1:100014. - PMC - PubMed

[9] Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed

[10] Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Affiliations

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous