Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 5;50(W1):W235-W245.
doi: 10.1093/nar/gkac340.

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Affiliations

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction

Xiaogen Zhou et al. Nucleic Acids Res. .

Abstract

Most proteins in nature contain multiple folding units (or domains). The revolutionary success of AlphaFold2 in single-domain structure prediction showed potential to extend deep-learning techniques for multi-domain structure modeling. This work presents a significantly improved method, DEMO2, which integrates analogous template structural alignments with deep-learning techniques for high-accuracy domain structure assembly. Starting from individual domain models, inter-domain spatial restraints are first predicted with deep residual convolutional networks, where full-length structure models are assembled using L-BFGS simulations under the guidance of a hybrid energy function combining deep-learning restraints and analogous multi-domain template alignments searched from the PDB. The output of DEMO2 contains deep-learning inter-domain restraints, top-ranked multi-domain structure templates, and up to five full-length structure models. DEMO2 was tested on a large-scale benchmark and the blind CASP14 experiment, where DEMO2 was shown to significantly outperform its predecessor and the state-of-the-art protein structure prediction methods. By integrating with new deep-learning techniques, DEMO2 should help fill the rapidly increasing gap between the improved ability of tertiary structure determination and the high demand for the high-quality multi-domain protein structures. The DEMO2 server is available at https://zhanggroup.org/DEMO/.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
DEMO2 is an significantly improved version for automated assembly of full-length structural models of multi-domain proteins by integrating analogous template alignments with deep-learning predicted inter-domain spatial restraints.
Figure 1.
Figure 1.
Flowchart of the DEMO2 pipeline. The procedure mainly includes global and local templates identification, inter-domain spatial restraints prediction by DeepPotential, domain model assembly through fast L-BFGS simulation, and side-chain repacking and domain-domain linker reconstruction.
Figure 2.
Figure 2.
Comparison of DEMO2 with AIDA and DEMO. (A) Head-to-head TM-score comparison of models assembled by DEMO2 and that created by DEMO. (B) Head-to-head TM-score comparison of models generated by DEMO2 and that built by AIDA. (C and D) representative examples are showing DEMO2 builds better full-length models than DEMO and AIDA. Gray and color cartoons are native structures and DEMO assembled models, respectively, and different domains in the assembled models are represented by different colors. (C) 1vz6A. (D) 4ewtA.
Figure 3.
Figure 3.
Comparison of DEMO2 with DMPfold and trRosetta. (A) Violin plot plus box plot for the TM-score of the final full-length model, where IQR means the interquartile range of the TM-score. (B) Histogram of the rTM-score of the final full-length model, where the vertical line indicates the outlier of the TM-scores. (C and D) representative examples are showing DEMO2 creates more accurate models than DMPfold and trRosetta. Gray and color cartoons are native structures and DEMO2 assembled models, respectively, and different domains in the assembled models are represented by different colors. (C) 3arbA. (D) 1g87B.
Figure 4.
Figure 4.
Example of the DEMO2 results page. (A) Title of the results page, link to download all results, FASTA sequence, and domain boundaries of the target. (B) The user provided domain models for the assembly. (C) Predicted residue-residue distance maps and contact maps for domain model assembly. (D) The top ten analogous templates identified by the analogous structural alignment. (E) Top five final full-length models assemble by the server and the estimated accuracy of the model, where different domains are represented by different colors.

Similar articles

Cited by

References

    1. Wang S., Sun S., Li Z., Zhang R., Xu J.. Accurate De Novo prediction of protein contact map by ultra-deep learning model. PLoS Comput. Biol. 2017; 13:e1005324. - PMC - PubMed
    1. Mortuza S.M., Zheng W., Zhang C., Li Y., Pearce R., Zhang Y.. Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions. Nat. Commun. 2021; 12:5011. - PMC - PubMed
    1. Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G.R., Wang J., Cong Q., Kinch L.N., Schaeffer R.D.et al. .. Accurate prediction of protein structures and interactions using a three-track neural network. Science (New York, N.Y.). 2021; 373:871–876. - PMC - PubMed
    1. Zheng W., Zhang C., Li Y., Pearce R., Bell E.W., Zhang Y.. Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Reports Methods. 2021; 1:100014. - PMC - PubMed
    1. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed

Publication types