Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Feb;80(2):352-61.
doi: 10.1002/prot.23183. Epub 2011 Nov 22.

Template-based protein structure modeling using TASSER(VMT.)

Affiliations

Template-based protein structure modeling using TASSER(VMT.)

Hongyi Zhou et al. Proteins. 2012 Feb.

Abstract

Template-based protein structure modeling is commonly used for protein structure prediction. Based on the observation that multiple template-based methods often perform better than single template-based methods, we further explore the use of a variable number of multiple templates for a given target in the latest variant of TASSER, TASSER(VMT) . We first develop an algorithm that improves the target-template alignment for a given template. The improved alignment, called the SP(3) alternative alignment, is generated by a parametric alignment method coupled with short TASSER refinement on models selected using knowledge-based scores. The refined top model is then structurally aligned to the template to produce the SP(3) alternative alignment. Templates identified using SP(3) threading are combined with the SP(3) alternative and HHEARCH alignments to provide target alignments to each template. These template models are then grouped into sets containing a variable number of template/alignment combinations. For each set, we run short TASSER simulations to build full-length models. Then, the models from all sets of templates are pooled, and the top 20-50 models selected using FTCOM ranking method. These models are then subjected to a single longer TASSER refinement run for final prediction. We benchmarked our method by comparison with our previously developed approach, pro-sp(3) -TASSER, on a set with 874 easy and 318 hard targets. The average GDT-TS score improvements for the first model are 3.5 and 4.3% for easy and hard targets, respectively. When tested on the 112 CASP9 targets, our method improves the average GDT-TS scores as compared to pro-sp3-TASSER by 8.2 and 9.3% for the 80 easy and 32 hard targets, respectively. It also shows slightly better results than the top ranked CASP9 Zhang-Server, QUARK and HHpredA methods. The program is available for download at http://cssb.biology.gatech.edu/.

Keywords: SP3; TASSER; alignment; template-based modeling; threading.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) Flowchart of TASSERVMT (b) Flowchart of SP3 alternative alignment generation.
Figure 2
Figure 2
Scatter plot comparison of the first model’s GDT-TS scores by pro-sp3-TASSER and TASSERVMT on the 1192 protein benchmark target set. The number of targets for which TASSERVMT is better/worse than pro-sp3-TASSER is 765/397.
Figure 3
Figure 3
Histogram comparison of first model GDT-TS score by pro-sp3-TASSER and TASSERVMT on the 1192 target set. (a) Easy set; (b) Hard set. The y-axis is the number of targets having first model GDT-TS score > x.
Figure 4
Figure 4
Examples of TASSERVMT improvement over pro-sp3-TASSER models. Numbers in parenthesis are GDT-TS scores to native. TASSER models are also shown for comparison.

References

    1. Zhang Y. Progress and challenges in protein structure prediction. Curr Opin Struct Biol. 2008;18(3):342–348. - PMC - PubMed
    1. Venclovas C, Margelevicius M. Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment. Proteins. 2005;(Suppl 7):99–105. - PubMed
    1. Xu J. Fold recognition by predicted alignment accuracy. IEEE/ACM Trans on comput Biol and Bioinfo. 2005;2(2):157–165. - PubMed
    1. Fischer D, Eisenberg D. Protein fold recognition using sequence-derived predictions. Protein Sci. 1992;5:947–955. - PMC - PubMed
    1. Jones DT. GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J Mol Biol. 1999;287:797–815. - PubMed

Publication types

LinkOut - more resources