Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009;77 Suppl 9(Suppl 9):100-13.
doi: 10.1002/prot.22588.

I-TASSER: fully automated protein structure prediction in CASP8

Affiliations

I-TASSER: fully automated protein structure prediction in CASP8

Yang Zhang. Proteins. 2009.

Abstract

The I-TASSER algorithm for 3D protein structure prediction was tested in CASP8, with the procedure fully automated in both the Server and Human sections. The quality of the server models is close to that of human ones but the human predictions incorporate more diverse templates from other servers which improve the human predictions in some of the distant homology targets. For the first time, the sequence-based contact predictions from machine learning techniques are found helpful for both template-based modeling (TBM) and template-free modeling (FM). In TBM, although the accuracy of the sequence based contact predictions is on average lower than that from template-based ones, the novel contacts in the sequence-based predictions, which are complementary to the threading templates in the weakly or unaligned regions, are important to improve the global and local packing in these regions. Moreover, the newly developed atomic structural refinement algorithm was tested in CASP8 and found to improve the hydrogen-bonding networks and the overall TM-score, which is mainly due to its ability of removing steric clashes so that the models can be generated from cluster centroids. Nevertheless, one of the major issues of the I-TASSER pipeline is the model selection where the best models could not be appropriately recognized when the correct templates are detected only by the minority of the threading algorithms. There are also problems related with domain-splitting and mirror image recognition which mainly influences the performance of I-TASSER modeling in the FM-based structure predictions.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of the best threading templates with the first model predicted by the I-TASSER server. RMSD for models is calculated in the same aligned region as the threading template. The highlights in (b) are two domains where I-TASSER deteriorates the best templates.
Figure 2
Figure 2
The procedure of the I-TASSER server in modeling a FM target of T0416_2. The upper part shows the top 20 alignments by LOMETS for the whole-chain sequence followed by the subsequent threading on the domain which was missed in the whole-chain threading. The examples of 4 templates closest to the target are shown in the third row. The fourth row shows the native backbone structure with inter-residue lines indicating the side-chain contact predictions by SVMSEQ (red solid lines are true-positive and green dashed lines are false-positive predictions). The domain modeling was done in the sequence (L112-T198) but the tails (L112-E125 and F192-T198 shown as backbones in the final models) are trimmed during docking with other parts of the structures. The superposition is made on S124-K180 according to the assessor’s definition of T0416_2. The image is generated by MVP.
Figure 3
Figure 3
SVMSEQ contact predictions improve the modeling of T0437_1. (a) Structural superposition of the target (thin backbone) on the best template 2jz5A (thick backbone) with structural alignment generated by TM-align (RMSD =1.34A, TM-score =0.838). (b) Backbone structure of the native with lines between residues indicating Cα contact prediction from LOMETS. Red solid lines are true-positive and green dashed ones are false-positive. There is no true-positive contact in the lower part of the second beta-hairpin. (c) Same as (b) but contacts are from SVMSEQ with 10 true-positive predictions in the lower part of the second beta-hairpin. (d) Superposition of the I-TASSER server model on the native with a RMSD =1.13 A and a TM-score=0.885. The image is generated by MVP.
Figure 4
Figure 4
Comparison of the first models predicted by human (as “Zhang”) and server (as “Zhang-Server”) for all 164 domains.
Figure 5
Figure 5
TM-score of the I-TASSER server prediction (stars) in control with the best model (solid spheres) predicted by other servers in CASP8. (a) The first model by I-TASSER. (b) The best in top 100 models in I-TASSER simulation.
Figure 6
Figure 6
Structural modeling for T0504. (a) The experimental structure of the first two domains of T0504. (b) The template structure of 2gf7A detected by LOMETS which has the beta-hairpin swapped and may reflect a new evolution mechanism from the target. (c) Superposition of the native on the I-TASSER model (white backbone). The native structures of T0504_1 and T0504_2 are in blue and red. The architecture of the model and the native is similar but with different orientation of beta-hairpins.
Figure 7
Figure 7
The I-TASSER modeling for T0405_1 (a), where the mirror image structure (c) is ranked higher than the correct model (b).

References

    1. Murzin AG, Bateman A. CASP2 knowledge-based approach to distant homology recognition and fold prediction in CASP4. Proteins. 2001;(Suppl 5):76–85. - PubMed
    1. Ginalski K, Rychlewski L. Protein structure prediction of CASP5 comparative modeling and fold recognition targets using consensus alignment approach and 3D assessment. Proteins. 2003;53 (Suppl 6):410–417. - PubMed
    1. Das R, Qian B, Raman S, Vernon R, Thompson J, Bradley P, Khare S, Tyka MD, Bhat D, Chivian D, Kim DE, Sheffler WH, Malmstrom L, Wollacott AM, Wang C, Andre I, Baker D. Structure prediction for CASP7 targets using extensive all-atom refinement with Rosetta@home. Proteins. 2007;69(S8):118–128. - PubMed
    1. Baker D, Sali A. Protein structure prediction and structural genomics. Science. 2001;294(5540):93–96. - PubMed
    1. Skolnick J, Fetrow JS, Kolinski A. Structural genomics and its importance for gene function analysis. Nat Biotechnol. 2000;18(3):283–287. - PubMed

Publication types

LinkOut - more resources