Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2008 Jun;18(3):342-8.
doi: 10.1016/j.sbi.2008.02.004. Epub 2008 Apr 22.

Progress and challenges in protein structure prediction

Affiliations
Review

Progress and challenges in protein structure prediction

Yang Zhang. Curr Opin Struct Biol. 2008 Jun.

Abstract

Depending on whether similar structures are found in the PDB library, the protein structure prediction can be categorized into template-based modeling and free modeling. Although threading is an efficient tool to detect the structural analogs, the advancements in methodology development have come to a steady state. Encouraging progress is observed in structure refinement which aims at drawing template structures closer to the native; this has been mainly driven by the use of multiple structure templates and the development of hybrid knowledge-based and physics-based force fields. For free modeling, exciting examples have been witnessed in folding small proteins to atomic resolutions. However, predicting structures for proteins larger than 150 residues still remains a challenge, with bottlenecks from both force field and conformational search.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Structural superimposition results of 1413 representative single-domain proteins on their analogs in the PDB library. The structural analogs are searched by a sequence-independent structural-alignment tool, TM-align [8], and ranked by TM-score (a structural similarity measure balancing rmsd and coverage) [51]. All structural analogs with a sequence identity >20% to the target are excluded. If the analog of the highest TM-score has a coverage below 70%, the first structural analog with the coverage >70% is presented. As a result, all the structural analogs have a rmsd < 6 Å; 80% have a rmsd < 4 Å with >75% regions covered.
Figure 2
Figure 2
Representative examples of free modeling in CASP7 generated by two different approaches. T0283 (left panel) is a TBM target (from Bacillus halodurans) of 112 residues; but the model is generated by all-atom ROSETTA (a hybrid knowledge-based and physics-based approach) [45•] based on free modeling, which gives a TM-score 0.74 and a rmsd 1.8 Å over the first 92 residues (the overall rmsd is 13.8 Å mainly because of the misorientation of C-terminal). T0382 (right panel) is a FM/TBM target (from Rhodopseudomonas palustris CGA009) of 123 residues; the model is generated by I-TASSER (a purely knowledge-based approach) [31••] with a TM-score 0.66 and a rmsd 3.6 Å. Blue and red represent the model and the crystal structure, respectively.

References

    1. Burley SK, Almo SC, Bonanno JB, Capel M, Chance MR, Gaasterland T, Lin D, Sali A, Studier FW, Swaminathan S. Structural genomics: beyond the human genome project. Nat Genet. 1999;23:151–157. - PubMed
    1. Chandonia JM, Brenner SE. The impact of structural genomics: expectations and outcomes. Science. 2006;311:347–351. The authors review and assess the gain and loss of the structural genomics project in the past five years in contrast with traditional structural biology.

    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. - PMC - PubMed
    1. Pieper U, Eswar N, Davis FP, Braberg H, Madhusudhan MS, Rossi A, Marti-Renom M, Karchin R, Webb BM, Eramian D, et al. MODBASE: a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 2006;34:D291–D295. MODBASE is a database of 3D models built by the MODELLER pipeline for all protein sequences in SwissProt based on available structural templates in the PDB library.

    1. Moult J, Fidelis K, Kryshtafovych A, Rost B, Hubbard T, Tramontano A. Critical assessment of methods of protein structure prediction (CASP) — round VII. Proteins. 2007;69 Suppl 8:3–9. - PMC - PubMed

Publication types

LinkOut - more resources