One contact for every twelve residues allows robust and accurate topology-level protein structure modeling

David E Kim¹, Frank Dimaio, Ray Yu-Ruei Wang, Yifan Song, David Baker

Affiliations

PMID: 23900763
PMCID: PMC4128384
DOI: 10.1002/prot.24374

One contact for every twelve residues allows robust and accurate topology-level protein structure modeling

David E Kim et al. Proteins. 2014 Feb.

. 2014 Feb;82 Suppl 2(0 2):208-18.

doi: 10.1002/prot.24374. Epub 2013 Sep 10.

Authors

David E Kim¹, Frank Dimaio, Ray Yu-Ruei Wang, Yifan Song, David Baker

Affiliation

¹ Department of Biochemistry, University of Washington, Seattle, 98195, Washington.

PMID: 23900763
PMCID: PMC4128384
DOI: 10.1002/prot.24374

Abstract

A number of methods have been described for identifying pairs of contacting residues in protein three-dimensional structures, but it is unclear how many contacts are required for accurate structure modeling. The CASP10 assisted contact experiment provided a blind test of contact guided protein structure modeling. We describe the models generated for these contact guided prediction challenges using the Rosetta structure modeling methodology. For nearly all cases, the submitted models had the correct overall topology, and in some cases, they had near atomic-level accuracy; for example the model of the 384 residue homo-oligomeric tetramer (Tc680o) had only 2.9 Å root-mean-square deviation (RMSD) from the crystal structure. Our results suggest that experimental and bioinformatic methods for obtaining contact information may need to generate only one correct contact for every 12 residues in the protein to allow accurate topology level modeling.

Keywords: ab initio prediction; comparative modeling; contact prediction; homology modeling; protein structure prediction; rosetta.

PubMed Disclaimer

Figures

**Figure 1**
Schematic representation of the protocol used for contact assisted structure prediction. The protocol consists of two stages. In the first stage (Topology Determination), topologies and partial structures with satisfied contacts and good secondary structure were obtained from (A) Rosetta *ab initio* sampling methods with constraints, (B) partial threaded models from the top 1000 SPARKS-X alignments, and (C) Rosetta non-local fragment pairs. For some targets, beta-strand pairings were predicted from either (B) or (C) and enforced using the Rosetta broken chain fold-tree structure prediction method. In the second stage (Topology Refinement), the models and partial structures predicted from the first stage were used as input for the RosettaCM recombination protocol to remodel regions where contacts were not satisfied from the first stage and to sample full-length topologies. All models were optimized using Rosetta all-atom refinement.

**Figure 2**
Contact assisted predictions significantly improved over the best nonassisted predictions. The native structure is on the left, our best submitted model in the middle, and the best nonassisted prediction among all predictor groups on the right. (A) Tc734, (B) Tc684-D2, (C) Tc719-D6, (D) Tc658-D1, and (E) Tc653 (native is on top, our model is in the middle, and the best nonassisted prediction among all predictor groups is on the bottom; orthogonal views are shown on the left and right; the best nonassisted prediction has typical LRR-like curvature, which is opposite to the atypical curvature of the native).

**Figure 3**
Contact assisted predictions with topology-level accuracy similar to best nonassisted predictions. In each panel the native structure is on the left, our best submitted model in the middle, and the best nonassisted prediction among all predictor groups on the right. For Tc680o, since nonassisted tetramer predictions were not made, the best single chain nonassisted prediction among all predictor groups is shown. (A) Tc666, (B) Tc717-D2, (C) Tc684-D1, (D) Tc735-D2, (E) Tc673, and (F) Tc680o.

**Figure 4**
Contact assisted predictions are significantly improved over most nonassisted predictions based on GDT-TS scores. (A) Best BAKER contact assisted predictions versus the best nonassisted predictions among all predictor groups for the 17 target domains (the GDT-TS scores for Tc658-D1 and Tc684-D2 are close enough to appear as the same point). (B) Model 1 BAKER contact assisted predictions versus the best model 1 nonassisted predictions among all predictor groups for the 17 target domains.

**Figure 5**
Examples of predictions with near atomic-level accuracy. The core side chains of the submitted model (red) and native (blue) are highlighted. (A) Tc719, (B) the converged section of Tc719, (C) the symmetric homo-oligomeric tetramer, Tc680o, and (D) the interface of Tc680o.

See this image and copyright information in PMC

References

1. Bradley P, Misura KM, Baker D. Toward high-resolution de novo structure prediction for small proteins. Science. 2005;309:1868–1871. - PubMed
1. Tyka MD, Keedy DA, Andre I, Dimaio F, Song Y, Richardson DC, Richardson JS, Baker D. Alternate states of proteins revealed by detailed energy landscape mapping. J Mol Biol. 2011;405:607–618. - PMC - PubMed
1. Xu D, Zhang Y. Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins. 2012;80:1715–1735. - PMC - PubMed
1. Cavalli A, Salvatella X, Dobson CM, Vendruscolo M. Protein structure determination from NMR chemical shifts. Proc Natl Acad Sci U S A. 2007;104:9615–9620. - PMC - PubMed
1. Shen Y, Lange O, Delaglio F, Rossi P, Aramini JM, Liu G, Eletsky A, Wu Y, Singarapu KK, Lemak A, Ignatchenko A, Arrowsmith CH, Szyperski T, Montelione GT, Baker D, Bax A. Consistent blind protein structure generation from NMR chemical shift data. Proc Natl Acad Sci U S A. 2008;105:4685–4690. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

One contact for every twelve residues allows robust and accurate topology-level protein structure modeling

Affiliation

One contact for every twelve residues allows robust and accurate topology-level protein structure modeling

Authors

Affiliation

Abstract

Figures

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous