Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 Jun;12(6):1232-46.
doi: 10.1110/ps.0300203.

Automated protein fold determination using a minimal NMR constraint strategy

Affiliations
Comparative Study

Automated protein fold determination using a minimal NMR constraint strategy

Deyou Zheng et al. Protein Sci. 2003 Jun.

Abstract

Determination of precise and accurate protein structures by NMR generally requires weeks or even months to acquire and interpret all the necessary NMR data. However, even medium-accuracy fold information can often provide key clues about protein evolution and biochemical function(s). In this article we describe a largely automatic strategy for rapid determination of medium-accuracy protein backbone structures. Our strategy derives from ideas originally introduced by other groups for determining medium-accuracy NMR structures of large proteins using deuterated, (13)C-, (15)N-enriched protein samples with selective protonation of side-chain methyl groups ((13)CH(3)). Data collection includes acquiring NMR spectra for automatically determining assignments of backbone and side-chain (15)N, H(N) resonances, and side-chain (13)CH(3) methyl resonances. These assignments are determined automatically by the program AutoAssign using backbone triple resonance NMR data, together with Spin System Type Assignment Constraints (STACs) derived from side-chain triple-resonance experiments. The program AutoStructure then derives conformational constraints using these chemical shifts, amide (1)H/(2)H exchange, nuclear Overhauser effect spectroscopy (NOESY), and residual dipolar coupling data. The total time required for collecting such NMR data can potentially be as short as a few days. Here we demonstrate an integrated set of NMR software which can process these NMR spectra, carry out resonance assignments, interpret NOESY data, and generate medium-accuracy structures within a few days. The feasibility of this combined data collection and analysis strategy starting from raw NMR time domain data was illustrated by automatic analysis of a medium accuracy structure of the Z domain of Staphylococcal protein A.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
13C-1H HSQC spectrum of Z domain (at 600 MHz). The 13C-1H HSQC spectrum (red) of 13C, 15N, 2H-enriched, 13C-1H methyl protonated Z domain used in this study is overlaid onto the 13C-1H HSQC spectrum (black) of a fully protonated 13C,15N-enriched sample. The isopropyl methyl groups of all Val, Leu, and δ methyl groups of all Ile (top right) are highly protonated, while the rest of the aliphatic and aromatic carbons are primarily deuterated.
Figure 2.
Figure 2.
Sequential connectivity map summarizing the results of automated backbone resonance assignments determined by AutoAssign. Intra (i, red) and sequential (s, yellow) connectivity data used by AutoAssign to establish resonance assignments at each sequence position are shown. Secondary structure information derived automatically from combined analysis of Cα, Cβ, and C′ chemical shift (CSI) and NOE data is also plotted along the protein sequence. Shown in the figure are also i to i + 1, i + 2, and i + 3 HN-HN NOE connections determined by an initial AutoStructure analysis (run with RDC data) that was used to further validate these assignments; the strength (intensity) of the corresponding NOE interactions are indicated by line thickness. Residues with intra (down triangles) STACi and sequential (up triangles) STACs constraints (derived from hCCcoNH-TOCSY and HcccoNH-TOCSY spectra) specifying N–HN roots belonging to or following Val, Leu, Ile residues, respectively, are also indicated.
Figure 3.
Figure 3.
Ribbon representation and backbone trace of Z domain structures. (A) High- resolution solution NMR structure of Z domain (Tashiro et al. 1997; PDB ID 2SPZ). Structures generated automatically with the “minimal-constraint” strategy outlined in the text (B) without RDC and (C) with RDC data, superimposed on backbone structure of 2SPZ (gold).
Figure 4.
Figure 4.
Structures determined by simulated or archived, edited experimental minimal NMR constraints. Coordinate sets for six proteins were taken from Protein Data Bank (PDB). Distance constraints were generated for all interatomic distances <4.5 Å involving amide (backbone N and side-chain N of Asn/Gln) and methyl C of Leu, Val, and Ile(δ). Dihedral angle restraints (for regular secondary structures only), backbone hydrogen bonds, and both N–HN and Cα–C′ RDC data were also back calculated from the atomic coordinates, as described in the text. These data sets are referred to as Simulated Constraint Data. A second set of constraint lists referred as Archived, Edited Constraint Data were generated by editing archived NMR constraint files deposited in PDB, to replace simulated distance constraints with NOE-derived distances between amide and methyl protons. Other simulated data were also replaced with experimental data where available, as described in the text. For each protein used in this analysis, the published high-resolution NMR structure (PDB structure) is shown along with representative structures from the ensembles generated with CNS using the Simulated Constraint Data or the Archived, Edited Constraint Data.

Similar articles

Cited by

References

    1. Aghazadeh, B., Zhu, K., Kubiseski, T.J., Liu, G.A., Pawson, T., Zheng, Y., and Rosen, M.K. 1998. Structure and mutagenesis of the DbI homology domain. Nat. Struct. Biol. 5 1098–1107. - PubMed
    1. Andrec, M., Du, P., and Levy, R.M. 2001. Protein backbone structure determination using only residual dipolar couplings from one ordering medium. J. Biomol. NMR 21 335–347. - PubMed
    1. Andrec, M., Harano, Y., Jacobson, M.P., Friesner, R., and Levy, R.M. 2002. Complete protein structure determination using backbone residual dipolar couplings and sidechain rotamer prediction. J. Struct. Funct. Genomics 2 103–111. - PubMed
    1. Berardi, M.J., Sun, C., Zehr, M., Abildgaard, F., Peng, J., Speck, N.A., and Bushweller, J.H. 1999. The Ig fold of the core binding factor α Runt domain is a member of a family of structurally and functionally related Ig-fold DNA-binding domains. Structure 7 1247–1256. - PubMed
    1. Brunger, A.T., Adams, P.D., Clore, G.M., Delano, W.L., Gros, P., Grosse-Kunstleve, R.W., Jiang, J., Kuszewski, J., Nilges, M., Pannu, N.S., et al. 1998. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta. Crystallogr. D54 905–921. - PubMed

Publication types