Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Dec;39(4):303-14.
doi: 10.1007/s10858-007-9199-x. Epub 2007 Oct 9.

A global analysis of NMR distance constraints from the PDB

Affiliations

A global analysis of NMR distance constraints from the PDB

Wim Vranken. J Biomol NMR. 2007 Dec.

Abstract

Information obtained from Nuclear Magnetic Resonance (NMR) experiments is encoded as a set of constraint lists when calculating three-dimensional structures for a protein. With the amount of constraint data from the world wide Protein Data Bank (wwPDB) that is now available, it is possible to do a global, large-scale analysis using only information from the constraints, without taking the coordinate information into account. This article describes such an analysis of distance constraints from NOE data based on a set of 1834 NMR PDB entries containing 1909 protein chains. In order to best represent the quality and extent of the data that is currently deposited at the wwPDB, only the original data as deposited by the authors was used, and no attempt was made to 'clean up' and further interpret this information. Because the constraint lists provide a single set of data, and not an ensemble of structural solutions, they are easier to analyse and provide a reduced form of structural information that is relevant for NMR analysis only. The online resource resulting from this analysis ( http://www.ebi.ac.uk/msd/srv/docs/NMR/analysis/results/html/comparison.html ) makes it possible to check, for example, how often a particular contact occurs when assigning NOESY spectra, or to find out whether a particular sequence fragment is likely to be difficult to assign. In this respect it formalises information that scientists with experience in spectrum analysis are aware of but cannot necessarily quantify. The analysis described here illustrates the importance of depositing constraints (and all other possible NMR derived information) along with the structure coordinates, as this type of information can greatly assist the NMR community.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Overview of the workflow employed in the analysis. Grey boxes indicate files, white boxes Python scripts
Fig. 2
Fig. 2
Example web page for Ala → Ala i → i + 2 contact information from http://www.ebi.ac.uk/msd-srv/docs/NMR/analysis/results/html/comparison.html. The different data sets, comparisons between them, and analysis categories can be accessed via the left hand side menu. In this contact information page, overall statistics for the individual Ala residues involved in this contact are shown in the ‘Residue 1’ and ‘Residue 2’ tables at the top, statistics for the combined Ala–Ala residues in the ‘Residue combination’ table. The information on an atom level is listed in the ‘Breakdown per contact’ table. Outliers per secondary structure combination based on a binomial analysis are highlighted in red (higher than expected) and blue (lower than expected). This colour coding is used throughout the web pages for other types of analyses
Fig. 3
Fig. 3
Correlation between the foccurrence for the HP and AHIP data sets (correlation Spearman 0.971, Pearson 0.996)
Fig. 4
Fig. 4
Distance distribution from the constraint information for sequential Ala–Ala contacts between backbone H protons
Fig. 5
Fig. 5
Correlation between the foccurrence from the HPC (Coordinates) and HP (Constraint) sets

Similar articles

Cited by

  • The NMR restraints grid at BMRB for 5,266 protein and nucleic acid PDB entries.
    Doreleijers JF, Vranken WF, Schulte C, Lin J, Wedell JR, Penkett CJ, Vuister GW, Vriend G, Markley JL, Ulrich EL. Doreleijers JF, et al. J Biomol NMR. 2009 Dec;45(4):389-96. doi: 10.1007/s10858-009-9378-z. Epub 2009 Oct 7. J Biomol NMR. 2009. PMID: 19809795 Free PMC article.
  • Straightforward and complete deposition of NMR data to the PDBe.
    Penkett CJ, van Ginkel G, Velankar S, Swaminathan J, Ulrich EL, Mading S, Stevens TJ, Fogh RH, Gutmanas A, Kleywegt GJ, Henrick K, Vranken WF. Penkett CJ, et al. J Biomol NMR. 2010 Oct;48(2):85-92. doi: 10.1007/s10858-010-9439-3. Epub 2010 Aug 3. J Biomol NMR. 2010. PMID: 20680401 Free PMC article.
  • The Protein Data Bank in Europe (PDBe): bringing structure to biology.
    Velankar S, Kleywegt GJ. Velankar S, et al. Acta Crystallogr D Biol Crystallogr. 2011 Apr;67(Pt 4):324-30. doi: 10.1107/S090744491004117X. Epub 2011 Mar 18. Acta Crystallogr D Biol Crystallogr. 2011. PMID: 21460450 Free PMC article.
  • PDBe: Protein Data Bank in Europe.
    Velankar S, Best C, Beuth B, Boutselakis CH, Cobley N, Sousa Da Silva AW, Dimitropoulos D, Golovin A, Hirshberg M, John M, Krissinel EB, Newman R, Oldfield T, Pajon A, Penkett CJ, Pineda-Castillo J, Sahni G, Sen S, Slowley R, Suarez-Uruena A, Swaminathan J, van Ginkel G, Vranken WF, Henrick K, Kleywegt GJ. Velankar S, et al. Nucleic Acids Res. 2010 Jan;38(Database issue):D308-17. doi: 10.1093/nar/gkp916. Epub 2009 Oct 25. Nucleic Acids Res. 2010. PMID: 19858099 Free PMC article.
  • NRG-CING: integrated validation reports of remediated experimental biomolecular NMR data and coordinates in wwPDB.
    Doreleijers JF, Vranken WF, Schulte C, Markley JL, Ulrich EL, Vriend G, Vuister GW. Doreleijers JF, et al. Nucleic Acids Res. 2012 Jan;40(Database issue):D519-24. doi: 10.1093/nar/gkr1134. Epub 2011 Dec 1. Nucleic Acids Res. 2012. PMID: 22139937 Free PMC article.

References

    1. Bates D, Chambers J, Dalgaard P et al (2007) The R project for statistical computing. http://www.r-project.org/. Cited 10 Apr 2006. University site
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1093/nar/gkl971', 'is_inner': False, 'url': 'https://doi.org/10.1093/nar/gkl971'}, {'type': 'PMC', 'value': 'PMC1669775', 'is_inner': False, 'url': 'https://pmc.ncbi.nlm.nih.gov/articles/PMC1669775/'}, {'type': 'PubMed', 'value': '17142228', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/17142228/'}]}
    2. Berman H, Henrick K, Nakamura H et al (2007) The worldwide Protein Data Bank (wwPDB): ensuring a single uniform archive of PDB data. Nucleic Acids Res 35:D301–D303 - PMC - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1006/jmbi.1998.1808', 'is_inner': False, 'url': 'https://doi.org/10.1006/jmbi.1998.1808'}, {'type': 'PubMed', 'value': '9680482', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/9680482/'}]}
    2. Doreleijers JF, Rullmann JA, Kaptein R (1998) Quality assessment of NMR structures: a statistical survey. J Mol Biol 281:149–164 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1023/A:1023514106644', 'is_inner': False, 'url': 'https://doi.org/10.1023/a:1023514106644'}, {'type': 'PubMed', 'value': '12766409', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/12766409/'}]}
    2. Doreleijers JF, Mading S, Maziuk D et al (2003) BioMagResBank database with sets of experimental NMR constraints corresponding to the structures of over 1400 biomolecules deposited in the Protein Data Bank. J Biomol NMR 26:139–146 - PubMed
    1. {'text': '', 'ref_index': 1, 'ids': [{'type': 'DOI', 'value': '10.1007/s10858-005-2195-0', 'is_inner': False, 'url': 'https://doi.org/10.1007/s10858-005-2195-0'}, {'type': 'PubMed', 'value': '16041478', 'is_inner': True, 'url': 'https://pubmed.ncbi.nlm.nih.gov/16041478/'}]}
    2. Doreleijers JF, Nederveen AJ, Vranken W et al (2005) BioMagResBank databases DOCR and FRED containing converted and filtered sets of experimental NMR restraints and coordinates from over 500 protein PDB structures. J Biomol NMR 32:1–12 - PubMed

Publication types