. 2016 Sep 19;55(39):11970-4.

doi: 10.1002/anie.201604788. Epub 2016 Aug 25.

Prediction of Protein Structure Using Surface Accessibility Data

Christoph Hartlmüller^{1

2}, Christoph Göbl^{1

2}, Tobias Madl^{3

4

5}

Affiliations

¹ Center for Integrated Protein Science Munich, Technische Universität München, Department of Chemistry, Lichtenbergstrasse 4, 85748, Garching, Germany.
² Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764, Neuherberg, Germany.
³ Center for Integrated Protein Science Munich, Technische Universität München, Department of Chemistry, Lichtenbergstrasse 4, 85748, Garching, Germany. tobias.madl@medunigraz.at.
⁴ Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764, Neuherberg, Germany. tobias.madl@medunigraz.at.
⁵ Institute of Molecular Biology & Biochemistry, Center of Molecular Medicine, Medical University of Graz, 8010, Graz, Austria. tobias.madl@medunigraz.at.

PMID: 27560616
PMCID: PMC5026166
DOI: 10.1002/anie.201604788

Prediction of Protein Structure Using Surface Accessibility Data

Christoph Hartlmüller et al. Angew Chem Int Ed Engl. 2016.

. 2016 Sep 19;55(39):11970-4.

doi: 10.1002/anie.201604788. Epub 2016 Aug 25.

Authors

Christoph Hartlmüller^{1

2}, Christoph Göbl^{1

2}, Tobias Madl^{3

4

5}

Affiliations

¹ Center for Integrated Protein Science Munich, Technische Universität München, Department of Chemistry, Lichtenbergstrasse 4, 85748, Garching, Germany.
² Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764, Neuherberg, Germany.
³ Center for Integrated Protein Science Munich, Technische Universität München, Department of Chemistry, Lichtenbergstrasse 4, 85748, Garching, Germany. tobias.madl@medunigraz.at.
⁴ Institute of Structural Biology, Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764, Neuherberg, Germany. tobias.madl@medunigraz.at.
⁵ Institute of Molecular Biology & Biochemistry, Center of Molecular Medicine, Medical University of Graz, 8010, Graz, Austria. tobias.madl@medunigraz.at.

PMID: 27560616
PMCID: PMC5026166
DOI: 10.1002/anie.201604788

Abstract

An approach to the de novo structure prediction of proteins is described that relies on surface accessibility data from NMR paramagnetic relaxation enhancements by a soluble paramagnetic compound (sPRE). This method exploits the distance-to-surface information encoded in the sPRE data in the chemical shift-based CS-Rosetta de novo structure prediction framework to generate reliable structural models. For several proteins, it is demonstrated that surface accessibility data is an excellent measure of the correct protein fold in the early stages of the computational folding algorithm and significantly improves accuracy and convergence of the standard Rosetta structure prediction approach.

Keywords: CS-Rosetta; NMR spectroscopy; paramagnetic relaxation; protein structure prediction; structural biology.

PubMed Disclaimer

Figures

**Figure 1**
Principle of sPRE‐CS‐Rosetta. a) NMR sPRE data provides quantitative and residue specific information on the solvent accessibility as the effect of paramagnetic probes such as Gd(DTPA‐BMA) is distance dependent. b) Back‐calculation of sPRE data relies on placing the protein into equidistantly spaced grid points, while overlapping grid points are removed. The sPRE is approximated by the sum of all contributions of the surrounding grid points. c) The sPRE module is implemented as a scoring function capable of scoring centroid as well as full‐atom models. At its core, the experimental sPRE data (sPRE^exp) is compared to the predicted sPRE data of the current Rosetta model (sPRE^calc) and a score based on the Spearman correlation coefficient (colored numbers) is computed. In this scheme, the sPRE score is used during the folding of the protein backbone using the simplified centroid model as well as for rescoring the final full‐atom models.

**Figure 2**
sPRE data is an excellent measure of the correct protein fold and improves protein structure prediction. a) Structural ensembles of ubiquitin representing different stages of the AbinitioRelax protocol were rescored using Rosetta centroid and full‐atom scores (orange axis), the sPRE score (blue axis), and the chemical shift score (black axis). Experimental sPRE data for H^N and H^aliphatic protons were used as input for the sPRE score. b), c) Box plots showing the average C^α‐RMSD to the native structure for models obtained from CS‐Rosetta (orange) and sPRE‐CS‐Rosetta (blue). sPRE data was determined by NMR experiments (b) or back‐calculated (c). All obtained structural models were scored according to the sum of the Rosetta, chemical shift and sPRE score (b) or according to the sum of the Rosetta and the chemical shift score (c). For every protein, the best scored 0.2 % structures of all models were selected and used to generate the box plots. Proteins for which the sampling was improved by the sPRE module (reduced mean RMSD to native structure compared to CS‐Rosetta) are marked with a gray background and proteins for which CS‐Rosetta and sPRE‐CS‐Rosetta failed are not shown (average C^α‐RMSD >10 Å in the case of p16, 1CX1, 1F2 H, 1GXE, 1IX5, 1ON4, 1RFL, 1XWE, 2KNR, 2LFC, 2LFP, 2LLL, 2PQE, 2RRF, 3ZQD, and 4A5V). All scores are shown in arbitrary units.

**Figure 3**
sPRE data enhances accuracy and convergence of CS‐Rosetta structure prediction. The lowest‐energy models of CS‐Rosetta (orange) and sPRE‐CS‐Rosetta (blue) are compared to the NMR solution structures (gray, PDB code). For both methods, the corresponding Rosetta score (score13_env_hb) is plotted on the left and the distribution of the C^α‐RMSD of the sampled structures is shown below for both methods in a logarithmic histogram. For ubiquitin (a) and the C‐terminal domain of Phl p 5a (b) experimental sPRE data for amide and aliphatic protons is used, and for human prion protein (c) and the P‐type ATPase CopA (d) the input sPRE data was back‐calculated using the lowest energy model. In (a) and (c), the best scored model according to the Rosetta score is shown (see arrow in score plots), and for (b) and (d) the 10 lowest‐energy models are shown. For ubiquitin (a), a red sphere represents the position of the C^β atom of His 68, indicating the wrong positioning of the β‐strand in the CS‐Rosetta run. A more detailed picture of the scores is shown in the Supporting Information, Figure S3. All scores are shown in arbitrary units.

See this image and copyright information in PMC

Cited by

Characterization of Protein-Protein Interfaces in Large Complexes by Solid-State NMR Solvent Paramagnetic Relaxation Enhancements.
Öster C, Kosol S, Hartlmüller C, Lamley JM, Iuga D, Oss A, Org ML, Vanatalu K, Samoson A, Madl T, Lewandowski JR. Öster C, et al. J Am Chem Soc. 2017 Sep 6;139(35):12165-12174. doi: 10.1021/jacs.7b03875. Epub 2017 Aug 25. J Am Chem Soc. 2017. PMID: 28780861 Free PMC article.
AssignSLP_GUI, a software tool exploiting AI for NMR resonance assignment of sparsely labeled proteins.
Williams RV, Rogals MJ, Eletsky A, Huang C, Morris LC, Moremen KW, Prestegard JH. Williams RV, et al. J Magn Reson. 2022 Dec;345:107336. doi: 10.1016/j.jmr.2022.107336. Epub 2022 Nov 19. J Magn Reson. 2022. PMID: 36442299 Free PMC article.
Amino Acid Insertion Frequencies Arising from Photoproducts Generated Using Aliphatic Diazirines.
Ziemianowicz DS, Bomgarden R, Etienne C, Schriemer DC. Ziemianowicz DS, et al. J Am Soc Mass Spectrom. 2017 Oct;28(10):2011-2021. doi: 10.1007/s13361-017-1730-z. Epub 2017 Aug 10. J Am Soc Mass Spectrom. 2017. PMID: 28799075
A cation-π interaction in a transmembrane helix of vacuolar ATPase retains the proton-transporting arginine in a hydrophobic environment.
Hohlweg W, Wagner GE, Hofbauer HF, Sarkleti F, Setz M, Gubensäk N, Lichtenegger S, Falsone SF, Wolinski H, Kosol S, Oostenbrink C, Kohlwein SD, Zangger K. Hohlweg W, et al. J Biol Chem. 2018 Dec 7;293(49):18977-18988. doi: 10.1074/jbc.RA118.005276. Epub 2018 Sep 12. J Biol Chem. 2018. PMID: 30209131 Free PMC article.
Utilization of Hydrophobic Microenvironment Sensitivity in Diethylpyrocarbonate Labeling for Protein Structure Prediction.
Biehn SE, Limpikirati P, Vachet RW, Lindert S. Biehn SE, et al. Anal Chem. 2021 Jun 15;93(23):8188-8195. doi: 10.1021/acs.analchem.1c00395. Epub 2021 Jun 1. Anal Chem. 2021. PMID: 34061512 Free PMC article.

See all "Cited by" articles

References

1. None
1. Göbl C., Madl T., Simon B., Sattler M., Prog. Nucl. Magn. Reson. Spectrosc. 2014, 80, 26–63; - PubMed
1. Cavanagh J., Protein NMR Spectroscopy: Principles and Practice , 2nd ed., Academic Press, Amsterdam, Boston, 2007;
1. Rule G. S., Hitchens T. K., Fundamentals of Protein NMR Spectroscopy, Springer, Dordrecht, 2006.
1. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E., Nucleic Acids Res. 2000, 28, 235–242. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

P 28854/FWF_/Austrian Science Fund FWF/Austria

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Prediction of Protein Structure Using Surface Accessibility Data

Affiliations

Prediction of Protein Structure Using Surface Accessibility Data

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources