A round-robin approach provides a detailed assessment of biomolecular small-angle scattering data reproducibility and yields consensus curves for benchmarking

Jill Trewhella¹, Patrice Vachette², Jan Bierma³, Clement Blanchet⁴, Emre Brookes⁵, Srinivas Chakravarthy⁶, Leonie Chatzimagas⁷, Thomas E Cleveland 4th⁸, Nathan Cowieson⁹, Ben Crossett¹⁰, Anthony P Duff¹¹, Daniel Franke⁴, Frank Gabel¹², Richard E Gillilan¹³, Melissa Graewert⁴, Alexander Grishaev⁸, J Mitchell Guss¹, Michal Hammel³, Jesse Hopkins⁶, Qingqui Huang¹³, Jochen S Hub⁷, Greg L Hura³, Thomas C Irving⁶, Cy Michael Jeffries⁴, Cheol Jeong¹⁴, Nigel Kirby¹⁵, Susan Krueger¹⁶, Anne Martel¹⁷, Tsutomu Matsui¹⁸, Na Li¹⁹, Javier Pérez²⁰, Lionel Porcar¹⁷, Thierry Prangé²¹, Ivan Rajkovic¹⁸, Mattia Rocco²², Daniel J Rosenberg³, Timothy M Ryan¹⁵, Soenke Seifert²³, Hiroshi Sekiguchi²⁴, Dmitri Svergun⁴, Susana Teixeira¹⁶, Aurelien Thureau²⁰, Thomas M Weiss¹⁸, Andrew E Whitten¹¹, Kathleen Wood¹¹, Xiaobing Zuo²³

Affiliations

¹ School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia.
² Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Paris, 91198 Gif-sur-Yvette, France.
³ Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.
⁴ European Molecular Biology Laboratory (EMBL) Hamburg Unit, Notkestrasse 85, c/o Deutsches Elektronen-Synchrotron, 22607 Hamburg, Germany.
⁵ Chemistry and Biochemistry, University of Montana, 32 Campus Drive, Missoula, MT 59812, USA.
⁶ BioCAT, Department of Biological Sciences, Illinois Institute of Technology, Chicago, IL 60616, USA.
⁷ Theoretical Physics and Center for Biophysics, Saarland University, Campus E2.6, 66123 Saarbrücken, Germany.
⁸ Institute for Bioscience and Biotechnology Research, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
⁹ Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom.
¹⁰ Sydney Mass Spectrometry, The University of Sydney, Sydney, NSW 2006, Australia.
¹¹ Australian Nuclear Science and Technology Organisation, New Illawara Road, Lucas Heights, NSW 2234, Australia.
¹² Institut de Biologie Structurale, CEA, CNRS, Université Grenoblé Alpes, 41 Rue Jules Horowitz, 38027 Grenoble, France.
¹³ Cornell High-Energy Synchrotron Source, 161 Synchrotron Drive, Ithaca, NY 14853, USA.
¹⁴ Department of Physics, Wesleyan University, Middletown, CT 06459, USA.
¹⁵ Australian Synchrotron, ANSTO, 800 Blackburn Road, Clayton, VIC 3158, Australia.
¹⁶ National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD 20899, USA.
¹⁷ Institut Laue-Langevin, 71 Avenue des Martyrs, 38042 Grenoble CEDEX 9, France.
¹⁸ Stanford Synchrotron Radiation Lightsource, Stanford University, 2575 Sand Hill Road, Menlo Park, CA 94025, USA.
¹⁹ National Facility for Protein Science in Shanghai, Zhangjiang Laboratory, Shanghai Advanced Research Institute, Chinese Academy of Sciences, Road No. 333, Haike Road, Shanghai 201210, People's Republic of China.
²⁰ Synchrotron SOLEIL, L'Orme des Merisiers, Saint-Aubin BP 48, 91192 Gif-sur-Yvette, France.
²¹ CITCoM (UMR 8038 CNRS), Faculté de Pharmacie, 4 Avenue de l'Observatoire, 75006 Paris, France.
²² Proteomica e Spettrometria di Massa, IRCCS Ospedale Policlinico San Martino, Largo R. Benzi 10, 16132 Genova, Italy.
²³ X-ray Science Division, Advanced Photon Source, Argonne National Laboratory, Lemont, IL 60439, USA.
²⁴ SPring-8, Japan Synchrotron Radiation Research Institute, 1-1-1 Kouto, Sayo-cho, Sayo-gun, Hyōgo 679-5198, Japan.

PMID: 36322416
PMCID: PMC9629491
DOI: 10.1107/S2059798322009184

A round-robin approach provides a detailed assessment of biomolecular small-angle scattering data reproducibility and yields consensus curves for benchmarking

Jill Trewhella et al. Acta Crystallogr D Struct Biol. 2022.

. 2022 Nov 1;78(Pt 11):1315-1336.

doi: 10.1107/S2059798322009184. Epub 2022 Oct 20.

Authors

Affiliations

¹ School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia.
² Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Paris, 91198 Gif-sur-Yvette, France.
³ Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.
⁴ European Molecular Biology Laboratory (EMBL) Hamburg Unit, Notkestrasse 85, c/o Deutsches Elektronen-Synchrotron, 22607 Hamburg, Germany.
⁵ Chemistry and Biochemistry, University of Montana, 32 Campus Drive, Missoula, MT 59812, USA.
⁶ BioCAT, Department of Biological Sciences, Illinois Institute of Technology, Chicago, IL 60616, USA.
⁷ Theoretical Physics and Center for Biophysics, Saarland University, Campus E2.6, 66123 Saarbrücken, Germany.
⁸ Institute for Bioscience and Biotechnology Research, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
⁹ Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom.
¹⁰ Sydney Mass Spectrometry, The University of Sydney, Sydney, NSW 2006, Australia.
¹¹ Australian Nuclear Science and Technology Organisation, New Illawara Road, Lucas Heights, NSW 2234, Australia.
¹² Institut de Biologie Structurale, CEA, CNRS, Université Grenoblé Alpes, 41 Rue Jules Horowitz, 38027 Grenoble, France.
¹³ Cornell High-Energy Synchrotron Source, 161 Synchrotron Drive, Ithaca, NY 14853, USA.
¹⁴ Department of Physics, Wesleyan University, Middletown, CT 06459, USA.
¹⁵ Australian Synchrotron, ANSTO, 800 Blackburn Road, Clayton, VIC 3158, Australia.
¹⁶ National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD 20899, USA.
¹⁷ Institut Laue-Langevin, 71 Avenue des Martyrs, 38042 Grenoble CEDEX 9, France.
¹⁸ Stanford Synchrotron Radiation Lightsource, Stanford University, 2575 Sand Hill Road, Menlo Park, CA 94025, USA.
¹⁹ National Facility for Protein Science in Shanghai, Zhangjiang Laboratory, Shanghai Advanced Research Institute, Chinese Academy of Sciences, Road No. 333, Haike Road, Shanghai 201210, People's Republic of China.
²⁰ Synchrotron SOLEIL, L'Orme des Merisiers, Saint-Aubin BP 48, 91192 Gif-sur-Yvette, France.
²¹ CITCoM (UMR 8038 CNRS), Faculté de Pharmacie, 4 Avenue de l'Observatoire, 75006 Paris, France.
²² Proteomica e Spettrometria di Massa, IRCCS Ospedale Policlinico San Martino, Largo R. Benzi 10, 16132 Genova, Italy.
²³ X-ray Science Division, Advanced Photon Source, Argonne National Laboratory, Lemont, IL 60439, USA.
²⁴ SPring-8, Japan Synchrotron Radiation Research Institute, 1-1-1 Kouto, Sayo-cho, Sayo-gun, Hyōgo 679-5198, Japan.

PMID: 36322416
PMCID: PMC9629491
DOI: 10.1107/S2059798322009184

Abstract

Through an expansive international effort that involved data collection on 12 small-angle X-ray scattering (SAXS) and four small-angle neutron scattering (SANS) instruments, 171 SAXS and 76 SANS measurements for five proteins (ribonuclease A, lysozyme, xylanase, urate oxidase and xylose isomerase) were acquired. From these data, the solvent-subtracted protein scattering profiles were shown to be reproducible, with the caveat that an additive constant adjustment was required to account for small errors in solvent subtraction. Further, the major features of the obtained consensus SAXS data over the q measurement range 0-1 Å^-1 are consistent with theoretical prediction. The inherently lower statistical precision for SANS limited the reliably measured q-range to <0.5 Å^-1, but within the limits of experimental uncertainties the major features of the consensus SANS data were also consistent with prediction for all five proteins measured in H₂O and in D₂O. Thus, a foundation set of consensus SAS profiles has been obtained for benchmarking scattering-profile prediction from atomic coordinates. Additionally, two sets of SAXS data measured at different facilities to q > 2.2 Å^-1 showed good mutual agreement, affirming that this region has interpretable features for structural modelling. SAS measurements with inline size-exclusion chromatography (SEC) proved to be generally superior for eliminating sample heterogeneity, but with unavoidable sample dilution during column elution, while batch SAS data collected at higher concentrations and for longer times provided superior statistical precision. Careful merging of data measured using inline SEC and batch modes, or low- and high-concentration data from batch measurements, was successful in eliminating small amounts of aggregate or interparticle interference from the scattering while providing improved statistical precision overall for the benchmarking data set.

Keywords: X-ray scattering; benchmarking standards; biomolecular small-angle scattering; neutron scattering; scattering-profile calculation; standards.

open access.

PubMed Disclaimer

Figures

**Figure 1**
Ribbon representations of the crystal structures of RNaseA (PDB entry 7rsa, black), lysozyme (PDB entry 2vb1, red), xylanase (PDB entry 2dfc, blue), urate oxidase (PDB entry 3l8w, dark cyan, with added C-terminal SLKSKL in magenta) and xylose isomerase (PDB entry 1mnz, purple).

**Figure 2**
Distribution of Guinier and P(r)-derived R _g values, d _max values and the Porod volume to molecular mass ratio (V _P/m) for the data contributing to the consensus SAXS profiles for each protein. Individual experimental values are represented as black open squares, with horizontal offsets for clarity and error bars for R _g values (standard errors). Red squares represent the mean values for each set, with error bars indicating ±1 standard deviation.

**Figure 3**
(a, b) I(q) versus q for consensus SAXS data (symbols) for each protein with P(r) model fits (lines). Insets are Guinier plots to qR _g = 1.3. (c, d) Error-weighted residual plots for the P(r) model fits in (a) and (b), respectively. (e, f) P(r) versus r corresponding to the fits in (a) and (b), respectively. Error bars are standard errors, and where not apparent are smaller than the symbols. The colour key for the symbols throughout is RNaseA, black; lysozyme, red; xylanase, blue; urate oxidase, dark cyan; xylose isomerase, purple.

**Figure 4**
Distribution of Guinier- and P(r)-derived R _g values and associated d _max values for SANS measurements for each protein in D₂O (batch data, open black squares; SEC–SANS data, light blue filled squares) and H₂O (batch data, open blue squares; SEC–SANS data, orange filled squares) with horizontal offsets for clarity. Errors bars on individual R _g values are standard errors. Red squares represent the mean value for each set, with the error bar indicating ±1 standard deviation. The values for H₂O urate oxidase SEC–SANS are for data that have a constant additive background adjustment to match the batch data.

**Figure 5**
(a, b) I(q) versus q (symbols) with P(r) model fits (black lines) from consensus SANS data in D₂O for each protein, with Guinier plots (to qR _g = 1.3) as insets. (c, d) Error-weighted residual plots for the fits in (a) and (b), respectively. (e) and (f) are the same plots as in (a) and (b) but for SANS data in H₂O, with (g) and (h) showing the corresponding error-weighted residual plots. (i, j) Corresponding P(r) versus r plots for the fits in (a) and (b) (D₂O, solid squares) and (e) and (f) (H₂O, open squares). The colour key throughout is RNaseA, black; lysozyme, red; xylanase, blue; urate oxidase, dark cyan; xylose isomerase, purple. Error bars are standard errors, and where not apparent are smaller than the symbols.

**Figure 6**
Guinier- and P(r)-derived R _g values (open and filled black squares, respectively) for RNaseA, lysozyme, xylanase, urate oxidase and xylose isomerase consensus profiles for SAXS, SANS in H₂O and SANS in D₂O measurements (Tables 2 ▸ and 3 ▸) compared with R _g values predicted with *CRYSOL*/*CRYSON* (red filled triangles) and *WAXSiS* (blue filled inverted triangles) (Supplementary Table S8). Where they are not evident, error bars for R _g from the consensus curve are smaller than the symbols.

**Figure 7**
I(q) versus q and dimensionless Kratky [(qR _g)² I(q)/I(0) versus qR _g] plots for the consensus data (open black squares) overlaid with the profiles predicted by *WAXSiS* (red), *CRYSOL* (cyan), *Pepsi-SAXS* (orange) and *FoXS* (blue) from the crystal structures. Every second or third experiment point is omitted for clarity. I(q) versus q plots are offset vertically, while the Kratky plots are stacked vertically so that for each panel the dashed lines are for (qR _g)² I(q)/I(0) = 0.0 or 1.1 for the plots above or below, respectively. The solid black reference lines in the Kratky plots are at qR _g = 1.73. Error bars are standard errors based on propagated counting statistics.

**Figure 8**
I(q) versus q and dimensionless Kratky [(qR _g)² I(q)/I(0) versus qR _g] plots for the consensus SANS data (black squares) in D₂O (upper plots) or H₂O (lower plots) overlaid with the profiles predicted by *WAXSiS* (red), *CRYSON* (cyan) and *Pepsi-SANS* (orange) from the crystal structures. I(q) versus q plots are offset vertically for clarity; from top to bottom, RNaseA, lysozyme, xylanase, urate oxidase and xylose isomerase. Kratky plots are stacked vertically so that for each panel the dashed lines are for (qR _g)² I(q)/I(0) = 0.0 or 1.1 for the plots above or below, respectively. Black solid reference lines in the Kratky plots are at qR _g = 1.73. Error bars are standard errors based on propagated counting statistics.

**Figure 9**
Data for (top to bottom traces) RNaseA, lysozyme, xylanase, urate oxidase and xylose isomerase from SEC–WAXS (black symbols, measured on the P12 BioSAXS beamline at EMBL, no lysozyme data) and batch WAXS (red symbols, measured on the 12-ID-B beamline at the APS, no urate oxidase data). The plot is log–log, with every second data point skipped for the sake of clarity and starting at q = 0.1 Å⁻¹ to better highlight the data beyond q = 1 Å⁻¹. Full log–log and log linear plots are given in Supplementary Fig. S12. Error bars are standard errors based on propagated counting statistics.

See this image and copyright information in PMC

References

1. Abraham, M. J., Murtola, T., Schulz, R., Páll, S., Smith, J. C., Hess, B. & Lindahl, E. (2015). SoftwareX, 1–2, 19–25.
1. Basham, M., Filik, J., Wharmby, M. T., Chang, P. C. Y., El Kassaby, B., Gerring, M., Aishima, J., Levik, K., Pulford, B. C. A., Sikharulidze, I., Sneddon, D., Webber, M., Dhesi, S. S., Maccherozzi, F., Svensson, O., Brockhauser, S., Náray, G. & Ashton, A. W. (2015). J. Synchrotron Rad. 22, 853–858. - PMC - PubMed
1. Berendsen, H. J. C., Postma, J. P. M., van Gunsteren, W. F., DiNola, A. & Haak, J. R. (1984). J. Chem. Phys. 81, 3684–3690.
1. Bernadó, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. (2007). J. Am. Chem. Soc. 129, 5656–5664. - PubMed
1. Blanchet, C. E., Spilotros, A., Schwemmer, F., Graewert, M. A., Kikhney, A., Jeffries, C. M., Franke, D., Mark, D., Zengerle, R., Cipriani, F., Fiedler, S., Roessle, M. & Svergun, D. I. (2015). J. Appl. Cryst. 48, 431–443. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A round-robin approach provides a detailed assessment of biomolecular small-angle scattering data reproducibility and yields consensus curves for benchmarking

Affiliations

A round-robin approach provides a detailed assessment of biomolecular small-angle scattering data reproducibility and yields consensus curves for benchmarking

Authors

Affiliations

Abstract

Figures

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources