Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 16;120(22):5124-5135.
doi: 10.1016/j.bpj.2021.10.003. Epub 2021 Oct 8.

Refining conformational ensembles of flexible proteins against small-angle x-ray scattering data

Affiliations

Refining conformational ensembles of flexible proteins against small-angle x-ray scattering data

Francesco Pesce et al. Biophys J. .

Erratum in

Abstract

Intrinsically disordered proteins and flexible regions in multidomain proteins display substantial conformational heterogeneity. Characterizing the conformational ensembles of these proteins in solution typically requires combining one or more biophysical techniques with computational modeling or simulations. Experimental data can either be used to assess the accuracy of a computational model or to refine the computational model to get a better agreement with the experimental data. In both cases, one generally needs a so-called forward model (i.e., an algorithm to calculate experimental observables from individual conformations or ensembles). In many cases, this involves one or more parameters that need to be set, and it is not always trivial to determine the optimal values or to understand the impact on the choice of parameters. For example, in the case of small-angle x-ray scattering (SAXS) experiments, many forward models include parameters that describe the contribution of the hydration layer and displaced solvent to the background-subtracted experimental data. Often, one also needs to fit a scale factor and a constant background for the SAXS data but across the entire ensemble. Here, we present a protocol to dissect the effect of the free parameters on the calculated SAXS intensities and to identify a reliable set of values. We have implemented this procedure in our Bayesian/maximum entropy framework for ensemble refinement and demonstrate the results on four intrinsically disordered proteins and a protein with three domains connected by flexible linkers. Our results show that the resulting ensembles can depend on the parameters used for solvent effects and suggest that these should be chosen carefully. We also find a set of parameters that work robustly across all proteins.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Reweighting ensembles using SAXS data calculated using different values for the parameters that effect the contribution from for the hydration layer and displaced solvent. The grids show the results from the iBME ensemble optimization with different combinations of δρ and r0. The top row (ac) shows Hst5, the second row (df) shows Sic1, the third row (gi) shows Tau, and the last row (jl) shows results for TIA1. For each protein, we show in the first column (a, d, g, and j) ln(χred2), we show in the second column (b, e, h, and k) φeff, and we show in the third column (c, f, i, and l) γ=ln(χred2φeff). White spots correspond to ensembles in which the iBME reweighting failed. To see this figure in color, go online.
Figure 2
Figure 2
Comparing ensembles relative to the optimum. For each protein (a: Hst5, b: Sic1, c: Tau and d: TIA1) we calculated the effective fraction of frames (shown here as φeff) between the weights obtained using the parameters in Table 1 and the weights obtained at all other combinations of r0 and δρ. White spots correspond to ensembles in which the iBME reweighting failed. Purple spots correspond to the minima for γ. To see this figure in color, go online.
Figure 3
Figure 3
Effect of the δρ and r0 parameters on reweighted probability distributions of Rg. We use Sic1 as an example and show p(Rg) from both the optimal (lowest γ) parameters (blue) as well as three other choices of r0 and δρ in the low-γ region (orange, green, and red). The insert shows the parameters used in each case and the results of the reweighting on the Rg distribution. To see this figure in color, go online.
Figure 4
Figure 4
Effect of the prior distribution. (a) Distributions of Rg of α-Synuclein sampled with flexible-meccano (FM), a99SB-disp (disp), and a03ws. (b) Reweighted Rg distributions, either from the optimal (lowest γ) δρ and r0 parameters for each ensemble (solid lines) or using the default values, we propose (δρ= 3.34 e/nm3 and r0 = 1.681 Å; dotted lines). To see this figure in color, go online.
Figure 5
Figure 5
Reweighting α-Synuclein ensembles using SAXS data calculated using different values for the parameters that effect the contribution from the hydration layer and displaced solvent. The grids show the results from the iBME ensemble optimization with different combinations of δρ and r0. The top row (ac) shows the results from the flexible-meccano ensemble, the second row (df) shows the results using a99SB-disp as the prior, and the third row (gi) shows the results from a03ws as the prior. For each ensemble we show in the first column (a, d, and g) ln(χred2), in the second column we show (b, e, and h) φeff, and in the third column (c, f, and i) we show γ=ln(χred2φeff). White spots correspond to ensembles in which the iBME reweighting failed. Purple spots in the third column correspond to the minima for γ. To see this figure in color, go online.
Figure 6
Figure 6
Estimating ‹Rg› from experimental SAXS profiles of (a) Hst5, (b) Sic1, (c) α-Synuclein, (d) Tau, and (e) TIA1 using Guinier fitting and ensemble refinement. We used the Guinier approximation to estimate Rg by fitting from the lowest measured value of q (in the case of Hst5 we ignored the first 10 points due to noise) to different values of qmax, reporting the results as Rg vs. qmaxRg (black circles). The horizontal black lines are the ensemble-averaged Rg calculated from the conformational ensembles (in the case of α-Synuclein, we used the flexible-meccano prior) with the chosen optimal r0 and δρ parameters (Table 1).

References

    1. Kikhney A.G., Svergun D.I. A practical guide to small angle X-ray scattering (SAXS) of flexible and intrinsically disordered proteins. FEBS Lett. 2015;589:2570–2577. - PubMed
    1. Grant T.D. Ab initio electron density determination directly from solution scattering data. Nat. Methods. 2018;15:191–193. - PubMed
    1. Prior C., Davies O.R., et al. Pohl E. Obtaining tertiary protein structures by the ab initio interpretation of small angle X-ray scattering data. J. Chem. Theory Comput. 2020;16:1985–2001. - PMC - PubMed
    1. He H., Liu C., Liu H. Model reconstruction from small-angle x-ray scattering data using deep learning methods. iScience. 2020;23:100906. - PMC - PubMed
    1. Hub J.S. Interpreting solution X-ray scattering data using molecular simulations. Curr. Opin. Struct. Biol. 2018;49:18–26. - PubMed

Publication types

Substances

LinkOut - more resources