Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Oct 25:8:727553.
doi: 10.3389/fmolb.2021.727553. eCollection 2021.

Model-Free or Not?

Affiliations
Review

Model-Free or Not?

Kai Zumpfe et al. Front Mol Biosci. .

Abstract

Relaxation in nuclear magnetic resonance is a powerful method for obtaining spatially resolved, timescale-specific dynamics information about molecular systems. However, dynamics in biomolecular systems are generally too complex to be fully characterized based on NMR data alone. This is a familiar problem, addressed by the Lipari-Szabo model-free analysis, a method that captures the full information content of NMR relaxation data in case all internal motion of a molecule in solution is sufficiently fast. We investigate model-free analysis, as well as several other approaches, and find that model-free, spectral density mapping, LeMaster's approach, and our detector analysis form a class of analysis methods, for which behavior of the fitted parameters has a well-defined relationship to the distribution of correlation times of motion, independent of the specific form of that distribution. In a sense, they are all "model-free." Of these methods, only detectors are generally applicable to solid-state NMR relaxation data. We further discuss how detectors may be used for comparison of experimental data to data extracted from molecular dynamics simulation, and how simulation may be used to extract details of the dynamics that are not accessible via NMR, where detector analysis can be used to connect those details to experiments. We expect that combined methodology can eventually provide enough insight into complex dynamics to provide highly accurate models of motion, thus lending deeper insight into the nature of biomolecular dynamics.

Keywords: NMR relaxation; dynamics detectors; model-free analysis; molecular dynamics simulation; solid-state NMR.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Complexity of reorientational dynamics. For each bond in a molecule, multiple types of motion result in orientational sampling, where the distribution of angles for each motion result in a generalized order parameter, S 2. Therefore, in (A) we plot a possible distribution of Euler angles for a single type of motion (population is plotted as a function of angles β and γ, where α is not required for a symmetric interaction tensor). A single motion is furthermore described by a correlation time, and may be distributed over a range of correlation times. In (B) we plot a possible distribution of correlation times (1S2)θ(z) , that is, amplitude of motion as a function of the log-correlation time, z=log10(τc/s) . Each distribution is characterized by an amplitude, center, and width. Note that the integral of the distribution is (1S2) , S 2 being determined by the distribution of angles in (A). While (A,B) illustrate aspects of a single motion, multiple motions influence a given bond, where the total correlation function is the product of individual correlation functions. In (C), we plot four distributions of motion (color). Above each motion, we plot the distribution resulting from the product of that motion and all motions below it (black), eventually resulting in the total distribution seen at the top. Finally, we note that the total distribution varies as a function of position in the molecule, resulting in the 3D plot of the distribution as a function of correlation time and position in the molecule observed in (D). While this is just an illustration, one could imagine that motion in (D) results from three α-helices in a protein, each having a slightly different behavior, and varying dynamics as one approaches the end of each helix.
FIGURE 2
FIGURE 2
Five distributions of orientations and correlation times that yield the same model-free parameters ( (1S2) = 0.3, τe = 0.1 ns). In (A–E), we plot a distribution of orientations (sphere, right); on the axes, we plot the distribution of correlation times resulting from exchange among that set of orientations. Models of motion are wobbling-on-a-cone ( θcone=19° ), wobbling-in-a-cone ( θcone=28° ), symmetric two-site hop ( θhop=39° ), asymmetric two-site hop ( θhop=70° ), and 6-site asymmetric exchange. Insets in (A,B) show correlation times with small amplitudes. Sresid.2 refers to the order parameter from residual couplings (see Supplementary Section S3), which deviates from the generalized order parameter for asymmetric motion.
FIGURE 3
FIGURE 3
Model-free fit parameters as a function of input parameters. For each plot, a data set is calculated, using the experiments found in from Table I of Lipari and Szabo (1982b), and the resulting rate constants are fit using the model-free approach, with the resulting τefit and (1S2) shown on the left and right, respectively. For all plots, the tumbling correlation time is τM = 4 ns and (1S2) = 0.3. One correlation time of the internal motion is varied, and we plot τein on the x-axis. In each plot, we fit using the full spectral density (blue, solid, see Eq. 6) and using a linear approximation (red, dashed, see Eq. 5). In (A), the input correlation function only has a single correlation time. In (B), one correlation time is fixed to 10 ps, and the second correlation time is swept. In (C), a log-Gaussian distribution (μ = 10 ps, σ = 0.75 order of magnitude) is combined with a correlation time that is varied (with total amplitude equal). On the left plots, black dotted lines indicate where the input value, τein , matches the fit, τefit . In all plots, vertical black dotted lines indicate where ωτein=0.5 for ω/2π = 90 MHz, where this frequency corresponds to the highest field used for the data set.
FIGURE 4
FIGURE 4
EMF parameters as a function of input correlation time (solution-state). For each plot, a data set is calculated, using the set of experiments from Clore et al. (1990), and the resulting rate constants are fitted using the EMF approach. For all plots, τM = 8.3 ns, and the input (1S2) = 0.3. In each subplot, the fitted correlation times (left) and amplitudes (right) are shown, as a function of an input correlation time (x-axis). In (A), the input correlation function has two correlation times (with equal amplitudes), with one fixed at 10 ps, and the other swept. In (B), the input correlation function has three correlation times, two fixed at 10 ps and 1 ns, and the third is swept. In (C), a log-Gaussian distribution of correlation times is used (μ = 100 ps, σ = 0.75 orders of magnitude), and a single correlation time is swept. Black dotted lines show the input correlation times (left plots).
FIGURE 5
FIGURE 5
EMF parameters as a function of input correlation time (solid-state). For each plot, a data set is calculated, including direct measurement of S resid. via residual couplings (Eq. 17), 15N T 1 at 400, 500, and 850 MHz, and T 2 with MAS of 60 kHz. The resulting rate constants are fitted using the EMF approach. For all plots, (1S2) = 0.3. In each subplot, the fitted correlation times (left) and amplitudes (right) are shown, as a function of an input correlation time (x-axis). In (A), the input correlation function has two correlation times (with equal amplitudes), with one fixed at 3.2 ps, and the other swept. In (B), the input correlation function has three correlation times, two fixed at 3.2 ps and 32 ns, and the third is swept. In (C), a log-Gaussian distribution of correlation times is used (μ = 100 ps, σ = 0.75 orders of magnitude), and a single correlation time is swept. Black dotted lines show the input correlation times (left plots).
FIGURE 6
FIGURE 6
Behavior of SDM as a function of correlation time. In each subplot, we calculate 15N T 1, T 2, and σNH at 600 MHz, and analyze the results using Eq. 21. In (A), the input total correlation function consists of a single decaying exponential term (with amplitude 1), where the terms J(ω) are plotted as the correlation time is varied (results are normalized). Black dotted lines show the spectral densities, J(0) , J(ωI) , J(0.870ωS) , calculated with Eq. 22, and colored lines show the results of the data analysis, yielding an almost exact correspondence. In (B), the total correlation function now uses two correlation times (equal amplitudes), with one fixed at 10 ps, and the second swept (x-axis). On the y-axis, we plot contribution to the terms, ΔJ(ω) , from the correlation time being varied. The resulting behavior is identical to that in (A), except that the amplitude is half as large, since we have split the total amplitude between the fixed and variable correlation time (dashed line marks 0.5). In (C), the same information is plotted, but the total correlation function includes a log-Gaussian distribution (μ = 630 ps, σ = 1 order of magnitude), and a single, variable correlation time.
FIGURE 7
FIGURE 7
Behavior of LeMaster’s approach as a function of correlation time. In each subplot, we calculate 15N T 1, T 2, and σNH at 600 MHz for motion with (1S2) = 0.3 and tumbling correlation time of τM = 4 ns, and analyze the results using Eq. 24. In (A) the internal correlation function consists of a single decaying exponential term (with amplitude 0.3), where the fitted amplitudes are plotted as the correlation time is varied. In (B) the internal correlation function uses two correlation times (both amplitudes are 0.15), with one correlation time fixed at 10 ps, and the second swept (x-axis). On the y-axis, we plot contributions to the terms from the correlation time being varied. The resulting behavior is identical to that in (A), except that the amplitude is half, since we have split the total amplitude between the fixed and variable correlation time (dashed line marks 0.15). In (C), the same information is plotted, but the total correlation function includes a log-Gaussian distribution (μ = 630 ps, σ = 1 order of magnitude), and a single, variable correlation time.
FIGURE 8
FIGURE 8
Behavior of the IMPACT approach as a function of correlation time. In each plot, we fit calculated relaxation rate constants, and fit the amplitudes in Eq. 25 according to the IMPACT procedure, using the set of experiments from Khan et al. (2015). In (A), the input total correlation function consists of a single decaying exponential term (with amplitude 1), where the amplitudes are plotted as the correlation time is varied. In (B), the total correlation function uses two correlation times (equal amplitudes), with one fixed at 1 ns, and the second swept (x-axis). On the y-axis, we plot contributions to the Ak from the correlation time being varied. In (C), the same information is plotted, but the total correlation function includes a log-Gaussian distribution (μ = 630 ps, σ = 1 order of magnitude), and a single, variable correlation time.
FIGURE 9
FIGURE 9
IMPACT behavior in solids. In each plot, we test the behavior of the amplitudes, Ak , using calculated solid-state NMR data (S 2, 15N R 1, 15N R 1ρ , with experimental conditions taken from Smith et al. (2016)). (A) plots the behavior of fitting S 2 and three R 1 rate constants to three correlation times (1 ps, 1.4 ns, 5 ns), where the input correlation function has a single correlation time ( (1S2) =0.3), while restricting the Ak to fall between 0 and 1. (B) shows fits under the same conditions, but includes two correlation times, with one fixed at 1 ns, and the other swept (x-axis). The y-axis plots the change in the Ak due to the swept correlation time. (C) shows fits under the same conditions as (A), without restricting the values of the Ak . (D) also removes restrictions on the Ak , but fits S 2 and R 1ρ data, using correlation times of (1 ps, 2.5 μs, and 17.8 μs). (E) fits all data (S 2, R 1, R 1ρ ) simultaneously without restrictions on the Ak , with correlation times of 1 ps, 1.4 ns, 5 ns, 2.5 μs, and 17.8 μs (F) fits R 1 data, but uses one very short correlation time (32 ps), and one very long correlation time (100 ns).
FIGURE 10
FIGURE 10
Similarity between the CIE XYZ colorspace and the relaxation rate constant space. (A) plots the XYZ colorspace, black lines indicate where single wavelengths fall in the colorspace (z not shown, space is normalized such that x + y + z = 1). Points connected by a triangle indicate the definition of red, green, and blue colors as defined by the sRGB standard (Anderson et al., 1996). (B) plots the sensitivity of the x¯(λ) , y¯(λ) , and z¯(λ) color matching functions as a function of wavelength (λ). (C) plots sRGB sensitivities resulting from transformation from the XYZ to sRGB spaces. Points connected by triangles correspond to definitions of r1 , r2 , and r3 that define the detector space. (D) shows the normalized relaxation rate space for 13C R 1 at 300 and 800 MHz and H–C NOE at 800 MHz. (E) shows the sensitivities of each of these experiments a function of correlation time. (F) shows detector sensitivities resulting from transformation from the relaxation rate constant space to detector space (defined by the points in (D)).
FIGURE 11
FIGURE 11
Model-free analysis from detectors. (A) shows a detector analysis of HET-s (218–289) fibrils (Smith et al., 2016), with sensitivities shown in (B) (amplitude scale not shown; sensitivities have a maximum of 1). (B) illustrates the procedure to convert 273Ser detector responses into model-free parameters. Bars give the detector responses (y-axis), plotted at the center of the corresponding detector’s sensitivity (x-axis, note that ρ 0, blue, does not have a well-defined center). At top, we find the ratio of ρ3(θ,S)/ρ2(θ,S) is consistent with a correlation time of 34 ns, with corresponding amplitude of 0.075 (intermediate motion). After subtracting the contribution of this correlation time to ρ0(θ,S) (middle), we find the ratio ρ1(θ,S)/ρ0(θ,S) is consistent with a correlation time of 49 ps, and amplitude of 0.12 (fast motion). Using a fixed correlation time of 14.7 μs, we find an amplitude for the slow motion of 1.8 × 10−3 (bottom). (C) shows the results of EMF analysis for all residues using the procedure in (B).
FIGURE 12
FIGURE 12
Separating population from hop angle and change in chemical shift in NERRD and BMRD experiments. Relevant parameters are shown as insets ( ωr/2π =40 kHz for NERDD plots). In (A–D), contour plots are shown for NERDD and BMRD relaxation rate constants under various conditions, and in each plot, a contour shows all values of p1 and θ or Δω1 that yield R1ρ equal to the value obtained for p1 = 0.25 and θ=16° or ΔωI = 500 Hz (marked as a cross on each plot). In (E–L), we only show the contour, but for a range of experimental conditions (five experiments, linearly spaced, with range indicated in the plot). In some cases, this yields nearly identical contours, such that we only see one of the five contours.
FIGURE 13
FIGURE 13
Behavior of fitting R 1ρ data to an offset and a fixed correlation time. (A) shows the offset term, R1ρ0 , divided by 2000, and the order parameter for the slow correlation time, (1Ss2) resulting from fitting calculated relaxation rate constants as a function of correlation time to Eq. 36. Experiments are 15N R 1ρ acquired with MAS frequency of 60 kHz and spin-lock strengths of 11, 16, 25, 38, and 51 kHz, and τs is fixed at 18.5 μs. (B) shows detector sensitivities optimized using the same data set. (C) shows R1ρ0 divided by 2000 and (1Ss2) for 13C R 1ρ acquired with MAS frequency of 60 kHz and spin-lock strengths of 9, 18, 35, and 48 kHz, as well as an additional experiment with MAS frequency of 40 kHz and spin-lock strength of 25 kHz τs is fixed at 7.0 μs. (D) shows detector sensitivities optimized using the same data set.
FIGURE 14
FIGURE 14
Combining NMR and MD. (A) plots normalized NMR sensitivities for a selection of experiments (S 2, 15N R 1 at 400, 500, 850 MHz, 15N R 1ρ at 850 MHz, 60 kHz MAS, ν 1 at 10, 25, and 45 kHz). (B) shows a linear combination of the normalized sensitivities (x, y positions shifted to reduce plot overlap), which yields the sensitivity of ρ 2, shown in (C) (green, bold). In color are the weighted contributions from each rate constant, and grey shows the cumulative sum (summing all sensitivities at and below the grey line). (C) shows the five sensitivities optimized from NMR data. (D) plots sensitivities of time points from MD-derived correlation functions (0 s, 10 points log-spaced from 50 ps to 1 μs). (E) shows a linear combination of those sensitivities, optimized to match the sensitivity of ρ 2 (x, y positions shifted to reduce plot overlap). (F) shows detectors optimized to match the NMR-derived detectors in (C). (G) shows spatial correlation of motion in a helix as a function of correlation time (windows for <20, ∼20, ∼100, ∼800 ns). Color intensity and bond radii indicate the correlation coefficient between that residue’s H–N motion and the motion of the black residue. (H) illustrates frames used to separate transformation from the PAS to the lab frame into four steps: a peptide plane frame, a helix frame, and a molecule frame (illustration inspired by Brown (1996), molecule plots created with ChimeraX (Pettersen et al., 2021)).

References

    1. Abergel D., Palmer A. G. (2003). On the Use of the Stochastic Liouville Equation in Nuclear Magnetic Resonance: Application to R1ρ Relaxation in the Presence of Exchange. Concepts Magn. Reson. 19A, 134–148. 10.1002/cmr.a.10091 - DOI
    1. Altis A., Nguyen P. H., Hegger R., Stock G. (2007). Dihedral Angle Principal Component Analysis of Molecular Dynamics Simulations. J. Chem. Phys. 126, 244111. 10.1063/1.2746330 - DOI - PubMed
    1. Amadei A., Linssen A. B. M., Berendsen H. J. C. (1993). Essential Dynamics of Proteins. Proteins 17, 412–425. 10.1002/prot.340170408 - DOI - PubMed
    1. Anderson M., Motta R., Chandrasekar S., Stokes M. (1996). “Proposal for a Standard Default Color Space for the Internet—sRGB,” in Color and Imaging Conference (Society for Imaging Science and Technology; ), 238–245.
    1. Antila H. S., Ferreira T. M., Ollila O. H. S., Miettinen M. S. (2021). Using Open Data to Rapidly Benchmark Biomolecular Simulations: Phospholipid Conformational Dynamics. J. Chem. Inf. Model. 61, 938–949. 10.1021/acs.jcim.0c01299 - DOI - PMC - PubMed

LinkOut - more resources