Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov 6;11(6):064301.
doi: 10.1063/4.0000269. eCollection 2024 Nov.

Scaling and merging time-resolved pink-beam diffraction with variational inference

Affiliations

Scaling and merging time-resolved pink-beam diffraction with variational inference

Kara A Zielinski et al. Struct Dyn. .

Abstract

Time-resolved x-ray crystallography (TR-X) at synchrotrons and free electron lasers is a promising technique for recording dynamics of molecules at atomic resolution. While experimental methods for TR-X have proliferated and matured, data analysis is often difficult. Extracting small, time-dependent changes in signal is frequently a bottleneck for practitioners. Recent work demonstrated this challenge can be addressed when merging redundant observations by a statistical technique known as variational inference (VI). However, the variational approach to time-resolved data analysis requires identification of successful hyperparameters in order to optimally extract signal. In this case study, we present a successful application of VI to time-resolved changes in an enzyme, DJ-1, upon mixing with a substrate molecule, methylglyoxal. We present a strategy to extract high signal-to-noise changes in electron density from these data. Furthermore, we conduct an ablation study, in which we systematically remove one hyperparameter at a time to demonstrate the impact of each hyperparameter choice on the success of our model. We expect this case study will serve as a practical example for how others may deploy VI in order to analyze their time-resolved diffraction data.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflicts to disclose.

Figures

FIG. 1.
FIG. 1.
careless commands used in this work.
FIG. 2.
FIG. 2.
The origin of harmonic overlap in pink-beam crystallography depicted by the reciprocal lattice for a 2-dimensional crystal. The diffracting condition, shaded gray, is bounded by the three limiting spheres determined by the minimum and maximum wavelength of the spectrum and the resolution limit of the sample. Three reflections from the same central ray are on the diffracting condition. Their scattered beam wavevectors, depicted as arrows, have different magnitudes, inverse wavelengths, but are all parallel. The wavelengths and resolutions of these reflections are recorded in the table to the right. Because the scattered beams are parallel, they will arrive at the same location on the detector.
FIG. 3.
FIG. 3.
Results of a one-dimensional sweep of the multivariate prior correlation parameter. (a) The active site difference peak heights for various multivariate prior correlation values. There is a clear maximum at 0.990. (b) and (c) RSCCs for ligand alone and for the full model. (d) Rfree is provided as an additional global measure of map to model agreement. (e) and (f) CCPred was calculated to assess overfitting using either the maximum-likelihood weighted Pearson or Spearman correlation coefficients. Careless produces a posterior distribution for each intensity observation. The mean and standard deviation of this distribution are recorded in the *_predictions_#.mtz files saved after model training. The means of these distributions is typically used to compute CCPred or the correlation between observed and predicted intensities. Here, we quantify the uncertainty in CCPred using the bootstrap method whereby we resample the predicted intensities recorded in the careless output with replacement 1000 times yielding 1000 estimates of CCPred per hyperparameter setting. These bootstrapped estimates are visualized as violin plots. The optimal hyperparameter setting has the highest value while having the smallest gap between test and train. We observe this clearly at 0.990 for CCPred(Pearson) and at 0.999 for CCPred(Spearman), indicating that the exact optimum likely lies between the two.
FIG. 4.
FIG. 4.
The results of the one-dimensional sweep of Student's t degrees of freedom, ν, on active site difference peak heights, RSCCs, and CCPred. (a)–(c) The active site difference peak heights and RSCCs have a slight maximum at 32. (d) Rfree is provided as an additional global measure of map to model agreement. (e) CCpred(Pearson) has the smallest gap between test and train at 32, which are the optimal results. (f) CCpred(Spearman) is an alternative metric to assess overfitting. It has the best results at 32 with the highest overall value, albeit by a small margin, and the smallest gap between test and train. The distributions visualized in the violin plots were generated by the bootstrap method described in the caption of Fig. 3.
FIG. 5.
FIG. 5.
Careless infers the spectrum from wavelength metadata. (a) The spectrum of the x-ray beam at BioCARS measured by a channel-cut monochromator. (b) An example diffraction pattern from our DJ-1 dataset with the indicated spots circled and colored by the predicted peak wavelength. (c) A two-dimensional histogram of the scale value (systematic error) predicted by careless and the peak wavelength predicted by Precognition (Renz Research). The skewed distribution is similar to the BioCARS spectrum, indicating that careless can infer spectral information. (d) The equivalent two-dimensional histogram produced from the wavelength-normalization ablation study wherein careless did not have access to the peak wavelength of each reflection observation.

Update of

Similar articles

Cited by

References

    1. Tenboer J., Basu S., Zatsepin N., Pande K., Milathianaki D., Frank M., Hunter M., Boutet S., Williams G. J., Koglin J. E., Oberthuer D., Heymann M., Kupitz C., Conrad C., Coe J., Roy-Chowdhury S., Weierstall U., James D., Wang D., Grant T., Barty A., Yefanov O., Scales J., Gati C., Seuring C., Srajer V., Henning R., Schwander P., Fromme R., Ourmazd A., Moffat K., Thor J. J. V., Spence J. C. H., Fromme P., Chapman H. N., and Schmidt M., “ Time-resolved serial crystallography captures high-resolution intermediates of photoactive yellow protein,” Science 346, 1242 (2014).10.1126/science.1259357 - DOI - PMC - PubMed
    1. Dods R., Båth P., Morozov D., Gagnér V. A., Arnlund D., Luk H. L., Kübel J., Maj M., Vallejos A., Wickstrand C., Bosman R., Beyerlein K. R., Nelson G., Liang M., Milathianaki D., Robinson J., Harimoorthy R., Berntsen P., Malmerberg E., Johansson L., Andersson R., Carbajo S., Claesson E., Conrad C. E., Dahl P., Hammarin G., Hunter M. S., Li C., Lisova S., Royant A., Safari C., Sharma A., Williams G. J., Yefanov O., Westenhoff S., Davidsson J., DePonte D. P., Boutet S., Barty A., Katona G., Groenhof G., Brändén G., and Neutze R., “ Ultrafast structural changes within a photosynthetic reaction centre,” Nature 589, 310 (2021).10.1038/s41586-020-3000-7 - DOI - PubMed
    1. Gruhl T., Weinert T., Rodrigues M. J., Milne C. J., Ortolani G., Nass K., Nango E., Sen S., Johnson P. J. M., Cirelli C., Furrer A., Mous S., Skopintsev P., James D., Dworkowski F., Båth P., Kekilli D., Ozerov D., Tanaka R., Glover H., Bacellar C., Brünle S., Casadei C. M., Diethelm A. D., Gashi D., Gotthard G., Guixà-González R., Joti Y., Kabanova V., Knopp G., Lesca E., Ma P., Martiel I., Mühle J., Owada S., Pamula F., Sarabi D., Tejero O., Tsai C.-J., Varma N., Wach A., Boutet S., Tono K., Nogly P., Deupi X., Iwata S., Neutze R., Standfuss J., Schertler G., and Panneels V., “ Ultrafast structural changes direct the first molecular events of vision,” Nature 615, 939 (2023).10.1038/s41586-023-05863-6 - DOI - PMC - PubMed
    1. Olmos J. L., Pandey S., Martin-Garcia J. M., Calvey G., Katz A., Knoska J., Kupitz C., Hunter M. S., Liang M., Oberthuer D., Yefanov O., Wiedorn M., Heyman M., Holl M., Pande K., Barty A., Miller M. D., Stern S., Roy-Chowdhury S., Coe J., Nagaratnam N., Zook J., Verburgt J., Norwood T., Poudyal I., Xu D., Koglin J., Seaberg M. H., Zhao Y., Bajt S., Grant T., Mariani V., Nelson G., Subramanian G., Bae E., Fromme R., Fung R., Schwander P., Frank M., White T. A., Weierstall U., Zatsepin N., Spence J., Fromme P., Chapman H. N., Pollack L., Tremblay L., Ourmazd A., Phillips G. N., and Schmidt M., “ Enzyme intermediates captured “on the fly” by mix-and-inject serial crystallography,” BMC Biol. 16, 59 (2018).10.1186/s12915-018-0524-5 - DOI - PMC - PubMed
    1. Dasgupta M., Budday D., De Oliveira S. H. P., Madzelan P., Marchany-Rivera D., Seravalli J., Hayes B., Sierra R. G., Boutet S., Hunter M. S., Alonso-Mori R., Batyuk A., Wierman J., Lyubimov A., Brewster A. S., Sauter N. K., Applegate G. A., Tiwari V. K., Berkowitz D. B., Thompson M. C., Cohen A. E., Fraser J. S., Wall M. E., Van Den Bedem H., and Wilson M. A., “ Mix-and-inject XFEL crystallography reveals gated conformational dynamics during enzyme catalysis,” Proc. Natl. Acad. Sci. U. S. A. 116, 25634 (2019).10.1073/pnas.1901864116 - DOI - PMC - PubMed

LinkOut - more resources