Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Dec 16;97(12):3196-205.
doi: 10.1016/j.bpj.2009.09.031.

Learning rates and states from biophysical time series: a Bayesian approach to model selection and single-molecule FRET data

Affiliations

Learning rates and states from biophysical time series: a Bayesian approach to model selection and single-molecule FRET data

Jonathan E Bronson et al. Biophys J. .

Abstract

Time series data provided by single-molecule Förster resonance energy transfer (smFRET) experiments offer the opportunity to infer not only model parameters describing molecular complexes, e.g., rate constants, but also information about the model itself, e.g., the number of conformational states. Resolving whether such states exist or how many of them exist requires a careful approach to the problem of model selection, here meaning discrimination among models with differing numbers of states. The most straightforward approach to model selection generalizes the common idea of maximum likelihood--selecting the most likely parameter values--to maximum evidence: selecting the most likely model. In either case, such an inference presents a tremendous computational challenge, which we here address by exploiting an approximation technique termed variational Bayesian expectation maximization. We demonstrate how this technique can be applied to temporal data such as smFRET time series; show superior statistical consistency relative to the maximum likelihood approach; compare its performance on smFRET data generated from experiments on the ribosome; and illustrate how model selection in such probabilistic or generative modeling can facilitate analysis of closely related temporal data currently prevalent in biophysics. Source code used in this analysis, including a graphical user interface, is available open source via http://vbFRET.sourceforge.net.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A single (synthetic) FRET trace analyzed by ME and ML. The trace contains three hidden states. (A) (Upper) Idealized traces inferred by ME when K = 1, K = 3, and K = 5, as well as the corresponding log(evidence) for the inference. The data are underresolved when K = 1, but for both K = 3 and K = 5, the correct number of states is populated. (Lower) Idealized traces inferred by ML when K = 1, K = 3, and K = 5, as well as the corresponding log(likelihood). Inferences when K = 1 and K = 3 are the same as for ME, but the data are overfit when K = 5. (B) The log (evidence) from ME (black) and log likelihood from ML (gray) for 1 ≤ K ≤ 10. The evidence is correctly maximized for K = 3, but the likelihood increases monotonically.
Figure 2
Figure 2
Comparison of ME and ML as a function of increasing hidden-state noise. Fast-transitioning (hidden-state mean lifetime of four time steps) and slow-transitioning (hidden-state mean lifetime of 15 time steps) traces were created and analyzed separately. Each data point represents the average value taken over 100 traces. (Upper left) p(|z^|=|z0|): the probability in any trace of inferring the correct number of states. (Upper right) p(z^=z0): the probability in any trace at any time that a transition is inferred given that a transition actually occurred. (Lower left) Sensitivity to true transitions: the fraction of time the correct FRET state was inferred during FRET trajectories. (Lower right) Specificity of inferred transitions: the probability in any trace at any time that no transition is inferred given that no transition actually occurred. Error bars on all plots were omitted for clarity and because the data plotted represent mean success rates for Bernoulli processes (and, therefore, determine the variances of the data as well).
Figure 3
Figure 3
Analysis of the smFRETL1–tRNAfmid state. (A) A representative smFRETL1–tRNA trace idealized by ME, taken from the 50-ms exposure time data set. Both the observed data (blue) and idealized path (red) are shown. Individual data points, real and idealized, are shown as Xs. To emphasize the data at or near fmid, the Xs are enlarged and the observed and idealized data are shown in black and green, respectively. (B) Bar graph of the percentages of transitions to or from the fmid state under 25 ms, 50 ms, and 100 ms CCD integration time. (C) Normalized population histograms of dwell time spent at the fmid state under 25 ms, 50 ms, and 100 ms CCD integration time.

Similar articles

Cited by

References

    1. Koster D., Wiggins C., Dekker N. Multiple events on single molecules: Unbiased estimation in single-molecule biophysics. Proc. Natl. Acad. Sci. USA. 2006;103:1750–1755. - PMC - PubMed
    1. Moffitt J.R., Chemla Y.R., Aathavan K., Grimes S., Jardine P.J. Intersubunit coordination in a homomeric ring ATPase. Nature. 2009;457:446–450. - PMC - PubMed
    1. Munro J.B., Altman R.B., O'Connor N., Blanchard S.C. Identification of two distinct hybrid state intermediates on the ribosome. Mol. Cell. 2007;25:505–517. - PMC - PubMed
    1. Zhuang X.W., Bartley L.E., Babcock H.P., Russell R., Ha T.J. A single-molecule study of RNA catalysis and folding. Science. 2000;288:2048–2051. - PubMed
    1. Zhuang X.W., Kim H., Pereira M.J.B., Babcock H.P., Walter N.G. Correlating structural dynamics and function in single ribozyme molecules. Science. 2002;296:1473–1476. - PubMed

Publication types