Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 26:6:31232.
doi: 10.1038/srep31232.

Metadynamic metainference: Enhanced sampling of the metainference ensemble using metadynamics

Affiliations

Metadynamic metainference: Enhanced sampling of the metainference ensemble using metadynamics

Massimiliano Bonomi et al. Sci Rep. .

Abstract

Accurate and precise structural ensembles of proteins and macromolecular complexes can be obtained with metainference, a recently proposed Bayesian inference method that integrates experimental information with prior knowledge and deals with all sources of errors in the data as well as with sample heterogeneity. The study of complex macromolecular systems, however, requires an extensive conformational sampling, which represents a separate challenge. To address such challenge and to exhaustively and efficiently generate structural ensembles we combine metainference with metadynamics and illustrate its application to the calculation of the free energy landscape of the alanine dipeptide.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Schematic illustration of the M&M algorithm.
In M&M, an ensemble of models is simulated in parallel. Typically a model is composed by the following variables: the coordinates of the system X and a list of variables σ that represent the level of noise in the data. In principle one can associate a σ variable to each experimental data point (Eq. 19), or use an outlier model of noise that requires a single σ for all data points (Eq. 20). The replicas are coupled by the metainference energy function, which is composed by different terms: one that describes prior information on the system X (for example a molecular mechanics force field), one that describes prior information on the σ variable (typically a Jeffrey’s prior), and one term that describes the agreement of the models with the experimental data. This last energy term couples the multiple replicas, as experimental data are expected to be generated by the average over an ensemble of conformations. To accelerate sampling, M&M adds to each replica the bias potential of PBMetaD. This is defined as a function of multiple CVs, which are selected from the user based on a priori knowledge of the system to include all the slow modes that need to be accelerated. The PBMetaD bias potential is defined in terms of multiple potentials (one for each CVs), which are stored on a common grid and shared among all replicas. In doing so, the multiple replicas all contribute to fill in parallel the underlying free-energy landascapes and thus to accelerate sampling of the entire ensemble.
Figure 2
Figure 2. Benchmark of M&M accuracy on the alanine dipeptide in vacuo.
We assumed that the ensemble generated by the prior alone (AMBER99SB-ILDN force field, (A,D) red lines) is inaccurate, and that in the actual ensemble the local minimum Cax is more populated (A,D, black lines). We then used M&M to combine the inaccurate prior with synthetic experimental data generated as averages on the correct ensemble. We calculated the error in the reconstructed free energies as a function of the backbone dihedrals ϕ (upper panels) and ψ (lower panels), using Gaussian (B,E) and outliers noise models (C,F). For both noise models, we benchmarked the method as a function of the number of data points, the level of noise in the data, and the number of replicas used.
Figure 3
Figure 3. Analysis of the uncertainties inferred by M&M.
The distributions of the σB parameter in representative runs with 8 replicas and all the 36 data points available, using the Gaussian (AC) and outliers noise models (BD), in absence (top panels) and presence of systematic errors (bottom panels). When using Gaussian noise, we plot the distributions of the uncertainty parameter associated to four representative data points (formula image). When systematic errors are present, we selected two outliers (formula image and formula image) and two data points not affected by error (formula image and formula image). When using the outliers model, we plot the distribution of the typical dataset uncertainty formula image.
Figure 4
Figure 4. Reweighting of M&M simulations.
Free energy of alanine dipeptide as a function of the backbone dihedrals obtained with the AMBER99SB-ILDN prior alone (A) and with the correction to lower the free energy of the local minimum Cax (B). The latter is considered our reference (exact) free energy. Free energy obtained from reweighting a M&M simulation carried out using 64 replicas, all the 36 data points available (without addition of systematic errors), the Gaussian noise model with one uncertainty parameter for each data point (C) and the outliers noise model with one parameter per dataset (D). The visualization is truncated at 50 kJ/mol from the global minimum. For each case (prior, exact, Gaussian and outliers noise), we also reported the monodimensional free energies as a function of the dihedrals ϕ (EH) and ψ (IL), calculated directly from the bias potentials. The same analysis using a dataset with systematic errors is reported in Fig. S7.

Similar articles

Cited by

References

    1. Schneidman-Duhovny D., Pellarin R. & Sali A. Uncertainty in integrative structural modeling. Curr. Opin. Struct. Biol. 28, 96–104 (2014). - PMC - PubMed
    1. Bonomi M., Camilloni C., Cavalli A. & Vendruscolo M. Metainference: a Bayesian inference method for heterogeneous systems. Sci. Adv. 2, e1501177 (2016). - PMC - PubMed
    1. Rieping W., Habeck M. & Nilges M. Inferential structure determination. Science 309, 303–306 (2005). - PubMed
    1. Cavalli A., Camilloni C. & Vendruscolo M. Molecular dynamics simulations with replica-averaged structural restraints generate structural ensembles according to the maximum entropy principle. J. Chem. Phys. 138, 094112 (2013). - PubMed
    1. Lindorff-Larsen K., Best R. B., Depristo M. A., Dobson C. M. & Vendruscolo M. Simultaneous determination of protein structure and dynamics. Nature 433, 128–132 (2005). - PubMed