Bioinformatics. 2010 Jun 15;26(12):i269-77.

doi: 10.1093/bioinformatics/btq177.

Markov dynamic models for long-timescale protein motion

Tsung-Han Chiang¹, David Hsu, Jean-Claude Latombe

PMID: 20529916
PMCID: PMC2881362
DOI: 10.1093/bioinformatics/btq177

Tsung-Han Chiang et al. Bioinformatics. 2010.

Affiliation

¹ Department of Computer Science, National University of Singapore, Singapore 117417, Singapore. chiangts@comp.nus.edu.sg


Abstract

Molecular dynamics (MD) simulation is a well-established method for studying protein motion at the atomic scale. However, it is computationally intensive and generates massive amounts of data. One way to address the dual challenges of computational efficiency and data analysis is to construct simplified models of long-timescale protein motion from MD simulation data. To this end, we propose Markov models with hidden states, in which the Markovian states represent potentially overlapping probability distributions over protein conformations. We also propose a principled criterion for evaluating the quality of a model by its ability to predict long-timescale protein motions. We tested our method on 2D synthetic energy landscapes and on two extensively studied peptides, alanine dipeptide and the villin headpiece subdomain (HP-35 NleNle). One interesting finding is that although a widely accepted model of alanine dipeptide contains six states, a simpler model with only three states predicts long-timescale motions equally well. We also used the constructed Markov models to estimate important kinetic and dynamic quantities for protein folding, in particular the mean first-passage time. The results are consistent with available experimental measurements.
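To make the mean first-passage time concrete, the following is a minimal sketch of how it can be computed from the state-level transition matrix of a discrete-time Markov chain. It is a generic textbook calculation, not necessarily the procedure used in the paper; the function name `mean_first_passage_times`, the `dt` parameter (time per model step) and the toy matrix are illustrative assumptions.

```python
import numpy as np

def mean_first_passage_times(P, target, dt=1.0):
    """Expected time to first reach `target` from each state.

    P      : row-stochastic transition matrix (P[i, j] = Pr(next = j | current = i))
    target : index of the target state (e.g. the state holding the native fold)
    dt     : assumed physical time per model step (hypothetical parameter)
    """
    P = np.asarray(P, dtype=float)
    n = P.shape[0]
    others = [i for i in range(n) if i != target]
    # Restrict the chain to the non-target states.
    Q = P[np.ix_(others, others)]
    # The expected step counts m satisfy m = 1 + Q m, i.e. (I - Q) m = 1.
    m = np.linalg.solve(np.eye(len(others)) - Q, np.ones(len(others)))
    mfpt = np.zeros(n)  # MFPT from the target to itself is taken as 0
    mfpt[others] = m
    return mfpt * dt

# Hypothetical 2-state toy chain: state 0 = "unfolded", state 1 = "folded".
P_toy = [[0.9, 0.1],
         [0.5, 0.5]]
mfpt_toy = mean_first_passage_times(P_toy, target=1)
```

In the toy chain the unfolded state leaves with probability 0.1 per step, so the expected time to reach the folded state is about 10 steps. The linear solve assumes the target is reachable from every other state; otherwise the system is singular.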

Figures

**Fig. 1.**
Five synthetic energy landscapes and the corresponding models.

**Fig. 2.**
Average log-likelihood scores of the models for synthetic energy landscapes.

**Fig. 3.**
Average log-likelihood scores of alanine dipeptide models.

**Fig. 4.**
Conformations generated from the 3-state model A3 (a) and the 6-state model M6 (b).

**Fig. 5.**
Average log-likelihood scores for the villin headpiece models.

**Fig. 6.**
(a) Main state transitions of the 20-state villin headpiece model. The size of each node is proportional to the probability of the corresponding state in the stationary distribution. The width of each edge is proportional to the transition probability. States with probability <0.01 in the stationary distribution, self-transitions and transitions with probability <0.002 are not shown to avoid cluttering the diagram. The initial conformations most likely belong to state 12, and the native conformation most likely belongs to state 15. (b) Example conformations from states 7, 12, 13, 15 and 18. The residues forming helix 1 are drawn in red. (c) The most likely state transition sequences from states 12 to 15.
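The filtering rule described in the Fig. 6 caption can be sketched as follows: compute the stationary distribution of the transition matrix, then keep only edges between sufficiently probable states whose transition probability passes the threshold. The thresholds 0.01 and 0.002 come from the caption; the function names and the toy matrix are assumptions for illustration, and this is not the authors' plotting code.

```python
import numpy as np

def stationary_distribution(P):
    """Stationary distribution pi of a row-stochastic matrix P (pi P = pi, sum(pi) = 1)."""
    P = np.asarray(P, dtype=float)
    n = P.shape[0]
    # Stack the balance equations (P^T - I) pi = 0 with the normalization row.
    A = np.vstack([P.T - np.eye(n), np.ones(n)])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def edges_to_draw(P, pi, min_state_prob=0.01, min_trans_prob=0.002):
    """Return (i, j, P[i, j]) for the transitions that survive the caption's filters."""
    P = np.asarray(P, dtype=float)
    n = P.shape[0]
    keep = pi >= min_state_prob  # drop states that are rare at stationarity
    edges = []
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # self-transitions are not drawn
            if keep[i] and keep[j] and P[i, j] >= min_trans_prob:
                edges.append((i, j, P[i, j]))
    return edges

# Hypothetical 3-state toy chain in which state 2 is rarely visited.
P_toy = np.array([[0.899, 0.100, 0.001],
                  [0.500, 0.499, 0.001],
                  [0.500, 0.500, 0.000]])
pi_toy = stationary_distribution(P_toy)
edges_toy = edges_to_draw(P_toy, pi_toy)
```

In the toy chain only the two transitions between states 0 and 1 survive, because state 2 has stationary probability of roughly 0.001. The least-squares solve assumes the chain has a unique stationary distribution.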

Cited by

Computational models of protein kinematics and dynamics: beyond simulation.
Gipson B, Hsu D, Kavraki LE, Latombe JC. Annu Rev Anal Chem (Palo Alto Calif). 2012;5:273-91. doi: 10.1146/annurev-anchem-062011-143024. Epub 2012 Apr 9. PMID: 22524225. Review.
QAARM: quasi-anharmonic autoregressive model reveals molecular recognition pathways in ubiquitin.
Savol AJ, Burger VM, Agarwal PK, Ramanathan A, Chennubhotla CS. Bioinformatics. 2011 Jul 1;27(13):i52-60. doi: 10.1093/bioinformatics/btr248. PMID: 21685101.
Quantifying the Sources of Kinetic Frustration in Folding Simulations of Small Proteins.
Savol AJ, Chennubhotla CS. J Chem Theory Comput. 2014 Aug 12;10(8):2964-2974. doi: 10.1021/ct500361w. Epub 2014 Jun 13. PMID: 25136267.
SIMS: a hybrid method for rapid conformational analysis.
Gipson B, Moll M, Kavraki LE. PLoS One. 2013 Jul 23;8(7):e68826. doi: 10.1371/journal.pone.0068826. Print 2013. PMID: 23935893.

