Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 16;28(18):6659.
doi: 10.3390/molecules28186659.

Einstein Model of a Graph to Characterize Protein Folded/Unfolded States

Affiliations

Einstein Model of a Graph to Characterize Protein Folded/Unfolded States

Steve Tyler et al. Molecules. .

Abstract

The folded structures of proteins can be accurately predicted by deep learning algorithms from their amino-acid sequences. By contrast, in spite of decades of research studies, the prediction of folding pathways and the unfolded and misfolded states of proteins, which are intimately related to diseases, remains challenging. A two-state (folded/unfolded) description of protein folding dynamics hides the complexity of the unfolded and misfolded microstates. Here, we focus on the development of simplified order parameters to decipher the complexity of disordered protein structures. First, we show that any connected, undirected, and simple graph can be associated with a linear chain of atoms in thermal equilibrium. This analogy provides an interpretation of the usual topological descriptors of a graph, namely the Kirchhoff index and Randić resistance, in terms of effective force constants of a linear chain. We derive an exact relation between the Kirchhoff index and the average shortest path length for a linear graph and define the free energies of a graph using an Einstein model. Second, we represent the three-dimensional protein structures by connected, undirected, and simple graphs. As a proof of concept, we compute the topological descriptors and the graph free energies for an all-atom molecular dynamics trajectory of folding/unfolding events of the proteins Trp-cage and HP-36 and for the ensemble of experimental NMR models of Trp-cage. The present work shows that the local, nonlocal, and global force constants and free energies of a graph are promising tools to quantify unfolded/disordered protein states and folding/unfolding dynamics. In particular, they allow the detection of transient misfolded rigid states.

Keywords: Kirchhoff index; Wiener index; graph theory; intrinsically disordered proteins; molecular dynamics; protein folding.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
MD trajectory of Trp-cage at 380 K. Time t in red (t):ξ(t)>0.6. The yellow curve is computed for a moving mean with a window size of 1 ns.
Figure 2
Figure 2
Evolution of the global force constant K for the MD trajectory shown in Figure 1. The bold green curve is computed for a moving mean with a window size of 1 ns.
Figure 3
Figure 3
Panels (a,b) represent respectively the PDF of (ξ,K) values and (ξ,l0) computed from the trajectory shown in Figure 1.
Figure 4
Figure 4
Comparison between the average shortest path length (blue) and global force constant (green) for the MD trajectory shown in Figure 1. The bold green curve is computed for a moving mean with a window size of 1 ns. Times t0, t1, t2, t3, t4, and t5 discussed in the text are indicated.
Figure 5
Figure 5
Relationship between K and l computed for the MD trajectory in Figure 1. Panel (a) PDF of (K,l0) (blue dots) and pairs of values (K,l0) for three selected snapshots named s1 (green dot), s2 (pink dot), and s3 (red dot) with the same value of l0 as discussed in the text. Red line is the result of application of Equation (44). Black dots are the results of model chains with regular long distance spring force constants of different lengths named (20,j=1,2,3) in the main text. Panel (b) PDF of (K,l0) from all snapshots with ξ>0.6 (blue). Red line and black dots are as in Panel (a). Orange dots are the (K,l0) values of the experimental NMR models of Trp-cage (PDB ID: 12lY). Colors dots correspond to the values computed for the snapshots at times t0 to t5 indicated at Figure 4. Panels (ck) are three-dimensional representations of the structures s1, s2, s3 in Panel (a) and of the structures at times t0, t1, t2, t3, t4, t5, respectively. The spheres are the positions of the Cα atoms, and the tube represents the backbone. The black lines are the contacts considered to build the PG.
Figure 5
Figure 5
Relationship between K and l computed for the MD trajectory in Figure 1. Panel (a) PDF of (K,l0) (blue dots) and pairs of values (K,l0) for three selected snapshots named s1 (green dot), s2 (pink dot), and s3 (red dot) with the same value of l0 as discussed in the text. Red line is the result of application of Equation (44). Black dots are the results of model chains with regular long distance spring force constants of different lengths named (20,j=1,2,3) in the main text. Panel (b) PDF of (K,l0) from all snapshots with ξ>0.6 (blue). Red line and black dots are as in Panel (a). Orange dots are the (K,l0) values of the experimental NMR models of Trp-cage (PDB ID: 12lY). Colors dots correspond to the values computed for the snapshots at times t0 to t5 indicated at Figure 4. Panels (ck) are three-dimensional representations of the structures s1, s2, s3 in Panel (a) and of the structures at times t0, t1, t2, t3, t4, t5, respectively. The spheres are the positions of the Cα atoms, and the tube represents the backbone. The black lines are the contacts considered to build the PG.
Figure 6
Figure 6
Distribution of the local force constants at times t0 (bold green), t1 (light blue), t2 (red), t3 (brown), t4 (dark blue) and t5 (orange) indicated in Figure 4 and discussed in the text. The gray area limited by dashed lines represents the range of values observed in the MD trajectory.
Figure 7
Figure 7
Free-energy graph calculations for the trajectory of Figure 1. (a) Local (blue), nonlocal (green), global (red), and collective (black) free energy with ϵ=0. Horizontal dashed lines indicate the zero baselines of the free energies with the corresponding colors. (b) Enthalpy term of the free-energy graph with ϵ=1 (blue), ϵ=3 (green), and ϵ=5 (dark red). Horizontal dashed line indicates the zero baseline. d(t)idi(t) and d(0)idi(0). (c) Local free energy with ϵ=0 (blue) and ϵ=5 (green). Folded regions are indicated by red vertical lines as in Figure 1. Horizontal dashed lines indicate the zero baselines of the free energies with the corresponding colors.

Similar articles

References

    1. Dyson H.J., Wright P.E. Unfolded Proteins and Protein Folding Studied by NMR. Chem. Rev. 2004;104:3607–3622. doi: 10.1021/cr030403s. - DOI - PubMed
    1. Schuler B., Eaton W.A. Protein folding studied by single-molecule FRET. Curr. Opin. Struct. Biol. 2008;18:16–26. doi: 10.1016/j.sbi.2007.12.003. - DOI - PMC - PubMed
    1. Lai J.K., Kubelka G.S., Kubelka J. Sequence, structure, and cooperativity in folding of elementary protein structural motifs. Proc. Natl. Acad. Sci. USA. 2015;112:9890–9895. doi: 10.1073/pnas.1506309112. - DOI - PMC - PubMed
    1. Sukenik S., Pogorelov T.V., Gruebele M. Can Local Probes Go Global? A Joint Experiment–Simulation Analysis of λ6–85 Folding. J. Phys. Chem. Lett. 2016;7:1960–1965. doi: 10.1021/acs.jpclett.6b00582. - DOI - PubMed
    1. Muñoz V., Campos L.A., Sadqi M. Limited cooperativity in protein folding. Curr. Opin. Struct. Biol. 2016;36:58–66. doi: 10.1016/j.sbi.2015.12.001. - DOI - PubMed