Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug 18;17 Suppl 4(Suppl 4):546.
doi: 10.1186/s12864-016-2798-8.

Computing energy landscape maps and structural excursions of proteins

Affiliations

Computing energy landscape maps and structural excursions of proteins

Emmanuel Sapin et al. BMC Genomics. .

Abstract

Background: Structural excursions of a protein at equilibrium are key to biomolecular recognition and function modulation. Protein modeling research is driven by the need to aid wet laboratories in characterizing equilibrium protein dynamics. In principle, structural excursions of a protein can be directly observed via simulation of its dynamics, but the disparate temporal scales involved in such excursions make this approach computationally impractical. On the other hand, an informative representation of the structure space available to a protein at equilibrium can be obtained efficiently via stochastic optimization, but this approach does not directly yield information on equilibrium dynamics.

Methods: We present here a novel methodology that first builds a multi-dimensional map of the energy landscape that underlies the structure space of a given protein and then queries the computed map for energetically-feasible excursions between structures of interest. An evolutionary algorithm builds such maps with a practical computational budget. Graphical techniques analyze a computed multi-dimensional map and expose interesting features of an energy landscape, such as basins and barriers. A path searching algorithm then queries a nearest-neighbor graph representation of a computed map for energetically-feasible basin-to-basin excursions.

Results: Evaluation is conducted on intrinsically-dynamic proteins of importance in human biology and disease. Visual statistical analysis of the maps of energy landscapes computed by the proposed methodology reveals features already captured in the wet laboratory, as well as new features indicative of interesting, unknown thermodynamically-stable and semi-stable regions of the equilibrium structure space. Comparison of maps and structural excursions computed by the proposed methodology on sequence variants of a protein sheds light on the role of equilibrium structure and dynamics in the sequence-function relationship.

Conclusions: Applications show that the proposed methodology is effective at locating basins in complex energy landscapes and computing basin-basin excursions of a protein with a practical computational budget. While the actual temporal scales spanned by a structural excursion cannot be directly obtained due to the foregoing of simulation of dynamics, hypotheses can be formulated regarding the impact of sequence mutations on protein function. These hypotheses are valuable in instigating further research in wet laboratories.

Keywords: Energy landscape map; Evolutionary algorithm; Low-cost paths; Mechanical work; Multi-basin energy landscape; Multi-state protein; Nearest-neighbor graph; Protein equilibrium dynamics; Sample-based representation; Structural excursion.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Run time profiling. The break-down of the run time along each of the three main components of the methodology is shown for SOD1, CaM, and H-Ras WT. The path query time refers to the proportion of running time spent on computing the lowest-cost path
Fig. 2
Fig. 2
Crossover evaluation. The top panel shows the average fitness/energy among individuals in the hall of fame list (map), as the map is updated over generations. The bottom panel shows the average diversity among individuals in the hall of fame (measured via Euclidean distance) as the map is updated over generations
Fig. 3
Fig. 3
Visualization in 2D of map and a lowest-cost path computed for SOD1. The computed map for SOD1 is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Black dots show projections of experimentally-known structures. A lowest-cost path connecting the two visible basins is additionally drawn. Its cost (in REUs) is also listed
Fig. 4
Fig. 4
Visualization in 2D of map and a lowest-cost path computed for caM. The computed map for CaM is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Black dots show projections of experimentally-known structures. A lowest-cost path connecting two experimentally-known structures is also drawn; its cost (in REUs) is listed. The PDB ids of the structures the path connects, as well as other experimentally-known structures of interest, are additionally shown where these structures project onto the top two PCs
Fig. 5
Fig. 5
Visualization via conditioning plots of map computed for H-Ras WT. The map computed for the H-Ras WT is visualized via conditioning plots, which plot projections of the hall of fame on PC1 and PC2 conditional on projections on PC3 and PC4. Sixteeen plots are generated, considering combinations of all quartiles of PC3 with all quartiles of PC4. The 4D space in each subplot is discretized via hexagons, plotting for each hexagon only the projection of the lowest-energy individual in the hexagon. The blue-to-red color-coding scheme follows the low to blue energy range. Dots in yellow show PC1-PC2 projections of experimentally-known structures of H-Ras WT
Fig. 6
Fig. 6
Visualization in 2D of map and paths computed for H-Ras WT. The computed map for H-Ras WT is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Low-cost paths (costs in REUs are listed) modeling the On →Off structural excursion are also drawn. The projections of experimentally-known structures on the top two PCs are related by showing whether the structures are captured in the wet laboratory for the WT or variants
Fig. 7
Fig. 7
Visualization in 2D of map and paths computed for H-Ras G12C. The computed map for the H-Ras G12C variant is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Low-cost paths (costs in REUs are listed) modeling the On →Off structural excursion are also drawn. The projections of experimentally-known structures on the top two PCs are related by showing whether the structures are captured in the wet laboratory for the WT or variants
Fig. 8
Fig. 8
Visualization in 2D of map and paths computed for H-Ras Q61L. The computed map for the H-Ras Q61L variant is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Low-cost paths (costs in REUs are listed) modeling the On →Off structural excursion are also drawn. The projections of experimentally-known structures on the top two PCs are related by showing whether the structures are captured in the wet laboratory for the WT or variants
Fig. 9
Fig. 9
Visualization in 2D of map and paths computed for H-Ras Y32CC118S. The computed map for the H-Ras Y32CC118S variant is projected onto 2D and projections are color-coded by Rosetta score12 energy values. Low-cost paths (costs in REUs are listed) modeling the On →Off structural excursion are also drawn. The projections of experimentally-known structures on the top two PCs are related by showing whether the structures are captured in the wet laboratory for the WT or variants
Fig. 10
Fig. 10
Visualization in 3D of map computed for H-Ras WT. The lowest-energy structures in the map computed for H-Ras WT are shown projected onto the top 3 PCs. Projections of the experimentally-known structures are also drawn, as red spheres of a larger radius. The PDB ids of some of these structures are also shown. The four basins that emerge on the WT and the various variants are also delineated and named per the convention described in the main text

Similar articles

Cited by

References

    1. Frauenfelder H, Sligar SG, Wolynes PG. The energy landscapes and motion on proteins. Science. 1991;254(5038):1598–603. doi: 10.1126/science.1749933. - DOI - PubMed
    1. Jenzler-Wildman K, Kern D. Dynamic personalities of proteins. Nature. 2007;450:964–72. doi: 10.1038/nature06522. - DOI - PubMed
    1. Hub JS, de Groot BL. Detection of functional modes in protein dynamics. PLoS Comp Biol. 2009;5(8):1000480. doi: 10.1371/journal.pcbi.1000480. - DOI - PMC - PubMed
    1. Schroedinger E. What Is Life? New York: Cambridge University Press; 1944.
    1. Kendrew JC, Dickerson RE, Strandberg BE, Hart RG, Davies DR, Phillips DC, Shore VC. Structure of myoglobin: A three-dimensional fourier synthesis at 2å resolution. Nature. 1960;185(4711):422–7. doi: 10.1038/185422a0. - DOI - PubMed

Publication types

LinkOut - more resources