Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 7;11(Suppl 1):S19.
doi: 10.1186/1477-5956-11-S1-S19. Epub 2013 Nov 7.

An effective evolutionary algorithm for protein folding on 3D FCC HP model by lattice rotation and generalized move sets

Affiliations

An effective evolutionary algorithm for protein folding on 3D FCC HP model by lattice rotation and generalized move sets

Jyh-Jong Tsay et al. Proteome Sci. .

Abstract

Background: Proteins are essential biological molecules which play vital roles in nearly all biological processes. It is the tertiary structure of a protein that determines its functions. Therefore the prediction of a protein's tertiary structure based on its primary amino acid sequence has long been the most important and challenging subject in biochemistry, molecular biology and biophysics. In the past, the HP lattice model was one of the ab initio methods that many researchers used to forecast the protein structure. Although these kinds of simplified methods could not achieve high resolution, they provided a macrocosm-optimized protein structure. The model has been employed to investigate general principles of protein folding, and plays an important role in the prediction of protein structures.

Methods: In this paper, we present an improved evolutionary algorithm for the protein folding problem. We study the problem on the 3D FCC lattice HP model which has been widely used in previous research. Our focus is to develop evolutionary algorithms (EA) which are robust, easy to implement and can handle various energy functions. We propose to combine three different local search methods, including lattice rotation for crossover, K-site move for mutation, and generalized pull move; these form our key components to improve previous EA-based approaches.

Results: We have carried out experiments over several data sets which were used in previous research. The results of the experiments show that our approach is able to find optimal conformations which were not found by previous EA-based approaches.

Conclusions: We have investigated the geometric properties of the 3D FCC lattice and developed several local search techniques to improve traditional EA-based approaches to the protein folding problem. It is known that EA-based approaches are robust and can handle arbitrary energy functions. Our results further show that by extensive development of local searches, EA can also be very effective for finding optimal conformations on the 3D FCC HP model. Furthermore, the local searches developed in this paper can be integrated with other approaches such as the Monte Carlo and Tabu searches to improve their performance.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A ground-state conformation in the 3D FCC HP model. An example on HP lattice model: (a) The native state of the protein with PDB id 1CNL, (b) an optimal HP conformation for 1CNL on 3D FCC lattice with 7 HH contacts denoted by dashed blue lines.
Figure 2
Figure 2
The FCC lattice model: each lattice point has 12 neighbours.
Figure 3
Figure 3
Main steps of the proposed EA-based approach.
Figure 4
Figure 4
Square-Based Rotation: 3 ways to partition the 12 neighbours of the central point into 3 squares.
Figure 5
Figure 5
Triangle-Hexagon-Based Rotations. Triangle-Hexagon-Based Rotations: 4 ways to partition the 12 neighbours of the central point into two triangles and one hexagon.
Figure 6
Figure 6
Rotation-based crossover operate. Illustration of rotation-based crossover: (a) and (b) are the parent conformations; (c) is the offspring without rotation; (d) shows the 3 squares in one partitioning of neighbours; (e) shows the corresponding label permutation for rotation angle 90°, 180° and 270°; (f) shows the 4 offspring with the part in red rotated 0°, 90°, 180° and 270°; (g), (i) and (j) illustrate triangle-hexagon-based rotation.
Figure 7
Figure 7
Generalized Pull Move. Generalized Pull Move on 3D FCC lattice: (a) shows the result obtained by the traditional Pull Move; (b) to (e) shows the 4 possible results obtained by Generalized Pull Move.
Figure 8
Figure 8
Configurations for F180_1, F180_2 and F180_3 obtained by our approach.
Figure 9
Figure 9
Configurations for sequences in Data Set IV obtained by our approach.

References

    1. Hagerman PJ, Jr IT. From sequence to structure to function. Current Opinion in Structural Biology. 1996;11(3):277–280. doi: 10.1016/S0959-440X(96)80044-3. - DOI - PubMed
    1. Mirsky AE, Pauling L. On the structure of native, denatured, and coagulated proteins. Proceedings of the National Academy of Sciences of the United States of America. 1936;11(7):439–447. doi: 10.1073/pnas.22.7.439. - DOI - PMC - PubMed
    1. Orengo CA, Todd AE, Thornton JM. From protein structure to function. Current Opinion in Structural Biology. 1999;11(3):374–382. doi: 10.1016/S0959-440X(99)80051-7. - DOI - PubMed
    1. Lau K, Dill K. A lattice statistical mechanics model of the conformation and sequence space of proteins. Macromolecules. 1989;11:3986–3997. doi: 10.1021/ma00200a030. - DOI
    1. Istrail S, Lam F. Combinatorial algorithms for protein folding in lattice models: a survey of mathematical results. Commun Inf Syst. 2009;11(4):303–346.