Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2013 Sep-Oct;4(5):266-78.
doi: 10.4161/bioe.23041. Epub 2012 Dec 6.

Naturally selecting solutions: the use of genetic algorithms in bioinformatics

Affiliations
Review

Naturally selecting solutions: the use of genetic algorithms in bioinformatics

Timmy Manning et al. Bioengineered. 2013 Sep-Oct.

Abstract

For decades, computer scientists have looked to nature for biologically inspired solutions to computational problems; ranging from robotic control to scheduling optimization. Paradoxically, as we move deeper into the post-genomics era, the reverse is occurring, as biologists and bioinformaticians look to computational techniques, to solve a variety of biological problems. One of the most common biologically inspired techniques are genetic algorithms (GAs), which take the Darwinian concept of natural selection as the driving force behind systems for solving real world problems, including those in the bioinformatics domain. Herein, we provide an overview of genetic algorithms and survey some of the most recent applications of this approach to bioinformatics based problems.

Keywords: genetic algorithm; multiple sequence alignment; optimization; protein structure prediction.

PubMed Disclaimer

Figures

None
Figure 1. An analysis of 150 bioinformatics papers referencing GAs. (A) A plot of the distribution of the sampled papers (y-axis) plotted against the year of publication (x-axis), and (B) A breakdown of the subject matter addressed by the papers.
None
Figure 2. Flowchart for the execution of a genetic algorithm
None
Figure 3. (A) A possible genotype representation for the TSP, describing a permutation of a list of cities. (B) A phenotype (the itinerary) generated from the genotype.
None
Figure 4. One-point crossover on two binary parent genotypes, creating two new offspring
None
Figure 5. Two-point crossover. The parent genotypes are each divided into three segments, and the offspring are produced by combining alternate segments of the parents.
None
Figure 6. A possible crossover implementation for the TSP problem. (A) Two parents are chosen using a selection algorithm. A crossover point is randomly selected in the first parent. (B and C) The genes from one side of the crossover point are selected and are combined with the alternate genes in the order which they appear in the second parent to form a new child genotype.
None
Figure 7. A possible mutation operator for the TSP. An offset in the genotype is selected. The genes either side of the offset are then switched to produce the new valid genotype.
None
Figure 8. (A) An example distribution of 20 cities and (B) a path linking the cities found using a genetic algorithm.
None
Figure 9. A typical learning curve for a genetic algorithm
None
Figure 10. An example 4-sequence alignment. Individual sequences are represented on separate lines. Columns represent aligned amino acids. Adapted from reference .
None
Figure 11. An example distribution of a set of solutions in a Pareto Optimality. Each box represents the fitness of a solution in terms of ID score plotted against its fitness in terms of the SOP score. The gray line represents the Pareto front. In this example the Pareto front shows optimal trade-offs between the ID and SOP scores.
None
Figure 12. Example dissection of two parents for a vertical one-point crossover operation
None
Figure 13. Example dissection of two parents for a horizontal one-point crossover operation
None
Figure 14. Graphical depiction of the flow of solution exchanges in 4-island model comprising three slave populations and a centrally located master population
None
Figure 15. (A) A sequence alignment for four sequence, and two equivalent encodings for the alignment. (B) The CGA-MSA genotype representing an alignment solution.
None
Figure 16. The CGA-MSA mutation operator. The binary digits up to the mutation point are grouped by sequence and considered for mutation individually.
None
Figure 17. A protein with the format HPHPPHHPHPPHPHHPPHPH fitted to (A) a 2D triangular lattice, and (B) a 3D cubic lattice. The hydrophobic amino acids are represented as black dots and the polar amino acids as white circles. Both conformations are optimal under the HP model. Adapted from references and , respectively.

Similar articles

Cited by

References

    1. Holland JH. Adaptation in natural and artiðcial systems. Ann Arbor: University of Michigan Press 1975; 2.
    1. Jiang W, Baker ML, Ludtke SJ, Chiu W. Bridging the information gap: computational tools for intermediate resolution structure interpretation. J Mol Biol. 2001;308:1033–44. doi: 10.1006/jmbi.2001.4633. - DOI - PubMed
    1. Kumar SN, Panneerselvam R. A Survey on the Vehicle Routing Problem and Its Variants. Intelligent Information Management. 2012;4:66–74. doi: 10.4236/iim.2012.43010. - DOI
    1. Gemperline P, Niazi A, Leardi R. Genetic algorithms in chemometrics. J Chemometr. 2012;26:345–51. doi: 10.1002/cem.2426. - DOI
    1. Sharma RN, Pancholi SS. Optimization techniques in pharmaceutical industry: A Review. Journal of Current Pharmaceutical Research. 2011;7:21–8.

Publication types

LinkOut - more resources