Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010:12:203-210.
doi: 10.1145/1830483.1830519.

Initialization Parameter Sweep in ATHENA: Optimizing Neural Networks for Detecting Gene-Gene Interactions in the Presence of Small Main Effects

Affiliations

Initialization Parameter Sweep in ATHENA: Optimizing Neural Networks for Detecting Gene-Gene Interactions in the Presence of Small Main Effects

Emily R Holzinger et al. Genet Evol Comput Conf. 2010.

Abstract

Recent advances in genotyping technology have led to the generation of an enormous quantity of genetic data. Traditional methods of statistical analysis have proved insufficient in extracting all of the information about the genetic components of common, complex human diseases. A contributing factor to the problem of analysis is that amongst the small main effects of each single gene on disease susceptibility, there are non-linear, gene-gene interactions that can be difficult for traditional, parametric analyses to detect. In addition, exhaustively searching all multi-locus combinations has proved computationally impractical. Novel strategies for analysis have been developed to address these issues. The Analysis Tool for Heritable and Environmental Network Associations (ATHENA) is an analytical tool that incorporates grammatical evolution neural networks (GENN) to detect interactions among genetic factors. Initial parameters define how the evolutionary process will be implemented. This research addresses how different parameter settings affect detection of disease models involving interactions. In the current study, we iterate over multiple parameter values to determine which combinations appear optimal for detecting interactions in simulated data for multiple genetic models. Our results indicate that the factors that have the greatest influence on detection are: input variable encoding, population size, and parallel computation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic of ATHENA: Analysis Tool for Heritable and Environmental Network Associations.
Figure 2
Figure 2
Central dogma (left) of molecular biology and evolution by natural selection to GENN (right).
Figure 3
Figure 3
Serial ATHENA parameter combination results. Different color bars represent different values for PopSize (as indicated in figure legend). At each position, the x-axis values represent MaxDepth, GrammarFile, and DummyEncode. Detection power values are shown on the y-axis.
Figure 4
Figure 4
Parallel ATHENA parameter combination results. Different color bars represent different values for PopSize (as indicated in figure legend). At each position, the x-axis values represent MaxDepth, GrammarFile, and DummyEncode. Detection power values are shown on the y-axis.
Figure 5
Figure 5
A comparison of dummy-encoding with its linear and quadratic components. Here we are showing the detection power (y-axis) for the additive 100 SNP simulation models. Different color bars represent different population sizes. (MaxDepth = 7 and GrammarFile = add)

Similar articles

Cited by

References

    1. Bishop CM. Neural Networks for Pattern Recognition. Oxford University Press; London: 1995.
    1. Bush WS, Dudek SM, Ritchie MD. Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies. Pac. Symp. Biocomput. 2009:368–379. - PMC - PubMed
    1. Cordell HJ. Genome-wide association studies: Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet. 2009;10:392–404. - PMC - PubMed
    1. Dudek SM, Motsinger AA, Velez DR, Williams SM, Ritchie MD. Data Simulation Software for Whole-Genome Association and Other Studies in Human Genetics. Pac Symp BioComput. 2006:499–510. - PubMed
    1. Edwards TL, Bush WS, Turner SD, Dudek SM, Torstenson ES, Schmidt M, Martin E, Ritchie MD. Generating Linkage Disequilibrium Patterns in Data Simulations Using genomeSIMLA. Lecture Notes in Computer Science. 2008;4793:24–35. - PubMed

LinkOut - more resources