Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009:368-79.

Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies

Affiliations

Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies

William S Bush et al. Pac Symp Biocomput. 2009.

Abstract

Genome-wide association studies provide an unprecedented opportunity to identify combinations of genetic variants that contribute to disease susceptibility. The combinatorial problem of jointly analyzing the millions of genetic variations accessible by high-throughput genotyping technologies is a difficult challenge. One approach to reducing the search space of this variable selection problem is to assess specific combinations of genetic variations based on prior statistical and biological knowledge. In this work, we provide a systematic approach to integrate multiple public databases of gene groupings and sets of disease-related genes to produce multi-SNP models that have an established biological foundation. This approach yields a collection of models which can be tested statistically in genome-wide data, along with an ordinal quantity describing the number of data sources that support any given model. Using this knowledge-driven approach reduces the computational and statistical burden of large-scale interaction analysis while simultaneously providing a biological foundation for the relevance of any significant statistical result that is found.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overview of the Biofilter process. GWAS platform SNPs are mapped to Ensembl gene IDs and related to disease-independent sources (left) and to disease-dependent sources (right). Multi-marker models are generated from SNPs within knowledge-related genes. Derived models are overlaid to assess overall model implication.
Figure 2
Figure 2
Two-gene model types. Each box represents a gene, and each line a connection between genes. Boxes that are dashed have been previously linked to disease by at least one data source.

References

    1. Bateson W. Mendel's Principles of Heredity. Cambridge: Cambridge University Press; 1909.
    1. Moore JH, Williams SM. Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis. Bioessays. 2005;27:637–646. - PubMed
    1. Moore JH. The ubiquitous nature of epistasis in determining susceptibility to common human diseases. Hum Hered. 2003;56:73–82. - PubMed
    1. Fisher RA. The Correlation Between Relatives on the Supposition of Mendelian Inheritance. Transactions of the Royal Society of Edinburgh. 1918;52:399–433.
    1. Bridges CB. Specific modifiers of eosin eye color in Drosophila melanogaster. J Experimental Zoology. 1919;28:337–384.

Publication types