Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May 15;23(10):2780-90.
doi: 10.1093/hmg/ddt668. Epub 2013 Dec 30.

Guilt by rewiring: gene prioritization through network rewiring in genome wide association studies

Affiliations

Guilt by rewiring: gene prioritization through network rewiring in genome wide association studies

Lin Hou et al. Hum Mol Genet. .

Abstract

Although Genome Wide Association Studies (GWAS) have identified many susceptibility loci for common diseases, they only explain a small portion of heritability. It is challenging to identify the remaining disease loci because their association signals are likely weak and difficult to identify among millions of candidates. One potentially useful direction to increase statistical power is to incorporate functional genomics information, especially gene expression networks, to prioritize GWAS signals. Most current methods utilizing network information to prioritize disease genes are based on the 'guilt by association' principle, in which networks are treated as static, and disease-associated genes are assumed to locate closer with each other than random pairs in the network. In contrast, we propose a novel 'guilt by rewiring' principle. Studying the dynamics of gene networks between controls and patients, this principle assumes that disease genes more likely undergo rewiring in patients, whereas most of the network remains unaffected in disease condition. To demonstrate this principle, we consider the changes of co-expression networks in Crohn's disease patients and controls, and how network dynamics reveals information on disease associations. Our results demonstrate that network rewiring is abundant in the immune system, and disease-associated genes are more likely to be rewired in patients. To integrate this network rewiring feature and GWAS signals, we propose to use the Markov random field framework to integrate network information to prioritize genes. Applications in Crohn's disease and Parkinson's disease show that this framework leads to more replicable results, and implicates potentially disease-associated pathways.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Network rewiring is abundant in immune system (defined in the Reactome pathway). The y-axis is the fold change of density in the subnetwork of the immune system to the total network. Network density is the number of edges in the observed network over the number of all possible edges.
Figure 2.
Figure 2.
Comparisons of edge frequency in static network and rewiring network. (+1, +1) denotes an edge that connects two associated genes, while (+1, −1) denotes an edge that connects one associated gene and one non-associated gene. The y-axis is the fold change between observed number of edges and that of random expectation. Both the static and rewiring network were dichotomized at a threshold so that the corresponding network density is 0.01.
Figure 3.
Figure 3.
Replication rates between independent cohorts in Crohn's disease study. A gene is called replicable if its association P-value in the replication cohort is <0.05. (A). Black circle: prioritized genes. Grey circle: non-prioritized genes. Black diamond: baseline replication rates for the top k genes. Grey cross: prioritized genes with microarray data set GSE8397 (a microarray data set of Parkinson's disease). The numbers on the top of x-axis are the association P-value of the kth genes in the discovery cohort. (B). Empirical P-values of the replication rate of the prioritized gene set, derived from permutations. The dotted line indicates significance level at 0.05. The y-axis is drawn at the log10 scale but labelled with original P-values for ease of reading.
Figure 4.
Figure 4.
Replication rates between independent cohorts in Parkinson's disease study. A gene is called replicable if its association P-value in the replication cohort is <0.05. (A) Black circles: prioritized genes. Grey circles: non-prioritized genes. Black diamonds: baseline replication rates for the top k genes. Grey crosses: prioritized genes with microarray data set GSE20881 (a microarray data set of Crohn's disease). The numbers on the top of x-axis are the association P-value of the kth genes in the discovery cohort. (B) Empirical P-values of the replication rate of the prioritized gene set derived from permutations. The dotted line indicates significance level at 0.05. The y-axis is drawn at the log10 scale but labelled with original P-values for ease of reading.

References

    1. Manolio T.A., Brooks L.D., Collins F.S. A HapMap harvest of insights into the genetics of common disease. J. Clin. Invest. 2008;118:1590–1605. doi:10.1172/JCI34772. - DOI - PMC - PubMed
    1. Franke A., McGovern D.P.B., Barrett J.C., Wang K., Radford-Smith G.L., Ahmad T., Lees C.W., Balschun T., Lee J., Roberts R., et al. Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat. Genet. 2010;42:1118–1125. doi:10.1038/ng.717. - DOI - PMC - PubMed
    1. Jostins L., Ripke S., Weersma R.K., Duerr R.H., McGovern D.P., Hui K.Y., Lee J.C., Philip Schumm L., Sharma Y., Anderson C.A., et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature. 2012;491:119–124. doi:10.1038/nature11582. - DOI - PMC - PubMed
    1. Goldstein D.B. Common genetic variation and human traits. N. Engl. J. Med. 2009;360:1696–1698. doi:10.1056/NEJMp0806284. - DOI - PubMed
    1. Hardy J., Singleton A. Genomewide Association Studies and human disease. N. Engl. J. Med. 2009;360:1759–1768. doi:10.1056/NEJMra0808700. - DOI - PMC - PubMed

Publication types