Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar;16(3):407-26.
doi: 10.1089/cmb.2008.0081.

Analysis of gene sets based on the underlying regulatory network

Affiliations

Analysis of gene sets based on the underlying regulatory network

Ali Shojaie et al. J Comput Biol. 2009 Mar.

Abstract

Networks are often used to represent the interactions among genes and proteins. These interactions are known to play an important role in vital cell functions and should be included in the analysis of genes that are differentially expressed. Methods of gene set analysis take advantage of external biological information and analyze a priori defined sets of genes. These methods can potentially preserve the correlation among genes; however, they do not directly incorporate the information about the gene network. In this paper, we propose a latent variable model that directly incorporates the network information. We then use the theory of mixed linear models to present a general inference framework for the problem of testing the significance of subnetworks. Several possible test procedures are introduced and a network based method for testing the changes in expression levels of genes as well as the structure of the network is presented. The performance of the proposed method is compared with methods of gene set analysis using both simulation studies, as well as real data on genes related to the galactose utilization pathway in yeast.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Yeast galactose utilization pathway published in Ideker et al. (2001). (Printed with permission from Science and the American Association for the Advancement of Science.)
FIG. 2.
FIG. 2.
A simple gene network.
FIG. 3.
FIG. 3.
Illustration of the Network contrast vector on a simple network; dashed line indicates the interactions that are included in the contrast vector.
FIG. 4.
FIG. 4.
Design of the second simulation study. Solid arrows and boxes represent high positive association (0.6 here), dashed arrows and boxes represent high negative association (−0.6), and dotted arrows and boxes indicate low positive association (0.1). The root genes 1 and 2 are upregulated, while the expression level for gene 3 does not change.
FIG. 5.
FIG. 5.
Estimated powers of test of significance of subnetworks of Table 6 with random noise in weights of the adjacency matrix. Plots in gray represent the powers of subnetworks whose true adjacency matrices in control and treatment are the same.
FIG. 6.
FIG. 6.
Estimated powers of test of significance of subnetworks of Table 6 with systematic bias in weights of the adjacency matrix. Plots in gray represent the powers of subnetworks whose true adjacency matrices in control and treatment are the same.
FIG. 7.
FIG. 7.
Yeast gene network indicating the significant pathways; significant pathways have been marked with ovals.

References

    1. Alexa A. Rahnenfuhrer J. Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006;22:1600–1607. - PubMed
    1. Ashburner M. Ball C. Blake J., et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. - PMC - PubMed
    1. Bader J.S. Chaudhuri A. Rothberg J.M., et al. Gaining confidence in high-throughput protein interaction networks. Nat. Biotechnol. 2004;22:78–85. - PubMed
    1. Barry W.T. Nobel A.B. Wright F.A. Significance analysis of functional categories in gene expression studies: a structured permutation approach. Bioinformatics. 2005;21:1943–1949. - PubMed
    1. Benjamini Y. Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Statist. Soc. Ser. B. 1995;57:289–300.

Publication types