Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Jun 18;3(6):e2439.
doi: 10.1371/journal.pone.0002439.

Functional annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data

Affiliations

Functional annotation and identification of candidate disease genes by computational analysis of normal tissue gene expression data

Laura Miozzi et al. PLoS One. .

Abstract

Background: High-throughput gene expression data can predict gene function through the "guilt by association" principle: coexpressed genes are likely to be functionally associated.

Methodology/principal findings: We analyzed publicly available expression data on normal human tissues. The analysis is based on the integration of data obtained with two experimental platforms (microarrays and SAGE) and of various measures of dissimilarity between expression profiles. The building blocks of the procedure are the Ranked Coexpression Groups (RCG), small sets of tightly coexpressed genes which are analyzed in terms of functional annotation. Functionally characterized RCGs are selected by means of the majority rule and used to predict new functional annotations. Functionally characterized RCGs are enriched in groups of genes associated to similar phenotypes. We exploit this fact to find new candidate disease genes for many OMIM phenotypes of unknown molecular origin.

Conclusions/significance: We predict new functional annotations for many human genes, showing that the integration of different data sets and coexpression measures significantly improves the scope of the results. Combining gene expression data, functional annotation and known phenotype-gene associations we provide candidate genes for several genetic diseases of unknown molecular basis.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Enrichment of RCGs in genes involved in similar phenotypes.
For each dataset/coexpresison measure combination we show the fraction of RCGs associated to an OMIM phenotype as described in the text. The fraction is shown for functionally characterized RCGs (grey), all RCGs (purple), and randomized RCGs (orange). The coexpression measures are: E: Euclidean; P: Pearson; D: Poisson “distinct”; O: Poisson “overrepresentation”.

References

    1. Brown PO, Botstein D. Exploring the new world of the genome with DNA microarrays. Nat Genet. 1999;21:33–37. - PubMed
    1. Quackenbush J. Computational analysis of microarraydata. Nat Rev Genet. 2001;2:418–427. - PubMed
    1. Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A. 1998;95:14863–14868. - PMC - PubMed
    1. Zhang W, Morris QD, Chang R, Shai O, Bakowski MA, et al. The functional landscape of mouse gene expression. J Biol. 2004;3:21. - PMC - PubMed
    1. Wolfe CJ, Kohane IS, Butte AJ. Systematic survey reveals general applicability of guilt-by-association within gene coexpression networks. BMC Bioinformatics. 2005;6:227. - PMC - PubMed

Publication types