Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jan;40(Database issue):D1060-6.
doi: 10.1093/nar/gkr901. Epub 2011 Nov 21.

GeneSigDB: a manually curated database and resource for analysis of gene expression signatures

Affiliations

GeneSigDB: a manually curated database and resource for analysis of gene expression signatures

Aedín C Culhane et al. Nucleic Acids Res. 2012 Jan.

Abstract

GeneSigDB (http://www.genesigdb.org or http://compbio.dfci.harvard.edu/genesigdb/) is a database of gene signatures that have been extracted and manually curated from the published literature. It provides a standardized resource of published prognostic, diagnostic and other gene signatures of cancer and related disease to the community so they can compare the predictive power of gene signatures or use these in gene set enrichment analysis. Since GeneSigDB release 1.0, we have expanded from 575 to 3515 gene signatures, which were collected and transcribed from 1604 published articles largely focused on gene expression in cancer, stem cells, immune cells, development and lung disease. We have made substantial upgrades to the GeneSigDB website to improve accessibility and usability, including adding a tag cloud browse function, facetted navigation and a 'basket' feature to store genes or gene signatures of interest. Users can analyze GeneSigDB gene signatures, or upload their own gene list, to identify gene signatures with significant gene overlap and results can be viewed on a dynamic editable heatmap that can be downloaded as a publication quality image. All data in GeneSigDB can be downloaded in numerous formats including .gmt file format for gene set enrichment analysis or as a R/Bioconductor data file. GeneSigDB is available from http://www.genesigdb.org.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Growth of GeneSigDB. GeneSigDB has grown considerably over 4 database releases (August 2009, March 2010, December 2010, September 2011). The most recent release (Release 4.0, September 2011) contains 3515 human, mouse and rat gene sets curated from 1604 published articles.
Figure 2.
Figure 2.
Overview of the GeneSigDB Data Curation pipeline. Gene signatures in tables or figures are transcribed from published articles indexed in PubMed and we then use a pipeline based on Biomart (20) to map all published gene identifiers to EnsEMBL IDs, to create standardized gene sets.
Figure 3.
Figure 3.
Screenshot showing the (A) faceting and the (B) ‘Shopping Basket’ search features when we performed a publication search for ‘serous ovarian cancer’ that returned a list of 11 publications. Further detail about (C) publications or (D) Signature can be viewed by clicking on their respective links. By default 10 results are shown but up to 100 search results can be viewed. Selecting ‘Add All’ will add the 15 gene signatures associated with the 11 displayed publications to the basket so they can be compared or downloaded.
Figure 4.
Figure 4.
Screen shots of the ‘Analyze My Gene Lists’ and ‘Comparison View’. The gene overlap between 15 serous ovarian cancer gene signatures selected in Figure 3 were analyzed. (A) shows the gene overlap between signature, where presence and absence of a gene are indicates by a red or gray pixel in the heatmap. The image can be edited, reordered or genes and gene signatures can be added or removed before the image is exported as a publication quality image. (B) shows results of a Fisher's exact test of enrichment between the 15 gene signatures. These results can also be visualized as a list.

References

    1. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA. 2005;102:15545–15550. - PMC - PubMed
    1. Mootha VK, Lindgren CM, Eriksson K-F, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstråle M, Laurila E, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 2003;34:267–273. - PubMed
    1. Goeman JJ, Bühlmann P. Analyzing gene expression data in terms of gene sets: methodological issues. Bioinformatics. 2007;23:980–987. - PubMed
    1. Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27:1739–1740. - PMC - PubMed
    1. Bardelli A, Velculescu VE. Mutational analysis of gene families in human cancer. Curr. Opin. Genet. Dev. 2005;15:5–12. - PubMed

Publication types