Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jul;35(Web Server issue):W245-52.
doi: 10.1093/nar/gkm427. Epub 2007 Jun 18.

oPOSSUM: integrated tools for analysis of regulatory motif over-representation

Affiliations

oPOSSUM: integrated tools for analysis of regulatory motif over-representation

Shannan J Ho Sui et al. Nucleic Acids Res. 2007 Jul.

Abstract

The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Determination of one-to-one orthologs for human and mouse genes. An initial set of homologs was downloaded from EnsEMBL v41 (30). All homologs annotated as ‘one2one’ are extracted. To select the closest putative ortholog pairs from homologs with ‘one2many’ or ‘many2many’ relationships, we check for upstream conservation using the whole-genome human–mouse alignments (6). We re-annotate unambiguously aligned homologs as putative one-to-one orthologs, adding 195 gene pairs to our set and bringing the total number of orthologs to 15 162.
Figure 2.
Figure 2.
Identification of transcription start regions (TSRs) using a combination of EnsEMBL annotations and CAGE data. To improve our alignments, we determine putative alternative TSSs for the human and mouse genes. For each gene, the entire repertoire of transcripts from both EnsEMBL core genes and EST genes are retrieved. The TSSs for all transcripts are recorded, followed by a clustering step such that TSSs within 500 bp of one another are merged to form a transcriptional start region (TSR). For each TSR containing a transcript annotated as ‘known’ or ‘novel’, we accept the TSR as is. For TSRs based solely on EST gene transcripts, we require a minimum of 5 CAGE tags as evidence for transcription initiation.
Figure 3.
Figure 3.
(A) A screenshot of the output of the oPOSSUM Human SSA analysis, with TFBS profiles ranked by Z-score. The arrows allow the user to sort and re-order the results by Fisher score, TF name, TF class, TF supergroup or TF profile information content (IC). Each TF name links to a pop-up window displaying the TFBS profile information. (B) Pop-up window displaying genes that contain a particular TFBS (in this case, MEF2A; partial list shown), as well as the promoter coordinates associated with each gene, and the motif locations and scores. Sites in overlapping alternative promoters are highlighted for emphasis. Such sites are only counted once in the statistical analysis.

References

    1. Ho Sui SJ, Mortimer JR, Arenillas DJ, Brumm J, Walsh CJ, Kennedy BP, Wasserman WW. oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes. Nucleic Acids Res. 2005;33:3154–3164. - PMC - PubMed
    1. Huang SS, Fulton DL, Arenillas DJ, Perco P, Ho Sui SJ, Mortimer JR, Wasserman WW. Advances in Bioinformatics and Computational Biology. Vol. 3. London, UK: Imperial College Press; 2006. Identification of over-represented combinations of transcription factor binding sites in sets of co-expressed genes; pp. 247–256.
    1. Urnov FD, Rebar EJ. Designed transcription factors as tools for therapeutics and functional genomics. Biochem. Pharmacol. 2002;64:919–923. - PubMed
    1. Luscombe NM, Austin SE, Berman HM, Thornton JM. An overview of the structures of protein-DNA complexes. Genome Biol. 2000;1:REVIEWS001. - PMC - PubMed
    1. Sandelin A, Wasserman WW. Constrained binding site diversity within families of transcription factors enhances pattern discovery bioinformatics. J. Mol. Biol. 2004;338:207–215. - PubMed

Publication types