Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jun;150(2):535-46.
doi: 10.1104/pp.109.136028. Epub 2009 Apr 8.

Unraveling transcriptional control in Arabidopsis using cis-regulatory elements and coexpression networks

Affiliations

Unraveling transcriptional control in Arabidopsis using cis-regulatory elements and coexpression networks

Klaas Vandepoele et al. Plant Physiol. 2009 Jun.

Abstract

Analysis of gene expression data generated by high-throughput microarray transcript profiling experiments has demonstrated that genes with an overall similar expression pattern are often enriched for similar functions. This guilt-by-association principle can be applied to define modular gene programs, identify cis-regulatory elements, or predict gene functions for unknown genes based on their coexpression neighborhood. We evaluated the potential to use Gene Ontology (GO) enrichment of a gene's coexpression neighborhood as a tool to predict its function but found overall low sensitivity scores (13%-34%). This indicates that for many functional categories, coexpression alone performs poorly to infer known biological gene functions. However, integration of cis-regulatory elements shows that 46% of the gene coexpression neighborhoods are enriched for one or more motifs, providing a valuable complementary source to functionally annotate genes. Through the integration of coexpression data, GO annotations, and a set of known cis-regulatory elements combined with a novel set of evolutionarily conserved plant motifs, we could link many genes and motifs to specific biological functions. Application of our coexpression framework extended with cis-regulatory element analysis on transcriptome data from the cell cycle-related transcription factor OBP1 yielded several coexpressed modules associated with specific cis-regulatory elements. Moreover, our analysis strongly suggests a feed-forward regulatory interaction between OBP1 and the E2F pathway. The ATCOECIS resource (http://bioinformatics.psb.ugent.be/ATCOECIS/) makes it possible to query coexpression data and GO and cis-regulatory element annotations and to submit user-defined gene sets for motif analysis, providing an access point to unravel the regulatory code underlying transcriptional control in Arabidopsis (Arabidopsis thaliana).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
EC scores for genes functionally annotated using GO (A) and AraCyc (B). BP, MF, and CC refer to Biological Process, Molecular Function, and Cellular Component categories, and the number of categories is indicated in parentheses. [See online article for color version of this figure.]
Figure 2.
Figure 2.
Functional enrichment of GO and cis-regulatory element annotation for guide gene cluster AT5G59220. Lines indicate coexpression relationships, and colored circles show the functional annotation for the individual genes. Enrichment analysis is performed using the hypergeometric distribution and Bonferroni correction for multiple hypotheses testing. Results are shown for the ATH95 benchmark coexpression network. [See online article for color version of this figure.]
Figure 3.
Figure 3.
Functional predictive power for three benchmark coexpression networks built using different expression similarity thresholds (ATH90, ATH95, and ATH99). A to C show cumulative SN scores for a subset of GO categories, and D shows overall cumulative SN scores.
Figure 4.
Figure 4.
Examples of cis-regulatory motifs showing significant associations with one or more GO categories. GO motif networks reveal for different GO categories the fraction of genes having the motif in their promoter (P value < 0.05 using the hypergeometric distribution). The line thickness reflects the motif coverage per GO category and varies from 6% to 70%. Known motifs from AGRIS or PLACE are indicated in italics.

References

    1. Aoki K, Ogata Y, Shibata D (2007) Approaches for extracting practical information from gene co-expression networks in plant biology. Plant Cell Physiol 48 381–390 - PubMed
    1. Ben-Dor A, Shamir R, Yakhini Z (1999) Clustering gene expression patterns. J Comput Biol 6 281–297 - PubMed
    1. Birnbaum K, Shasha DE, Wang JY, Jung JW, Lambert GM, Galbraith DW, Benfey PN (2003) A gene expression map of the Arabidopsis root. Science 302 1956–1960 - PubMed
    1. Casneuf T, Van de Peer Y, Huber W (2007) In situ analysis of cross-hybridisation on microarrays and the inference of expression correlation. BMC Bioinformatics 8 461. - PMC - PubMed
    1. Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, et al (2007) Integration of biological networks and gene expression data using Cytoscape. Nat Protocols 2 2366–2382 - PMC - PubMed

Publication types

MeSH terms

Substances