Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
- PMID: 22168340
- PMCID: PMC3247083
- DOI: 10.1186/1471-2105-12-S12-S2
Systematic identification of functional modules and cis-regulatory elements in Arabidopsis thaliana
Abstract
Background: Several large-scale gene co-expression networks have been constructed successfully for predicting gene functional modules and cis-regulatory elements in Arabidopsis (Arabidopsis thaliana). However, these networks are usually constructed and analyzed in an ad hoc manner. In this study, we propose a completely parameter-free and systematic method for constructing gene co-expression networks and predicting functional modules as well as cis-regulatory elements.
Results: Our novel method consists of an automated network construction algorithm, a parameter-free procedure to predict functional modules, and a strategy for finding known cis-regulatory elements that is suitable for consensus scanning without prior knowledge of the allowed extent of degeneracy of the motif. We apply the method to study a large collection of gene expression microarray data in Arabidopsis. We estimate that our co-expression network has ~94% of accuracy, and has topological properties similar to other biological networks, such as being scale-free and having a high clustering coefficient. Remarkably, among the ~300 predicted modules whose sizes are at least 20, 88% have at least one significantly enriched functions, including a few extremely significant ones (ribosome, p < 1E-300, photosynthetic membrane, p < 1.3E-137, proteasome complex, p < 5.9E-126). In addition, we are able to predict cis-regulatory elements for 66.7% of the modules, and the association between the enriched cis-regulatory elements and the enriched functional terms can often be confirmed by the literature. Overall, our results are much more significant than those reported by several previous studies on similar data sets. Finally, we utilize the co-expression network to dissect the promoters of 19 Arabidopsis genes involved in the metabolism and signaling of the important plant hormone gibberellin, and achieved promising results that reveal interesting insight into the biosynthesis and signaling of gibberellin.
Conclusions: The results show that our method is highly effective in finding functional modules from real microarray data. Our application on Arabidopsis leads to the discovery of the largest number of annotated Arabidopsis functional modules in the literature. Given the high statistical significance of functional enrichment and the agreement between cis-regulatory and functional annotations, we believe our Arabidopsis gene modules can be used to predict the functions of unknown genes in Arabidopsis, and to understand the regulatory mechanisms of many genes.
Figures





Similar articles
-
Expression-based network biology identifies immune-related functional modules involved in plant defense.BMC Genomics. 2014 Jun 3;15:421. doi: 10.1186/1471-2164-15-421. BMC Genomics. 2014. PMID: 24888606 Free PMC article.
-
Discovery of core biotic stress responsive genes in Arabidopsis by weighted gene co-expression network analysis.PLoS One. 2015 Mar 2;10(3):e0118731. doi: 10.1371/journal.pone.0118731. eCollection 2015. PLoS One. 2015. PMID: 25730421 Free PMC article.
-
Unraveling transcriptional control in Arabidopsis using cis-regulatory elements and coexpression networks.Plant Physiol. 2009 Jun;150(2):535-46. doi: 10.1104/pp.109.136028. Epub 2009 Apr 8. Plant Physiol. 2009. PMID: 19357200 Free PMC article.
-
Manipulating large-scale Arabidopsis microarray expression data: identifying dominant expression patterns and biological process enrichment.Methods Mol Biol. 2009;553:57-77. doi: 10.1007/978-1-60327-563-7_4. Methods Mol Biol. 2009. PMID: 19588101 Free PMC article. Review.
-
Dissecting the plant transcriptome and the regulatory responses to phosphate deprivation.Physiol Plant. 2010 Jun 1;139(2):129-43. doi: 10.1111/j.1399-3054.2010.01356.x. Epub 2010 Jan 25. Physiol Plant. 2010. PMID: 20113436 Review.
Cited by
-
Manipulation of the Plant Host by the Geminivirus AC2/C2 Protein, a Central Player in the Infection Cycle.Front Plant Sci. 2020 May 19;11:591. doi: 10.3389/fpls.2020.00591. eCollection 2020. Front Plant Sci. 2020. PMID: 32508858 Free PMC article. Review.
-
Expression-based network biology identifies immune-related functional modules involved in plant defense.BMC Genomics. 2014 Jun 3;15:421. doi: 10.1186/1471-2164-15-421. BMC Genomics. 2014. PMID: 24888606 Free PMC article.
-
Incorporating motif analysis into gene co-expression networks reveals novel modular expression pattern and new signaling pathways.PLoS Genet. 2013;9(10):e1003840. doi: 10.1371/journal.pgen.1003840. Epub 2013 Oct 3. PLoS Genet. 2013. PMID: 24098147 Free PMC article.
-
Altered expression of Arabidopsis genes in response to a multifunctional geminivirus pathogenicity protein.BMC Plant Biol. 2014 Nov 18;14:302. doi: 10.1186/s12870-014-0302-7. BMC Plant Biol. 2014. PMID: 25403083 Free PMC article.
-
Global Transcriptomic Analysis of Interactions between Pseudomonas aeruginosa and Bacteriophage PaP3.Sci Rep. 2016 Jan 11;6:19237. doi: 10.1038/srep19237. Sci Rep. 2016. PMID: 26750429 Free PMC article.
References
-
- Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R. NCBI GEO: mining tens of millions of expression profiles—database and tools update. Nucleic Acids Res. 2007;35(Database issue):760–765. http://www.hubmed.org/display.cgi?uids=17099226 - PMC - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources