Computational approaches to identify promoters and cis-regulatory elements in plant genomes
- PMID: 12857799
- PMCID: PMC167057
- DOI: 10.1104/pp.102.017715
Computational approaches to identify promoters and cis-regulatory elements in plant genomes
Abstract
The identification of promoters and their regulatory elements is one of the major challenges in bioinformatics and integrates comparative, structural, and functional genomics. Many different approaches have been developed to detect conserved motifs in a set of genes that are either coregulated or orthologous. However, although recent approaches seem promising, in general, unambiguous identification of regulatory elements is not straightforward. The delineation of promoters is even harder, due to its complex nature, and in silico promoter prediction is still in its infancy. Here, we review the different approaches that have been developed for identifying promoters and their regulatory elements. We discuss the detection of cis-acting regulatory elements using word-counting or probabilistic methods (so-called "search by signal" methods) and the delineation of promoters by considering both sequence content and structural features ("search by content" methods). As an example of search by content, we explored in greater detail the association of promoters with CpG islands. However, due to differences in sequence content, the parameters used to detect CpG islands in humans and other vertebrates cannot be used for plants. Therefore, a preliminary attempt was made to define parameters that could possibly define CpG and CpNpG islands in Arabidopsis, by exploring the compositional landscape around the transcriptional start site. To this end, a data set of more than 5,000 gene sequences was built, including the promoter region, the 5'-untranslated region, and the first introns and coding exons. Preliminary analysis shows that promoter location based on the detection of potential CpG/CpNpG islands in the Arabidopsis genome is not straightforward. Nevertheless, because the landscape of CpG/CpNpG islands differs considerably between promoters and introns on the one side and exons (whether coding or not) on the other, more sophisticated approaches can probably be developed for the successful detection of "putative" CpG and CpNpG islands in plants.
Figures





Similar articles
-
Identification of plant promoter constituents by analysis of local distribution of short sequences.BMC Genomics. 2007 Mar 8;8:67. doi: 10.1186/1471-2164-8-67. BMC Genomics. 2007. PMID: 17346352 Free PMC article.
-
Genome wide analysis of Arabidopsis core promoters.BMC Genomics. 2005 Feb 25;6:25. doi: 10.1186/1471-2164-6-25. BMC Genomics. 2005. PMID: 15733318 Free PMC article.
-
Clusters of regulatory signals for RNA polymerase II transcription associated with Alu family repeats and CpG islands in human promoters.Genomics. 2004 May;83(5):873-82. doi: 10.1016/j.ygeno.2003.11.001. Genomics. 2004. PMID: 15081116
-
Synthetic Promoters: Designing the cis Regulatory Modules for Controlled Gene Expression.Mol Biotechnol. 2018 Aug;60(8):608-620. doi: 10.1007/s12033-018-0089-0. Mol Biotechnol. 2018. PMID: 29855997 Review.
-
Identification and validation of promoters and cis-acting regulatory elements.Plant Sci. 2014 Mar;217-218:109-19. doi: 10.1016/j.plantsci.2013.12.007. Epub 2013 Dec 14. Plant Sci. 2014. PMID: 24467902 Review.
Cited by
-
Comprehensive analysis and discovery of drought-related NAC transcription factors in common bean.BMC Plant Biol. 2016 Sep 7;16(1):193. doi: 10.1186/s12870-016-0882-5. BMC Plant Biol. 2016. PMID: 27604581 Free PMC article.
-
Structures of the three homoeologous loci of wheat benzoxazinone biosynthetic genes TaBx3 and TaBx4 and characterization of their promoter sequences.Theor Appl Genet. 2008 Feb;116(3):373-81. doi: 10.1007/s00122-007-0675-1. Epub 2007 Nov 27. Theor Appl Genet. 2008. PMID: 18040657
-
Isolation and characterization of drought and ABA responsive promoter of a transcription factor encoding gene from rice.Physiol Mol Biol Plants. 2022 Oct;28(10):1813-1831. doi: 10.1007/s12298-022-01246-9. Epub 2022 Nov 3. Physiol Mol Biol Plants. 2022. PMID: 36484033 Free PMC article.
-
Screening of tissue-specific genes and promoters in tomato by comparing genome wide expression profiles of Arabidopsis orthologues.Mol Cells. 2012 Jul;34(1):53-9. doi: 10.1007/s10059-012-0068-4. Epub 2012 Jun 12. Mol Cells. 2012. PMID: 22699756 Free PMC article.
-
GC-compositional strand bias around transcription start sites in plants and fungi.BMC Genomics. 2005 Feb 28;6:26. doi: 10.1186/1471-2164-6-26. BMC Genomics. 2005. PMID: 15733327 Free PMC article.
References
-
- Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796–815 - PubMed
-
- Antequera F, Bird A (1999) CpG islands as genomic footprints of promoters that are associated with replication origins. Curr Biol 9: R661–R667 - PubMed
-
- Aparicio S, Chapman J, Stupka E, Putnam N, Chia JM, Dehal P, Christoffels A, Rash S, Hoon S, Smit A et al. (2002) Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science 23: 1301–1310 - PubMed
-
- Ashikawa I (2001) Gene-associated CpG islands in plants as revealed by analyses of genomic sequences. Plant J 26: 617–625 - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials