Assigning roles to DNA regulatory motifs using comparative genomics
- PMID: 20147307
- PMCID: PMC2844991
- DOI: 10.1093/bioinformatics/btq049
Assigning roles to DNA regulatory motifs using comparative genomics
Abstract
Motivation: Transcription factors (TFs) are crucial during the lifetime of the cell. Their functional roles are defined by the genes they regulate. Uncovering these roles not only sheds light on the TF at hand but puts it into the context of the complete regulatory network.
Results: Here, we present an alignment- and threshold-free comparative genomics approach for assigning functional roles to DNA regulatory motifs. We incorporate our approach into the Gomo algorithm, a computational tool for detecting associations between a user-specified DNA regulatory motif [expressed as a position weight matrix (PWM)] and Gene Ontology (GO) terms. Incorporating multiple species into the analysis significantly improves Gomo's ability to identify GO terms associated with the regulatory targets of TFs. Including three comparative species in the process of predicting TF roles in Saccharomyces cerevisiae and Homo sapiens increases the number of significant predictions by 75 and 200%, respectively. The predicted GO terms are also more specific, yielding deeper biological insight into the role of the TF. Adjusting motif (binding) affinity scores for individual sequence composition proves to be essential for avoiding false positive associations. We describe a novel DNA sequence-scoring algorithm that compensates a thermodynamic measure of DNA-binding affinity for individual sequence base composition. GOMO's prediction accuracy proves to be relatively insensitive to how promoters are defined. Because GOMO uses a threshold-free form of gene set analysis, there are no free parameters to tune. Biologists can investigate the potential roles of DNA regulatory motifs of interest using GOMO via the web (http://meme.nbcr.net).
Figures


Similar articles
-
MEME SUITE: tools for motif discovery and searching.Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8. doi: 10.1093/nar/gkp335. Epub 2009 May 20. Nucleic Acids Res. 2009. PMID: 19458158 Free PMC article.
-
SCOPE: a web server for practical de novo motif discovery.Nucleic Acids Res. 2007 Jul;35(Web Server issue):W259-64. doi: 10.1093/nar/gkm310. Epub 2007 May 7. Nucleic Acids Res. 2007. PMID: 17485471 Free PMC article.
-
Searching for statistically significant regulatory modules.Bioinformatics. 2003 Oct;19 Suppl 2:ii16-25. doi: 10.1093/bioinformatics/btg1054. Bioinformatics. 2003. PMID: 14534166
-
Associating transcription factor-binding site motifs with target GO terms and target genes.Nucleic Acids Res. 2008 Jul;36(12):4108-17. doi: 10.1093/nar/gkn374. Epub 2008 Jun 10. Nucleic Acids Res. 2008. PMID: 18544606 Free PMC article.
-
DNA Motif Databases and Their Uses.Curr Protoc Bioinformatics. 2015 Sep 3;51:2.15.1-2.15.6. doi: 10.1002/0471250953.bi0215s51. Curr Protoc Bioinformatics. 2015. PMID: 26334922 Review.
Cited by
-
A comprehensive meta-analysis of transcriptome data to identify signature genes associated with pancreatic ductal adenocarcinoma.PLoS One. 2024 Feb 7;19(2):e0289561. doi: 10.1371/journal.pone.0289561. eCollection 2024. PLoS One. 2024. PMID: 38324544 Free PMC article.
-
A temporal hierarchy underpins the transcription factor-DNA interactome of the maize UPR.Plant J. 2021 Jan;105(1):254-270. doi: 10.1111/tpj.15044. Epub 2020 Nov 15. Plant J. 2021. PMID: 33098715 Free PMC article.
-
Heterodimeric DNA motif synthesis and validations.Nucleic Acids Res. 2019 Feb 28;47(4):1628-1636. doi: 10.1093/nar/gky1297. Nucleic Acids Res. 2019. PMID: 30590725 Free PMC article.
-
Genome-wide distribution of Auts2 binding localizes with active neurodevelopmental genes.Transl Psychiatry. 2014 Sep 2;4(9):e431. doi: 10.1038/tp.2014.78. Transl Psychiatry. 2014. PMID: 25180570 Free PMC article.
-
Transcriptome data reveal gene clusters and key genes in pepper response to heat shock.Front Plant Sci. 2022 Sep 21;13:946475. doi: 10.3389/fpls.2022.946475. eCollection 2022. Front Plant Sci. 2022. PMID: 36212322 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
Miscellaneous