Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Jan;36(Database issue):D88-92.
doi: 10.1093/nar/gkm964. Epub 2007 Dec 11.

DBD--taxonomically broad transcription factor predictions: new content and functionality

Affiliations

DBD--taxonomically broad transcription factor predictions: new content and functionality

Derek Wilson et al. Nucleic Acids Res. 2008 Jan.

Abstract

DNA-binding domain (DBD) is a database of predicted sequence-specific DNA-binding transcription factors (TFs) for all publicly available proteomes. The proteomes have increased from 150 in the initial version of DBD to over 700 in the current version. All predicted TFs must contain a significant match to a hidden Markov model representing a sequence-specific DNA-binding domain family. Access to TF predictions is provided through http://transcriptionfactor.org, where new search options are now provided such as searching by gene names in model organisms, searching for all proteins in a particular DBD family and specific organism. We illustrate the application of this type of search facility by contrasting trends of DBD family occurrence throughout the tree of life, highlighting the clear partition between eukaryotic and prokaryotic DBD expansions. The website content has been expanded to include dedicated pages for each TF containing domain assignment details, gene names, links to external databases and links to TFs with similar domain arrangements. We compare the increase in number of predicted TFs with proteome size in eukaryotes and prokaryotes. Eukaryotes follow a slower rate of increase in TFs than prokaryotes, which could be due to the presence of splice variants or an increase in combinatorial control.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Examples of new search capabilities and content. (a) Search for TFs from a particular organism containing a specified DBD. The example used here is TFs from Homo sapiens containing the homoeobox domain. (b) The search in (a) results in TF predictions from Homo sapiens containing the homoeobox DNA-binding domain. (c) Selection of HOXA9 from (b) results in a web page with detailed information on this particular TF. (d) Clicking on the Pfam domain combination link in (c) retrieves the subset of TF predictions, which have the same two-domain arrangement as the HOXA9 transcription factor.
Figure 2.
Figure 2.
(a) Expansion and contraction patterns of DBD occurrence across the tree of life. Each column corresponds to a Pfam DBD. Each row of the heatmap represents a genome, ordered using the NCBI taxonomy. The vertical coloured bars indicate superkingdoms, kingdoms or phyla to which genomes belong. Eukaryotes are indicated using a red bar, archaea using a green bar and bacteria using a blue bar. Other kingdoms are represented using white bars. DNA-binding domain families are clustered using the average linkage method with Pearson correlation distance. Red squares represent an expansion of a DBD family, green squares represent a contraction of that family in a genome relative to other genomes. (b) A zoom on DBD expansions in the viridiplantae lineage. (c) Illustration of the three-dimensional structure of one of the DBDs specifically expanded in the viridiplantae kingdom, the AP2 domain in complex with DNA. The AP2 family transcription factors are known to be involved in plant pathogen defence response processes.

References

    1. Robertson G, Bilenky M, Lin K, He A, Yuen W, Dagpinar M, Varhol R, Teague K, Griffith OL, et al. cisRED: a database system for genome-scale computational discovery of regulatory elements. Nucleic Acids Res. 2006;34:68–73. - PMC - PubMed
    1. Bulyk ML. Computational prediction of transcription-factor binding site locations. Genome Biol. 2003;5:201–201. - PMC - PubMed
    1. Hermoso A, Aguilar D, Aviles FX, Querol E. TrSDB: a proteome database of transcription factors. Nucleic Acids Res. 2004;32:171–173. - PMC - PubMed
    1. Messina DN, Glasscock J, Gish W, Lovett M. An ORFeome-based analysis of human transcription factor genes and the construction of a microarray to interrogate their expression. Genome Res. 2004;14:2041–2047. - PMC - PubMed
    1. Barrasa MI, Vaglio P, Cavasino F, Jacotot L, Walhout AJ. EdgeDB: a transcription factor-DNA interaction database for the analysis of C. elegans differential gene expression. BMC Genomics. 2007;8:21–21. - PMC - PubMed

Publication types