Genome Biol. 2009;10(5):R46.
doi: 10.1186/gb-2009-10-5-r46. Epub 2009 May 1.

MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations

Jens Keilwagen et al. Genome Biol. 2009.

Abstract

Valuable binding-site annotation data are stored in databases. However, several types of errors can, and do, occur in the process of manually incorporating annotation data from the scientific literature into these databases. Here, we introduce MotifAdjuster (http://dig.ipk-gatersleben.de/MotifAdjuster.html), a tool that helps to detect these errors, and we demonstrate its efficacy on public data sets.

Figures

Figure 1
Comparison of binding-site conservation, showing the original sequence logos, the consensus sequences for the TFs obtained from the literature [56-61], and the adjusted sequence logos for the data sets of the TFs CpxR, Crp, Fis, Fnr, Fur, Lrp, and NarL. We find in all seven cases that (i) the adjusted sequence logos show a higher conservation than the original sequence logos, (ii) the adjusted sequence logos are more similar to the consensus sequences than the original sequence logos are, and (iii) clear motifs can be recognized in the adjusted sequence logos of the TFs CpxR, Fur, and NarL that could not be recognized in the original sequence logos.
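One common way to quantify the "conservation" that a sequence logo displays is the per-column information content in bits. The sketch below is an illustration only, not the procedure used by the authors: it assumes a uniform background over A, C, G, T, applies no small-sample correction, and uses made-up toy alignments. The function name `column_information` is hypothetical.

```python
import math
from collections import Counter


def column_information(sites):
    """Return the information content (bits) of each column of aligned DNA sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    info = []
    for i in range(length):
        counts = Counter(s[i] for s in sites)
        total = sum(counts.values())
        # Shannon entropy of the observed nucleotide frequencies in this column
        entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
        # 2 bits is the maximum for a 4-letter alphabet with a uniform background
        info.append(2.0 - entropy)
    return info


# Toy alignments invented for illustration; they are not data from the paper.
original = ["TACCCT", "ACCCTT", "TACTCT", "CCCTAA"]
adjusted = ["TACCCT", "TACCCT", "TACTCT", "TACCCT"]

print(sum(column_information(original)))
print(sum(column_information(adjusted)))
```

In this toy case the second alignment has the higher total information content, which is what a taller, more conserved logo would reflect.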
Figure 2
Position of the predicted NarL binding site (BS) in the upstream region of torC. The NarL BS TACCCT is located on the forward strand with respect to the target operon torCAD, starting at position -209 bp (shown in red). All positions are relative to the first nucleotide of the start codon of torC. (a) The fragment of the upstream region of the torCAD operon containing the NarL BS predicted by the PWM model trained on the adjusted data set. (b) Histogram of all positions of NarL BSs in the database. The red line indicates the position of the predicted BS.
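To make the idea of predicting a binding site with a PWM concrete, the following minimal sketch builds a log-odds position weight matrix from aligned sites and scans an upstream region for the best-scoring window. Everything here is an assumption for illustration: the training sites and upstream sequence are invented, the background is uniform, the pseudocount of 1 is arbitrary, only the forward strand is scanned, the input is assumed to contain only A, C, G, T, and the coordinate convention (the last upstream nucleotide is position -1) may differ from the one used in the figure. The names `build_pwm` and `best_hit` are hypothetical.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0):
    """Build a log-odds PWM (one dict per column) against a uniform background."""
    pwm = []
    for i in range(len(sites[0])):
        column = [s[i] for s in sites]
        total = len(column) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column.count(b) + pseudocount) / total) / 0.25)
            for b in BASES
        })
    return pwm


def best_hit(pwm, upstream):
    """Return (score, position, window) of the best forward-strand window.

    position is negative: the last nucleotide of `upstream` is taken to be -1.
    """
    width = len(pwm)
    best = None
    for start in range(len(upstream) - width + 1):
        window = upstream[start:start + width]
        score = sum(col[base] for col, base in zip(pwm, window))
        if best is None or score > best[0]:
            best = (score, start - len(upstream), window)
    return best


# Invented training sites and upstream region, not data from the paper.
sites = ["TACCCT", "TACCCT", "TACTCT", "TACCCC"]
upstream = "GGGATTTACCCTGGAAGT"

score, position, window = best_hit(build_pwm(sites), upstream)
print(position, window, round(score, 2))
```

A fuller analysis would also scan the reverse complement and compare the score against a threshold rather than simply taking the maximum.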

References

    1. Babu MM, Teichmann SA. Evolution of transcription factors and the gene regulatory network in Escherichia coli. Nucleic Acids Res. 2003;31:1234–1244. doi: 10.1093/nar/gkg210.
    2. Pabo CO, Sauer RT. Transcription factors: structural families and principles of DNA recognition. Annu Rev Biochem. 1992;61:1053–1095. doi: 10.1146/annurev.bi.61.070192.005201.
    3. Hellman LM, Fried MG. Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions. Nat Protoc. 2007;2:1849–1861. doi: 10.1038/nprot.2007.249.
    4. Galas DJ, Schmitz A. DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. Nucleic Acids Res. 1978;5:3157–3170. doi: 10.1093/nar/5.9.3157.
    5. Benotmane AM, Hoylaerts MF, Collen D, Belayew A. Nonisotopic quantitative analysis of protein-DNA interactions at equilibrium. Analyt Biochem. 1997;250:181–185. doi: 10.1006/abio.1997.2231.