Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Mar 15:6:29.
doi: 10.1186/1471-2180-6-29.

ExtraTrain: a database of Extragenic regions and Transcriptional information in prokaryotic organisms

Affiliations

ExtraTrain: a database of Extragenic regions and Transcriptional information in prokaryotic organisms

Eduardo Pareja et al. BMC Microbiol. .

Abstract

Background: Transcriptional regulation processes are the principal mechanisms of adaptation in prokaryotes. In these processes, the regulatory proteins and the regulatory DNA signals located in extragenic regions are the key elements involved. As all extragenic spaces are putative regulatory regions, ExtraTrain covers all extragenic regions of available genomes and regulatory proteins from bacteria and archaea included in the UniProt database.

Description: ExtraTrain provides integrated and easily manageable information for 679816 extragenic regions and for the genes delimiting each of them. In addition ExtraTrain supplies a tool to explore extragenic regions, named Palinsight, oriented to detect and search palindromic patterns. This interactive visual tool is totally integrated in the database, allowing the search for regulatory signals in user defined sets of extragenic regions. The 26046 regulatory proteins included in ExtraTrain belong to the families AraC/XylS, ArsR, AsnC, Cold shock domain, CRP-FNR, DeoR, GntR, IclR, LacI, LuxR, LysR, MarR, MerR, NtrC/Fis, OmpR and TetR. The database follows the InterPro criteria to define these families. The information about regulators includes manually curated sets of references specifically associated to regulator entries. In order to achieve a sustainable and maintainable knowledge database ExtraTrain is a platform open to the contribution of knowledge by the scientific community providing a system for the incorporation of textual knowledge.

Conclusion: ExtraTrain is a new database for exploring Extragenic regions and Transcriptional information in bacteria and archaea. ExtraTrain database is available at http://www.era7.com/ExtraTrain/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Case study: constructing the working set. The set of extragenic regions upstream genes encoding AcrR BLAST similar proteins has been incorporated to the "working set". For extragenic sequences 8, 10, 13, 16 and 17 the check-box for obtaining the complementary inverted sequence has been marked. Thus, the 17 upstream extragenic sequences are equally oriented with regard to the start points of the genes. Clicking on "FASTA SEQUENCES" button the user obtains the extragenic sequences in FASTA format. Clicking on "PALINSIGHT" button the user sends the sequences to Palinsight viewer.
Figure 2
Figure 2
Case study: Palinsight displaying the shared palindrome in sequences 1 to 10 of the working set. Using Palinsight we have detected a shared palindrome in these extragenic sequences upstream genes encoding AcrR similar proteins. The same palindrome is conserved for all Escherichia coli and Shigella flexneri sequences (extragenic sequences 1–6). Another slightly different palindrome is conserved in Salmonella enterica and Salmonella typhimurium (extragenic sequences 7–10).
Figure 3
Figure 3
Case study: Palinsight displaying the shared palindrome in sequences 11 to 17 of the working set. Extragenic sequences 11–14 from Yersinia present another identical palindrome. Palindromes detected for Erwinia carotovora, Photorhabdus luminescens and Pseudomonas syringae present more differences but the 17 extragenic sequences conserve the palindromic motif TAC - -ACA- - - -|- - - -TGT - -GTA that appears at the top of the figures 2 and 3. The detected palindromes are candidates to be binding-sites for AcrR and AcrR similar proteins.
Figure 4
Figure 4
Case study: Palinsight displaying the window of Results of searching the pattern. When we searched for the pattern TAC - -ACA- - - -|- - - -TGT - -GTA clicking on the "Search this pattern" button we obtained the positions of this motif in the set of selected extragenic sequences. Table 3 is the copy of the complete content of this window.

Similar articles

Cited by

References

    1. Matys V, Fricke E, Geffers R, Gossling E, Haubrock M, Hehl R, Hornischer K, Karas D, Kel AE, Kel-Margoulis OV, Kloos DU, Land S, Lewicki-Potapov B, Michael H, Munch R, Reuter I, Rotert S, Saxel H, Scheer M, Thiele S, Wingender E. TRANSFAC: transcriptional regulation from patterns to profiles. Nucleic Acids Res. 2003;31:374–378. doi: 10.1093/nar/gkg108. - DOI - PMC - PubMed
    1. Salgado H, Gama-Castro S, Martinez-Antonio A, Diaz-Peredo E, Sanchez-Solano F, Peralta-Gil M, Garcia-Alonso D, Jimenez-Jacinto V, Santos-Zavaleta A, Bonavides-Martinez C, Collado-Vides J. RegulonDB (version 40): transcri ptional regulation operon organization and growth conditions in Escherichia coli K-12. Nucleic Acids Res. 2004:D303–6. doi: 10.1093/nar/gkh140. - DOI - PMC - PubMed
    1. Tobes R, Ramos JL. AraC-XylS database: a family of positive transcriptional regulators in bacteria. Nucleic Acids Res. 2002;30:318–321. doi: 10.1093/nar/30.1.318. - DOI - PMC - PubMed
    1. Martinez-Bueno M, Molina-Henares AJ, Pareja E, Ramos JL, Tobes R. BacTregulators: a database of transcriptional regulators in bacteria and archaea. Bioinformatics. 2004;20:2787–2791. doi: 10.1093/bioinformatics/bth330. - DOI - PubMed
    1. Di Cara A, Schmidt K, Hemmings BA, Oakeley EJ. PromoterPlot: a graphical display of promoter similarities by pattern recognition. Nucleic Acids Res. 2005;33 :W423–6. doi: 10.1093/nar/gki413. - DOI - PMC - PubMed

Publication types

LinkOut - more resources