Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2014 May 1:5:114.
doi: 10.3389/fgene.2014.00114. eCollection 2014.

tRNADB-CE: tRNA gene database well-timed in the era of big sequence data

Affiliations
Review

tRNADB-CE: tRNA gene database well-timed in the era of big sequence data

Takashi Abe et al. Front Genet. .

Abstract

The tRNA gene data base curated by experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that batch-learning self-organizing-map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.

Keywords: BLSOM; big data; database; metagenome; phylogenic maker; tRNA.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Basic functions of tRNADB-CE. (A) Statistics information for frequency of nucleotide in each position on tRNA cloverleaf secondary structure, (B) List of Phylum/Class/Species and anticodon table, (C) tRNA gene list, (D) Detailed information, (E) Search of neighboring tRNA genes, and (F) Advanced keyword and pattern search.
FIGURE 2
FIGURE 2
List and search for (A) Identical sequence groups (ISGs) found in environmental samples, (B) Table of microbial population in all samples, (C) List of microbial population in a sample, and (D) Detailed information for samples assignable to phylum.
FIGURE 3
FIGURE 3
Examples of non-standard-type tRNAs. (A) The anticodon occurrence table of bacterial tRNAs. It is clear that A-starting anticodons are very rare in bacterial tRNAs. (B) Cloverleaf secondary structure of M. mobile tRNAIle2.

Similar articles

Cited by

References

    1. Abe T., Ikemura T., Ohara Y., Uehara H., Kinouchi M., Kanaya S., et al. (2009). tRNADB-CE: tRNA gene database curated manually by experts. Nucleic Acids Res. 37 D163–D168 10.1093/nar/gkn692 - DOI - PMC - PubMed
    1. Abe T., Ikemura T., Sugahara J., Kanai A., Ohara Y., Uehara H., et al. (2011). tRNADB-CE 2011: tRNA gene database curated manually by experts. Nucleic Acids Res. 39 D210–D213 10.1093/nar/gkq1007 - DOI - PMC - PubMed
    1. Abe T., Kanaya S., Kinouchi M., Ichiba M., Kozuki T., Ikemura T. (2003). Informatics for unveiling hidden genome signatures. Genome Res. 13 693–702 10.1101/gr.634603 - DOI - PMC - PubMed
    1. Chan P. P., Lowe T. M. (2009). A database of transfer RNA genes detected in genome sequence. Nucleic Acids Res. 37 D93–D97 10.1093/nar/gkn787 - DOI - PMC - PubMed
    1. Iwasaki Y., Wada K., Wada Y., Abe T., Ikemura T. (2013a). Notable clustering of transcription-factor-binding motifs in human pericentric regions and its biological significance. Chromosome Res. 5 461–474 10.1007/s10577-013-9371-y - DOI - PMC - PubMed

LinkOut - more resources