Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 20;49(16):9077-9096.
doi: 10.1093/nar/gkab688.

tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes

Affiliations

tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes

Patricia P Chan et al. Nucleic Acids Res. .

Abstract

tRNAscan-SE has been widely used for transfer RNA (tRNA) gene prediction for over twenty years, developed just as the first genomes were decoded. With the massive increase in quantity and phylogenetic diversity of genomes, the accurate detection and functional prediction of tRNAs has become more challenging. Utilizing a vastly larger training set, we created nearly one hundred specialized isotype- and clade-specific models, greatly improving tRNAscan-SE's ability to identify and classify both typical and atypical tRNAs. We employ a new comparative multi-model strategy where predicted tRNAs are scored against a full set of isotype-specific covariance models, allowing functional prediction based on both the anticodon and the highest-scoring isotype model. Comparative model scoring has also enhanced the program's ability to detect tRNA-derived SINEs and other likely pseudogenes. For the first time, tRNAscan-SE also includes fast and highly accurate detection of mitochondrial tRNAs using newly developed models. Overall, tRNA detection sensitivity and specificity is improved for all isotypes, particularly those utilizing specialized models for selenocysteine and the three subtypes of tRNA genes encoding a CAU anticodon. These enhancements will provide researchers with more accurate and detailed tRNA annotation for a wider variety of tRNAs, and may direct attention to tRNAs with novel traits.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic diagram of tRNAscan-SE 2.0 search algorithm. Three pathways were developed for cytosolic tRNA search modes with the addition of the mitochondrial tRNA search mode. The default method employs Infernal 1.1 (20) with newly built covariance models for similarity search while the legacy search remains the same as tRNAscan-SE 1.3 (1) for backward compatibility.
Figure 2.
Figure 2.
Score distributions of tRNA genes comparing tRNAscan-SE 2.0 to 1.3 predictions show substantial agreement, with most differences among low-scoring (<50 bit) tRNA genes. (A) 10 354 tRNA predictions in 217 archaeal genomes, (B) 246 271 predictions in 4036 bacterial genomes and (C) 80 824 predictions in 432 fungal genomes were detected using tRNAscan-SE 2.0 with default search modes for each respective domain. Prediction results were grouped into four categories: (i) ‘Consistent’ between versions, (ii) ‘Isotype Mismatch’ where v2.0 and v1.3 predictions have different isotype classifications, (iii) ‘Novel’ for predictions new in v2.0 and (iv) ‘Not detected’ for prior 1.3 predictions no longer detected by v2.0. Each histogram represents the prediction scores binned by 10 bits in the corresponding comparison category. Vertical black dashed lines represent the median score in the category, and vertical red dashed lines demarcate the 50-bit ‘low score’ threshold.
Figure 3.
Figure 3.
Direct comparison of isotype-specific covariance model scores provides an effective classifier of tRNAs with anticodon CAU. Dots represent individual tRNA genes with the highest scores from the initiator methionine/N-formylmethionine model (tRNAiMet/fMet, red), elongator methionine model (tRNAMet, blue) or isoleucine-decoding CAU model (tRNAIle2, green). Each tRNA gene was scanned with the isotype-specific covariance models of the corresponding genome's domain (A) eukaryotes, (B) bacteria, or (C) archaea.
Figure 4.
Figure 4.
Isotype uncertainty in human tRNA-Leu-CAA-5–1. The primary sequence and the secondary structure comparison of (A) tRNA-Met-CAT-3–1, (B) tRNA-Leu-CAA-5–1, and (C) tRNA-Leu-CAA-1–1 shows that tRNA-Leu-CAA-5–1 is more similar to a tRNAMet than a tRNALeu even though it has an anticodon that would decode leucine. The bases highlighted in orange represent the differences between the respective sequence and tRNA-Leu-CAA-5–1. The numbers next to the bases represent the Sprinzl canonical tRNA positions (16).

References

    1. Lowe T.M., Eddy S.R.. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997; 25:955–964. - PMC - PubMed
    1. Schattner P., Brooks A.N., Lowe T.M.. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005; 33:W686–W689. - PMC - PubMed
    1. Lowe T.M., Chan P.P.. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016; 44:W54–W57. - PMC - PubMed
    1. Chan P.P., Lowe T.M.. GtRNAdb: a database of transfer RNA genes detected in genomic sequence. Nucleic Acids Res. 2009; 37:D93–D97. - PMC - PubMed
    1. Chan P.P., Lowe T.M.. GtRNAdb 2.0: an expanded database of transfer RNA genes identified in complete and draft genomes. Nucleic Acids Res. 2016; 44:D184–D189. - PMC - PubMed

Publication types

LinkOut - more resources