Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 15;26(12):5728.
doi: 10.3390/ijms26125728.

TARNAS: A Software Tool for Abstracting and Translating RNA Secondary Structures

Affiliations

TARNAS: A Software Tool for Abstracting and Translating RNA Secondary Structures

Michela Quadrini et al. Int J Mol Sci. .

Abstract

Ribonucleic acids (RNAs) fold into complex structures that are strongly associated with their biological functions. These can be abstracted into secondary structures, represented as nucleotide sequences annotated with base-pairing information. This abstraction is both biologically relevant and computationally manageable. Comparing and classifying RNA molecules typically relies on these secondary structure representations, which exist in multiple formats. In this work, we introduce TARNAS 1.0, a software tool designed to convert RNA secondary structure representations across multiple formats, including Base Pair Sequence (BPSEQ), Connect Table (CT), dot-bracket, Arc-Annotated Sequence (AAS), Fast-All (FASTA), and RNA Markup Language (RNAML). The tool offers options for retaining or removing comments, blank lines, and headers during the conversion process. These format translation and preprocessing capabilities are specifically designed to support the batch handling of large collections of RNA molecules, making TARNAS well suited for large dataset construction and database curation. Beyond format translation, TARNAS computes three levels of abstraction for RNA secondary structures, namely core, core plus, and shape, as well as a set of statistical descriptors for both primary and secondary structure. These abstraction and analysis features are intended to facilitate the comparison of molecules and the identification of recurring structural patterns, which are essential steps for associating structural motifs with molecular function. TARNAS is available as both a standalone desktop application and a web-based tool. The desktop version supports batch processing of large datasets, while the web version is optimized for the analysis of single molecules.

Keywords: RNA secondary structure formats; RNAML; abstractions of RNA secondary structures; statistics on RNA secondary structures.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Arc diagram representation of RNA secondary structures. Red circles indicate nucleotides, while arcs denote base-pair interactions. The motif shown in part (a) is pseudoknot-free, whereas the one in part (b) contains a pseudoknot.
Figure 2
Figure 2
The shape (a), core (b), and core plus (c) abstractions of the RNA secondary structure depicted in Figure 1.
Figure 3
Figure 3
Interface of the TARNAS web app.
Figure 4
Figure 4
Interface of the TARNAS standalone version.
Figure 5
Figure 5
BPSEQ (a) and CT (b) representations of the RNA secondary structure from Figure 1.
Figure 6
Figure 6
RNAML (a), AAS (b), dot-bracket (c), and FASTA (d) representations of the RNA secondary structure from Figure 1.
Figure 7
Figure 7
Schematic view of key RNAML markup elements. (a) Top-level RNAML elements. (b) Elements within the molecule description. (c) Elements used in tertiary structure annotation.
Figure 8
Figure 8
TARNAS workflow for translation. Each molecule file is parsed using either ANTLR4 or the W3C DOM, producing an internal representation of the secondary structure. This internal structure is then rendered in the requested output format.

References

    1. Leontis N.B., Westhof E. Geometric nomenclature and classification of RNA base pairs. RNA. 2001;7:499–512. doi: 10.1017/S1355838201002515. - DOI - PMC - PubMed
    1. Shapiro B.A., Zhang K. Comparing multiple RNA secondary structures using tree comparisons. Bioinformatics. 1990;6:309–318. doi: 10.1093/bioinformatics/6.4.309. - DOI - PubMed
    1. Chiu J.K.H., Chen Y.P.P. Pairwise RNA secondary structure alignment with conserved stem pattern. Bioinformatics. 2015;31:3914–3921. doi: 10.1093/bioinformatics/btv471. - DOI - PubMed
    1. Hochsmann M., Voss B., Giegerich R. Pure multiple RNA secondary structure alignments: A progressive profile approach. IEEE/ACM Trans. Comput. Biol. Bioinform. 2004;1:53–62. doi: 10.1109/TCBB.2004.11. - DOI - PubMed
    1. Hochsmann M., Toller T., Giegerich R., Kurtz S. Local similarity in RNA secondary structures; Proceedings of the 2003 IEEE Bioinformatics Conference—CSB2003; Stanford, CA, USA. 11–14 August 2003; Piscataway, NJ, USA: IEEE; 2003. pp. 159–168. - PubMed

LinkOut - more resources