Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 1;16(1):7.
doi: 10.1186/s13015-021-00187-4.

The energy-spectrum of bicompatible sequences

Affiliations

The energy-spectrum of bicompatible sequences

Fenix W Huang et al. Algorithms Mol Biol. .

Abstract

Background: Genotype-phenotype maps provide a meaningful filtration of sequence space and RNA secondary structures are particular such phenotypes. Compatible sequences, which satisfy the base-pairing constraints of a given RNA structure, play an important role in the context of neutral evolution. Sequences that are simultaneously compatible with two given structures (bicompatible sequences), are beacons in phenotypic transitions, induced by erroneously replicating populations of RNA sequences. RNA riboswitches, which are capable of expressing two distinct secondary structures without changing the underlying sequence, are one example of bicompatible sequences in living organisms.

Results: We present a full loop energy model Boltzmann sampler of bicompatible sequences for pairs of structures. The sequence sampler employs a dynamic programming routine whose time complexity is polynomial when assuming the maximum number of exposed vertices, [Formula: see text], is a constant. The parameter [Formula: see text] depends on the two structures and can be very large. We introduce a novel topological framework encapsulating the relations between loops that sheds light on the understanding of [Formula: see text]. Based on this framework, we give an algorithm to sample sequences with minimum [Formula: see text] on a particular topologically classified case as well as giving hints to the solution in the other cases. As a result, we utilize our sequence sampler to study some established riboswitches.

Conclusion: Our analysis of riboswitch sequences shows that a pair of structures needs to satisfy key properties in order to facilitate phenotypic transitions and that pairs of random structures are unlikely to do so. Our analysis observes a distinct signature of riboswitch sequences, suggesting a new criterion for identifying native sequences and sequences subjected to evolutionary pressure. Our free software is available at: https://github.com/FenixHuang667/Bifold .

Keywords: Evolutionary transition; Riboswitch; Topological nerve.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Riboswitch. Alternative structures of the Adenine riboswitch [24] and its switching sequence (blue), involved two respective helices
Fig. 2
Fig. 2
From two secondary structures to a topological space: A two secondary structures drawn in the upper- and lower-half plan respectively. A rainbow arc (0, 9) is added to both of the structures to close to exterior loop. B The secondary structures are decomposed into loops. The loops induce a hyper-graph G=(V={0,,9},E), where E={e1,e2,e3,e4,e5,e6} with e1={0,1,4,5,7,8,9},e2={1,2,3,4},e3={2,3},e4={5,6,7}, e5={0,1,2,3,4,5,6,8,9} and e6={6,7,8} are loops. C The nerve over E. A (d-1)-simplex represent the nontrivial intersection of d loops. D The topological quotient space K(R) induced by C, where a ribbon induced by {e1,e2,e5,e3} is glued to a sphere induced by {e1,e5,e4,e6}
Fig. 3
Fig. 3
Geometric interpretation of the 0th-, 1st-, and 2nd- homology group. A The rank of the 0th homology group is 2, counting the number of connected components, where each of them can be contracted to a single point. B The rank of the 1st homology group is 2, counting the number of uncontactable circles on a torus. C The rank of the 2nd homology group is 2, counting the number of empty volume (2-dimensional holes) induced by the spheres
Fig. 4
Fig. 4
A An RNA secondary structure. B Diagram representation of A. C A secondary structure with a rainbow arc (dash). D Loops in a secondary structure. The loop (gray) contains a distinguished maximal arc (1, 17). E A bistructure B is a collection of loops {L1,,L9}. X={L3,L7,L9} (blue) is an irreducible substructure of B with its complement X¯={L1,L2,L4,L5,L6,L8}. We mark the exposed vertices EX=VXVX¯ in red. The closure of X is given by X~={L2,L3,L4,L6,L7,L8,L9}
Fig. 5
Fig. 5
Illustration of the recursion for the partition function. Given a bistructure X with exposed vertices v1 and v2, we decomposed X into a loop L(v1,u1,u2,u3) and a substructure X(v1,u1,u2,u3,v2). Here we assume v1, u1, u2, u3, and v2 are contained in loops of X, hence v1 and v2 remain exposed and u1, u2, and u3 are newly exposed vertices
Fig. 6
Fig. 6
A decomposition of a bistructure (LHS) and the evolution of its loop nerve (RHS). LHS: exposed vertices are labelled red. RHS: the loops 1, 2, 3, 5 and 6 form a sphere. Removing one loop from this sphere is tantamount to deleting one vertex of the loop nerve. The white vertices in the loop nerve represent the boundary of the substructure. The sphere corresponding to the crossing component is resolved by removing the S-arc
Fig. 7
Fig. 7
Examples of topological realizations of the loop nerves for different bistructures: A an empty tetrahedron or a sphere), B a filled tetrahedron, and C two filled triangles glued along a mutual edge
Fig. 8
Fig. 8
Splitting an overlap without inducing crossing arcs (A). The split is tantamount to removing an edge of the corresponding filled tetrahedron as well as its interior, ending up with two triangles that are still glued along the opposite edge from the edge we removed. B The split effect on the corresponding simplicial complex of A
Fig. 9
Fig. 9
A bistructure having no overlap and crossing arcs (left), its loop nerve (middle), and the -graph (right). Each vertex in the -graph presents a triangle in the loop nerve, labeled by a triple of loops. Its -graph is a tree
Fig. 10
Fig. 10
Topologically guided optimal tree decomposition: A a bistructure without crossing arcs or overlaps. B The hyper-graph of A. C Its quotient space is a ribbon. D The optimal tree decomposition of B obtained by following (C)
Fig. 11
Fig. 11
The R- and S spectra of the riboswitch structure pair, add and rand1. Top left: fQ(R)R versus fQ(R)|SR for add. Top right: fQ(S)S versus fQ(S)|RS for add. Bottom left: fQ(R)R versus fQ(R)|SR for rand1. Bottom right: fQ(S)S versus fQ(S)|RS for rand1
Fig. 12
Fig. 12
The R- and S spectra of the riboswitch structure pair, add and rand1. Top left: fQ(R)R versus fQ(R,S)R for add. Top right: fQ(S)S versus fQ(R,S)S for add. Bottom left: fQ(R)R versus fQ(R,S)R for rand1. Bottom right: fQ(S)S versus fQ(R,S)S for rand1
Fig. 13
Fig. 13
(rR,rS) for the riboswitches mgt (left) and lys (right): Boltzmann sampled sequences versus native sequences. For each riboswitch we Boltzmann sample 103 sequences and compute (rR,rS) for the sampled sequences (blue). We contrast this with (rR,rS) of for the respective, native riboswitch sequence (red)
Fig. 14
Fig. 14
The energy weighed space of bicompatible sequences. (wR,wS) of the six riboswitch structure pairs, presented in Table 1 (blue). Furthermore: (wR,wS) of 50 random structure pairs of length 150, as control set (red)

Similar articles

References

    1. Fontana W, Stadler PF, Bornberg-Bauer EG, Griesmacher T, Hofacker IL, Tacker M, Tarazona P, Weinberger ED. RNA folding and combinatory landscapes. Phys Rev E. 1993;47:2083–2099. doi: 10.1103/PhysRevE.47.2083. - DOI - PubMed
    1. Grüner W, Giegerich R, Strothmann D, Reidys C, Weber J, Hofacker IL, Stadler PF, Schuster P. Analysis of RNA sequence structure maps by exhaustive enumeration I. Neutral networks. Monatsh Chem. 1996;127:355–374. doi: 10.1007/BF00810881. - DOI
    1. Grüner W, Giegerich R, Strothmann D, Reidys C, Weber J, Hofacker IL, Stadler PF, Schuster P. Analysis of RNA sequence structure maps by exhaustive enumeration II. Structures of neutral networks and shape space covering. Monatsh Chem. 1996;127:375–389. doi: 10.1007/BF00810882. - DOI
    1. Schuster P, Fontana W, Stadler PF, Hofacker IL. From sequences to shapes and back: a case study in RNA secondary structures. Proc Biol Sci. 1994;255(1344):279–84. doi: 10.1098/rspb.1994.0040. - DOI - PubMed
    1. Reidys CM, Stadler PF, Schuster P. Generic properties of combinatory maps: neural networks of RNA secondary structures+ Bull Math Biol. 1997;59:339–397. doi: 10.1007/BF02462007. - DOI - PubMed