Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 25;85(11):107.
doi: 10.1007/s11538-023-01209-5.

Rearrangement Events on Circular Genomes

Affiliations

Rearrangement Events on Circular Genomes

Joshua Stevenson et al. Bull Math Biol. .

Abstract

Early literature on genome rearrangement modelling views the problem of computing evolutionary distances as an inherently combinatorial one. In particular, attention is given to estimating distances using the minimum number of events required to transform one genome into another. In hindsight, this approach is analogous to early methods for inferring phylogenetic trees from DNA sequences such as maximum parsimony-both are motivated by the principle that the true distance minimises evolutionary change, and both are effective if this principle is a true reflection of reality. Recent literature considers genome rearrangement under statistical models, continuing this parallel with DNA-based methods, with the goal of using model-based methods (for example maximum likelihood techniques) to compute distance estimates that incorporate the large number of rearrangement paths that can transform one genome into another. Crucially, this approach requires one to decide upon a set of feasible rearrangement events and, in this paper, we focus on characterising well-motivated models for signed, uni-chromosomal circular genomes, where the number of regions remains fixed. Since rearrangements are often mathematically described using permutations, we isolate the sets of permutations representing rearrangements that are biologically reasonable in this context, for example inversions and transpositions. We provide precise mathematical expressions for these rearrangements, and then describe them in terms of the set of cuts made in the genome when they are applied. We directly compare cuts to breakpoints, and use this concept to count the distinct rearrangement actions which apply a given number of cuts. Finally, we provide some examples of rearrangement models, and include a discussion of some questions that arise when defining plausible models.

Keywords: Combinatorics; Genome rearrangements; Phylogenetics.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
On the left is a drawing of the 5-region genome represented by the identity permutation 1234512345, which we refer to as the reference genome. On the right is a drawing of the genome represented by σ=123451¯435¯2¯. Alternate representations are summarised in Table 1
Fig. 2
Fig. 2
Diagrams representing each of the 2n=2(5)=10 instances of the example genome Dnσ arising from its dihedral symmetries. The bottom left diagram matches the instance σ=1¯435¯2¯ (also shown in Fig. 1). Positions run clockwise starting from position 1 in the top-right of the circle
Fig. 3
Fig. 3
A graph representation of the Markov chain for a model of inversions on circular genomes with 3 oriented regions. Node labels are of the form zσ where z is the symmetry element and σ is one particular instance of the genome (position paradigm, region position) written in one-row notation. Lines between genomes show the probability of a rearrangement which transforms one into the other. The minimum number of steps needed to traverse the graph from one genome to another is the ‘inversion distance’. The edge weights are the same for any choice of probability mapping w, since with only three regions, every inversion of two regions can be written as an inversion of a single region
Fig. 4
Fig. 4
An inversion represented by the permutation α=(33¯)(24¯)(2¯4) (as in Example 4.3), applied to the reference genome (Color figure online)
Fig. 5
Fig. 5
Different ways to reconnect three labelled segments to perform a rearrangement that requires 3 cuts (Color figure online)
Fig. 6
Fig. 6
A diagram representing the identity rearrangement, and three representing rearrangements that require 2 cuts (Color figure online)
Fig. 7
Fig. 7
Diagrams representing fission rearrangements, which we do not consider here (Color figure online)

References

    1. Alexandrino A, Oliveira A, Dias U, Dias Z. On the complexity of some variations of sorting by transpositions. JUCS J Univ Comput Sci. 2020;26(9):1076–1094. doi: 10.3897/jucs.2020.057. - DOI
    1. Alexandrino AO, Oliveira AR, Dias U, Dias Z. Incorporating intergenic regions into reversal and transposition distances with indels. J Bioinform Comput Biol. 2021;19:6. doi: 10.1142/s0219720021400114. - DOI - PubMed
    1. Alexeev N, Aidagulov R, Alekseyev MA. A computational method for the rate estimation of evolutionary transpositions. In: Ortuño F, Rojas I, editors. Bioinform Biomed Eng. Cham: Springer; 2015. pp. 471–480.
    1. Bader David A, Moret Bernard ME, Mi Y. A linear-time algorithm for computing inversion distance between signed permutations with an experimental study. J Comput Biol. 2001;8(5):483–491. doi: 10.1089/106652701753216503. - DOI - PubMed
    1. Bafna V, Pevzner PA. Genome rearrangements and sorting by reversals. SIAM J Comput. 1996;25(2):272–289. doi: 10.1137/s0097539793250627. - DOI

Publication types

LinkOut - more resources