Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2022 Feb 8:2021.04.28.441806.
doi: 10.1101/2021.04.28.441806.

Recombination patterns in coronaviruses

Affiliations

Recombination patterns in coronaviruses

Nicola F Müller et al. bioRxiv. .

Update in

Abstract

As shown during the SARS-CoV-2 pandemic, phylogenetic and phylodynamic methods are essential tools to study the spread and evolution of pathogens. One of the central assumptions of these methods is that the shared history of pathogens isolated from different hosts can be described by a branching phylogenetic tree. Recombination breaks this assumption. This makes it problematic to apply phylogenetic methods to study recombining pathogens, including, for example, coronaviruses. Here, we introduce a Markov chain Monte Carlo approach that allows inference of recombination networks from genetic sequence data under a template switching model of recombination. Using this method, we first show that recombination is extremely common in the evolutionary history of SARS-like coronaviruses. We then show how recombination rates across the genome of the human seasonal coronaviruses 229E, OC43 and NL63 vary with rates of adaptation. This suggests that recombination could be beneficial to fitness of human seasonal coronaviruses. Additionally, this work sets the stage for Bayesian phylogenetic tracking of the spread and evolution of SARS-CoV-2 in the future, even as recombinant viruses become prevalent.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest

The authors declare no conflict of interest

Figures

Figure 1:
Figure 1:. Evolutionary history of SARS-like viruses.
A Maximum clade credibility network of SARS-like viruses. Blue dots denote samples and green dots recombination events. B Common ancestor times of Wuhan-Hu1 (SARS-CoV-2) with different SARS-like viruses on different positions of the genome. The y-axis denotes common ancestor times in log scale. C Most recent time anywhere on the genome that Wuhan-Hu1 shared a common ancestor with different SARS-like viruses
Figure 2:
Figure 2:. Recombination networks and rates for coronaviruses MERS, 229E, OC43 and NL63.
Recombination networks for MERS (A) and seasonal human coronaviruses 229E (B), OC43 (C) and NL63 (D). E Recombination rates (per lineage and year) for the different coronaviruses compared to reassortment rates in seasonal human influenza A/H3N2 and influenza B viruses as estimated in Müller et al. (2020). For OC43 and NL63, the parts of the recombination networks that stretch beyond 1950 are not shown to increase readability of more recent parts of the networks.
Figure 3:
Figure 3:. Comparison of recombination rates with rates of adaptation on different parts of the genomes of seasonal human coronaviruses 229E, OC43 and NL63.
Relationship between estimated relative recombination rate (x-axis) and relative adaptation rate (y-axis) for three different seasonal human coronaviruses: 229E, OC43 and NL63. These estimates are shown for different parts of the genome, indicated by the different colors. These result from two different types of analysis: one using spike only (subunit 1 over subunit 2, shown in yellow) and one using the full genome (shown in orange, blue and green). The rate ratios denote the rate on a part of the genome divided by the average rate on the two other parts of the genome.
Figure 4:
Figure 4:. Example recombination network.
Events that can occur on a recombination network as considered here. We consider events to occur from present backwards in time to the past (as is the norm when looking at coalescent processes). Lineages can be added upon sampling events, which occur at predefined points in time and are conditioned on. Recombination events split the path of a lineage in two, with everything on one side of a recombination breakpoint going in one direction and everything on the other side of a breakpoint going in the other direction.

References

    1. Andersen K. G., Rambaut A., Lipkin W. I., Holmes E. C., and Garry R. F. (2020). The proximal origin of SARS-CoV-2. Nat Med, 26, 450–452. - PMC - PubMed
    1. Banner L. R. and Mc Lai M. (1991). Random nature of coronavirus rna recombination in the absence of selection pressure. Virology, 185(1), 441–445. - PMC - PubMed
    1. Barton N. (1995). A general model for the evolution of recombination. Genetics Research, 65(2), 123–144. - PubMed
    1. Bedford T., Greninger A. L., Roychoudhury P., Starita L. M., Famulare M., Huang M.-L., Nalla A., Pepper G., Reinhardt A., Xie H., et al. (2020). Cryptic transmission of sars-cov-2 in washington state. Science, 370(6516), 571–575. - PMC - PubMed
    1. Bhatt S., Katzourakis A., and Pybus O. G. (2010). Detecting natural selection in rna virus populations using sequence summary statistics. Infection, Genetics and Evolution, 10(3), 421–430. - PubMed

Publication types