Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 17;16(12):e1009272.
doi: 10.1371/journal.pgen.1009272. eCollection 2020 Dec.

Recombination events are concentrated in the spike protein region of Betacoronaviruses

Affiliations

Recombination events are concentrated in the spike protein region of Betacoronaviruses

Louis-Marie Bobay et al. PLoS Genet. .

Abstract

The Betacoronaviruses comprise multiple subgenera whose members have been implicated in human disease. As with SARS, MERS and now SARS-CoV-2, the origin and emergence of new variants are often attributed to events of recombination that alter host tropism or disease severity. In most cases, recombination has been detected by searches for excessively similar genomic regions in divergent strains; however, such analyses are complicated by the high mutation rates of RNA viruses, which can produce sequence similarities in distant strains by convergent mutations. By applying a genome-wide approach that examines the source of individual polymorphisms and that can be tested against null models in which recombination is absent and homoplasies can arise only by convergent mutations, we examine the extent and limits of recombination in Betacoronaviruses. We find that recombination accounts for nearly 40% of the polymorphisms circulating in populations and that gene exchange occurs almost exclusively among strains belonging to the same subgenus. Although experimental studies have shown that recombinational exchanges occur at random along the coronaviral genome, in nature, they are vastly overrepresented in regions controlling viral interaction with host cells.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Detecting recombination within Betacoronavirus subgenera based on ratios of homoplastic (h) to non-homoplastic (m) polymorphisms.
Within each bivariate plot, black dots and the grey-shaded area denote the median and standard deviation of h/m values of the indicated number of subsampled combinations of genomes; and red dots and pink-shaded area denote the median h/m values and standard deviation for simulated data in which all homoplasies are introduced by convergent mutations. Differences between the distributions of observed and simulated h/m values indicate the extent to which polymorphisms are attributable to recombination. Analyses were performed on the three subgenera for which genomes were sampled to sufficient depths to provide robust results.
Fig 2
Fig 2. Extent of recombination evident for each gene in Sarbecovirus genomes.
For each annotated gene (numbered), we calculated h/m ratios based on available sequences (grey) and on simulated datasets (pink). Box-and-whiskers plots of each gene show median (black line), interquartile range IQR (box), and 1.5*IQR (whiskers), and asterisks (and gene numbers highlighted in red) mark instances in which observed and simulated datasets differed significantly (p < 10−5, one-sided Wilcoxon test with Bonferroni correction) due to an excess of polymorphisms introduced by recombination.
Fig 3
Fig 3. Signal of recombination in strains of SARS-Cov-2.
Pattern of gene flow based on h/m ratios was analyzed in SARS-CoV-2 using 218 sequenced strains. As in Fig 1, black dots and the grey-shaded area denote the median and standard deviation of h/m values of the indicated number of subsampled combinations of genomes; and red dots and pink-shaded area denote the median h/m values and standard deviation for simulated data in which all homoplasies are introduced by convergent mutations.

References

    1. Ou X, Liu Y, Lei X, Li P, Mi D, Ren L, Guo L, Guo R, Chen T, Hu J, Xiang Z, Mu Z, Chen X, Chen J, Hu K, Jin Q, Wang J, Qian Z. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat Commun. 2020; 11(1):1620 10.1038/s41467-020-15562-9 - DOI - PMC - PubMed
    1. Wu F, Zhao S, Yu B, Chen Y-M, Wang W, Song Z-G, Hu Y, Tao Z-W, Tian J-H, Pei Y-Y, Yuan M-L, Zhang Y-L, Dai F-H, Liu Y, Wang Q-M, Zheng J-J, Xu L, Holmes EC, Zhang Y-Z. A new coronavirus associated with human respiratory disease in China. Nature. 2020; 579(7798):265–269. 10.1038/s41586-020-2008-3 - DOI - PMC - PubMed
    1. Zhang Y-Z, Holmes EC. A genomic perspective on the origin and emergence of SARS-CoV-2. Cell. 2020; 181(2):223–227. 10.1016/j.cell.2020.03.035 ; PMCID: PubMed Central PMC7194821. - DOI - PMC - PubMed
    1. Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, Si H-R, Zhu Y, Li B, Huang C-L, Chen H-D, Chen J, Luo Y, Guo H, Jiang R-D, Liu M-Q, Chen Y, Shen X-R, Wang X, Zheng X-S, Zhao K, Chen Q-J, Deng F, Liu L-L, Yan B, Zhan F-X, Wang Y-Y, Xiao G-F, Shi Z-L. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020; 579(7798):270–273. 10.1038/s41586-020-2012-7 - DOI - PMC - PubMed
    1. Lan J, Ge J, Yu J, Shan S, Zhou H, Fan S, Zhang Q, Shi X, Wang Q, Zhang L, Wang X. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature. 2020; 581(7807):215–220. 10.1038/s41586-020-2180-5 . - DOI - PubMed

Publication types

MeSH terms

Substances