Proc Natl Acad Sci U S A. 2022 Aug 30;119(35):e2206610119.
doi: 10.1073/pnas.2206610119. Epub 2022 Aug 10.

Distinct evolutionary trajectories of SARS-CoV-2-interacting proteins in bats and primates identify important host determinants of COVID-19

Marie Cariou et al. Proc Natl Acad Sci U S A.

Abstract

The coronavirus disease 2019 (COVID-19) pandemic is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a coronavirus that spilled over from the bat reservoir. Despite numerous clinical trials and vaccines, the burden remains immense, and the host determinants of SARS-CoV-2 susceptibility and COVID-19 severity remain largely unknown. Signatures of positive selection detected by comparative functional genetic analyses in primate and bat genomes can uncover important and specific adaptations that occurred at virus-host interfaces. We performed high-throughput evolutionary analyses of 334 SARS-CoV-2-interacting proteins to identify SARS-CoV adaptive loci and uncover functional differences between modern humans, primates, and bats. Using DGINN (Detection of Genetic INNovation), we identified 38 bat and 81 primate proteins with marks of positive selection. Seventeen genes, including the ACE2 receptor, present adaptive marks in both mammalian orders, suggesting common virus-host interfaces and past epidemics of coronaviruses shaping their genomes. Yet, 84 genes presented distinct adaptations in bats and primates. Notably, residues involved in ubiquitination and phosphorylation of the inflammatory RIPK1 have rapidly evolved in bats but not primates, suggesting that inflammation is regulated differently in bats than in humans. Furthermore, we discovered residues with typical virus-host arms race marks in primates, such as in the entry factor TMPRSS2 or the autophagy adaptor FYCO1, pointing to host-specific in vivo interfaces that may be drug targets. Finally, we found that FYCO1 sites under adaptation in primates are those associated with severe COVID-19, supporting their importance in pathogenesis and replication. Overall, we identified adaptations involved in SARS-CoV-2 infection in bats and primates, shedding light on modern genetic determinants of virus susceptibility and severity.

Keywords: SARS-CoV-2 and COVID-19; comparative genetics; positive selection; primates and bats; virus–host coevolution.

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Identification of the SARS-CoV-2 interactome with signatures of positive selection (PS) in bats and primates. (A) Overview of the DGINN pipeline to detect adaptive evolution in SARS-CoV-2 VIPs. CDS, coding DNA sequence; ORF, open reading frame. (B) Natural selection acting on bat and primate VIP genes. Comparison of omega (dN/dS) values of the VIPs during bat (y axis) and primate (x axis) evolution, estimated by Bio++ Model M0. In black, the bisector. In red, the linear regression. The names correspond to genes that we comprehensively analyzed (Table 1). (C) Overview of the number of VIPs under significant PS (i.e., by at least three methods in the DGINN screen) in bats and/or primates. A total of 324 genes could be fully analyzed in the two mammalian orders. Numbers represent the number of genes in the categories: No PS or PS, within each host, is represented by a pictogram. The numbers correspond to the conservative values after visual inspection of the positively selected VIP alignments, while the italic numbers are from the automated screen. (D) Table showing the genes identified by x,y DGINN methods in bats and primates, respectively. For the genes with low DGINN scores (<3), only the number of genes in each category is shown (SI Appendix, Fig. S4 for details). Of note, seven primate genes are false positives, as follows: EMC1 (ER membrane protein complex subunit 1), MOV10 (Mov10 RISC complex RNA helicase), POR (cytochrome p450 oxidoreductase), PITRM1 (pitrilysin metallopeptidase 1), RAB14, RAB2A, and TIMM8B (translocase of inner mitochondrial membrane 8 homolog B).
Fig. 2.
SARS-CoV-2 VIPs under PS are interacting proteins of other coronaviruses, as well as other viral families. Virus–host protein–protein interaction network of VIP genes under PS and interconnected with (A) other coronaviruses (from alpha- or beta-coronavirus genus), and (B) viral families other than coronaviruses. VIPs interacting with more than one additional viral family are in the Center and arranged in columns (from Left to Right, interconnected with 2 to 6 different viral families). Node sizes at the virus families are proportional to the number of edges. The VIPs not interconnected are shown in SI Appendix, Table S1.
Fig. 3.
TMPRSS2 has evolved under strong PS in primates but not in bats. (A) Role of TMPRSS2 in SARS-CoV-2 entry. (B) Diagram of TMPRSS2 predicted domains, with sites under PS in primates represented by triangles (Table 1). Codon numbering and amino acid residue based on Homo sapiens TMPRSS2. (C) 3D structure modeling of human TMPRSS2 (amino acids 1 to 492) with the positively selected sites (red), the SARS-CoV-2 predicted interface (light blue), and the catalytic site (dark blue). (D) The positively selected sites identified in primate TMPRSS2 are highly variable in primates (Top) but more conserved in bats (Bottom) where they are not identified as under adaptive evolution. Left, cladograms of primate and bat TMPRSS2 with species abbreviation and accession number of sequences. Amino acid color-coding, RasMol properties (Geneious, Biomatters). Icon legend is embedded in the figure, with multicolored pictograms/triangles showing cases fulfilling multiple conditions. (E) Positively selected sites in primates exhibit different patterns of variability in other mammals, as follows: pangolin, carnivores, artiodactyls, and rodents. Right, numbers in brackets correspond to the number of species within the order with the same TMPRSS2 haplotype at these positions (e.g., the QSSKL motif in Mustela putorius was found in 14 rodent species). The corresponding motif in species/cells susceptible or permissive to coronaviruses is shown in SI Appendix, Fig. S8.
Fig. 4.
Domains of FYCO1 that are associated with severe COVID-19 in humans have also evolved under significant PS in primates but not in bats. (A) Known cellular role of FYCO1. (B) Diagram of FYCO1 predicted domains, with sites under PS in primates represented by triangles (Table 1). Codon numbering and amino acid residue based on Homo sapiens FYCO1. (C) Amino acid variation at the positively selected sites in primates. Left, cladogram of primate FYCO1 with major clades highlighted. The exact species and accession number of sequences are shown in E. Amino acid color-coding, RasMol properties (Geneious, Biomatters). (D) Sites identified in the coding sequence of FYCO1 as under PS in primates (Top) and as associated with severe COVID-19 in humans from two GWAS studies (Middle: GWAS1, COVID-19 Host Genetics Initiative, 2021; Bottom: GWAS2, Pairo-Castineira et al., 2020). x axis, nucleotide numbering. (E) Amino acid variations in primate species at the sites associated with severe COVID-19 in GWAS.
Fig. 5.
The multifunctional and inflammatory RIPK1 protein exhibits strong evidence of adaptation in bats at key regulatory residues. (A) Schematic diagram of the three main functions associated with human RIPK1 in TNF signaling. As part of the TNFR1-associated complex, RIPK1 induces prosurvival signals that notably lead to NFkB activation. When dissociating from this complex, as a result of multiple events involving both phosphorylation and ubiquitination, RIPK1 can associate with FADD and lead to apoptosis or necrosis. (B) Diagram of RIPK1 domains with the residues under PS in bats (black triangles) with the corresponding position and amino acid residue in human RIPK1 (Table 1). (C) 3D structure prediction of bat (Rhinolophus ferrumequinum) RIPK1, using RaptorX. The protein domains are color coded as in B. Residues under PS are in red and numbered according to their position in bat RIPK1. (D) The positively selected sites identified in bat RIPK1 are highly variable in bats (Top), but more conserved in primates (Bottom), where they are not identified as under adaptive evolution. Left, bat and primate RIPK1 with species abbreviation and accession number of sequences. Amino acid color coding, polarity properties (Geneious, Biomatters). The correspondence of residues from Rhinolophus ferrumequinum bat RIPK1 (gray) to human numbering (black) is shown at the Top. Detailed representation is shown in SI Appendix, Fig. S10.
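The Fig. 1C caption describes a gene as under significant PS when at least three methods in the DGINN screen support it, and sorts genes by whether this holds in bats, in primates, or in both. A minimal sketch of that sorting rule follows. The gene names, the per-gene method counts, and the `classify` helper are hypothetical placeholders for illustration; this is not the DGINN code or its output format, and the values are not data from the paper.

```python
# Hypothetical per-gene counts of methods supporting positive selection,
# stored as (bat_count, primate_count). Illustrative values only.
METHOD_COUNTS = {
    "GENE_A": (4, 5),
    "GENE_B": (3, 0),
    "GENE_C": (1, 4),
    "GENE_D": (2, 2),
}

# "At least three methods", as stated in the Fig. 1C caption.
PS_THRESHOLD = 3


def classify(counts, threshold=PS_THRESHOLD):
    """Group genes by the host order(s) in which they meet the threshold."""
    categories = {
        "both": [],
        "bats_only": [],
        "primates_only": [],
        "neither": [],
    }
    for gene, (bat, primate) in sorted(counts.items()):
        in_bats = bat >= threshold
        in_primates = primate >= threshold
        if in_bats and in_primates:
            key = "both"
        elif in_bats:
            key = "bats_only"
        elif in_primates:
            key = "primates_only"
        else:
            key = "neither"
        categories[key].append(gene)
    return categories


result = classify(METHOD_COUNTS)
for category, genes in result.items():
    print(category, genes)
```

The threshold is kept as a parameter so the same sketch can show how category sizes shift under a stricter or looser cutoff; the conservative counts in the caption additionally rely on visual inspection of the alignments, which a rule like this cannot reproduce.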
