Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 4;121(23):e2314518121.
doi: 10.1073/pnas.2314518121. Epub 2024 May 31.

Biophysical principles predict fitness of SARS-CoV-2 variants

Affiliations

Biophysical principles predict fitness of SARS-CoV-2 variants

Dianzhuo Wang et al. Proc Natl Acad Sci U S A. .

Abstract

SARS-CoV-2 employs its spike protein's receptor binding domain (RBD) to enter host cells. The RBD is constantly subjected to immune responses, while requiring efficient binding to host cell receptors for successful infection. However, our understanding of how RBD's biophysical properties contribute to SARS-CoV-2's epidemiological fitness remains largely incomplete. Through a comprehensive approach, comprising large-scale sequence analysis of SARS-CoV-2 variants and the identification of a fitness function based on binding thermodynamics, we unravel the relationship between the biophysical properties of RBD variants and their contribution to viral fitness. We developed a biophysical model that uses statistical mechanics to map the molecular phenotype space, characterized by dissociation constants of RBD to ACE2, LY-CoV016, LY-CoV555, REGN10987, and S309, onto an epistatic fitness landscape. We validate our findings through experimentally measured and machine learning (ML) estimated binding affinities, coupled with infectivity data derived from population-level sequencing. Our analysis reveals that this model effectively predicts the fitness of novel RBD variants and can account for the epistatic interactions among mutations, including explaining the later reversal of Q493R. Our study sheds light on the impact of specific mutations on viral fitness and delivers a tool for predicting the future epidemiological trajectory of previously unseen or emerging low-frequency variants. These insights offer not only greater understanding of viral evolution but also potentially aid in guiding public health decisions in the battle against COVID-19 and future pandemics.

Keywords: SARS-CoV-2; antibody; fitness landscape; receptor binding domain; viral evolution.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Model illustration. Link between fitness and thermodynamics of protein folding and binding. High fitness variants may exhibit improved stability in the folded state or in the ACE2-bound state to facilitate cellular entry or have the capacity to destabilize bound-to-antibody states, thereby enabling evasion.
Fig. 2.
Fig. 2.
Biophysical model analysis. (A) Fit of the predictive model on the training set. Each dot represents a variant in the training dataset, plotted against the fitness derived from population data on the x-axis and the predicted fitness by the model on the y-axis (R2=0.97). (B) Model’s performance on the testing set. The model’s predictions align well with the fitness observed in population, as reflected by R2=0.94 suggesting that the model maintains a strong predictive power on unseen data. (CG) The dependence of fitness function on the logarithm of each dissociation constant. Existing variants are depicted as yellow regions on the curve, and a histogram showing the distribution of the dissociation constants is provided beneath each plot. Red and blue dashed lines represent wt and BA.1 respectively.
Fig. 3.
Fig. 3.
Assessing predictive power of the model using experimental KD (AC) and computed KD (DF). (A) Fitness prediction for variants carrying the G446S mutation which was not included in the training set yields R2=0.92. (B) R2 derived from a model trained on variants excluding a specific mutation, then used to predict fitness of variants exhibiting that mutation. (C) Predictions of fitness compared with actual fitness trend for variants between Wuhan-Hu-1 and Omicron BA.1. Variants observed before May 2021 are used as training set for the model. The model uses experimental KD. (D) Predicted fitness from biophysical model against actual fitness derived from population data. The model uses KD acquired from ML and is fit on Wuhan-Omicron set. Selected variants are highlighted. (E) Predicted fitness compared with actual fitness for variants observed in 2020, 2021, and 2022. (F) Predictions of fitness using ML derived KD over three-month rolling windows, compared with the actual fitness trends for variants between 2021 and 2023.
Fig. 4.
Fig. 4.
Epistasis analysis. (A) Predicted fitness values with the nonepistatic model of Obermeyer et al. and with our epistatic biophysical model plotted against the genome’s mutation count, for all mutation combinations with mutation T478K. We define “Max Fitness” as the maximum fitness prediction from our biophysical model. Max Fitness curve begins to plateau with a higher mutation count, demonstrating a diminishing returns effect in epistatic. (B) Pairwise (second order) interaction coefficients against the spatial distances between the corresponding residues, with mutations colored in accordance with the absolute value of their pairwise coefficient. (C) Coefficients of epistasis: Diagonal coefficients denote first-order interactions, whereas off-diagonal coefficients represent second-order interactions. Coefficients smaller than 0.01 have been masked for clarity.

Update of

Comment in

  • The biophysical landscape of viral evolution.
    Wilke CO. Wilke CO. Proc Natl Acad Sci U S A. 2024 Jul 2;121(27):e2409667121. doi: 10.1073/pnas.2409667121. Epub 2024 Jun 24. Proc Natl Acad Sci U S A. 2024. PMID: 38913906 Free PMC article. No abstract available.

References

    1. Deng X., et al. , Transmission, infectivity, and neutralization of a spike L452R SARS-CoV-2 variant. Cell 184, 3426–3437.e8 (2021). - PMC - PubMed
    1. Leung K., Shum M. H., Leung G. M., Lam T. T., Wu J. T., Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom, October to November 2020. Eurosurveillance 26, 2002106 (2021). - PMC - PubMed
    1. Cele S., et al. , Escape of SARS-CoV-2 501Y.V2 from neutralization by convalescent plasma. Nature 593, 142–146 (2021). - PMC - PubMed
    1. Upadhyay V., Lucas A., Panja S., Miyauchi R., Mallela K. M., Receptor binding, immune escape, and protein stability direct the natural selection of SARS-CoV-2 variants. J. Biol. Chem. 297, 101208 (2021). - PMC - PubMed
    1. Letko M., Marzi A., Munster V., Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses. Nat. Microbiol. 5, 562–569 (2020). - PMC - PubMed

MeSH terms

Substances

Supplementary concepts