Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 20;10(1):14031.
doi: 10.1038/s41598-020-70827-z.

Evolutionary and structural analyses of SARS-CoV-2 D614G spike protein mutation now documented worldwide

Affiliations

Evolutionary and structural analyses of SARS-CoV-2 D614G spike protein mutation now documented worldwide

Sandra Isabel et al. Sci Rep. .

Abstract

The COVID-19 pandemic, caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), was declared on March 11, 2020 by the World Health Organization. As of the 31st of May, 2020, there have been more than 6 million COVID-19 cases diagnosed worldwide and over 370,000 deaths, according to Johns Hopkins. Thousands of SARS-CoV-2 strains have been sequenced to date, providing a valuable opportunity to investigate the evolution of the virus on a global scale. We performed a phylogenetic analysis of over 1,225 SARS-CoV-2 genomes spanning from late December 2019 to mid-March 2020. We identified a missense mutation, D614G, in the spike protein of SARS-CoV-2, which has emerged as a predominant clade in Europe (954 of 1,449 (66%) sequences) and is spreading worldwide (1,237 of 2,795 (44%) sequences). Molecular dating analysis estimated the emergence of this clade around mid-to-late January (10-25 January) 2020. We also applied structural bioinformatics to assess the potential impact of D614G on the virulence and epidemiology of SARS-CoV-2. In silico analyses on the spike protein structure suggests that the mutation is most likely neutral to protein function as it relates to its interaction with the human ACE2 receptor. The lack of clinical metadata available prevented our investigation of association between viral clade and disease severity phenotype. Future work that can leverage clinical outcome data with both viral and human genomic diversity is needed to monitor the pandemic.

PubMed Disclaimer

Conflict of interest statement

Dr. Gubbay has received research Grants from GlaxoSmithKline Inc. and Hoffman-La Roche Ltd to study antiviral resistance in influenza, and from Pfizer Inc. to conduct microbiological surveillance of Streptococcus pneumoniae. These activities are not relevant to this study. Dr. Poutanen has received honoraria from Merck related to advisory boards and talks, honoraria from Verity, Cipher, and Paladin Labs related to advisory boards, partial conference travel reimbursement from Copan, and research support from Accelerate Diagnostics and bioMerieux, all outside the submitted work. All other authors declared that they have no competing interests.

Figures

Figure 1
Figure 1
Global Distribution of SARS-CoV-2 Genome Sequences Possessing the Spike Protein D614G Mutation. G mutation as percentage of total sequences (% G) is represented with color shades as detailed in legend including data available as of (A) 17 March and (B) 30 March 2020. Hatched lines were added when less than 10 sequences were available for one country. The maps were built with the geographic information system QGIS (v2.18.21, https://qgis.org).
Figure 2
Figure 2
Estimated Molecular Dating of Evolutionary History of 442 Representative Global SARS-CoV-2 Sequences (Late-December 2019–Mid-March 2020) and the Emergence of the D614G Clade. Maximum clade credibility (MCC) tree with dated branches estimated by Bayesian Evolutionary Analysis Sampling Trees (BEAST). Node colors indicate continents of isolation; x-axis indicating dates by year and days in decimal notation; D614G clade sequences are highlighted in a yellow box.
Figure 3
Figure 3
Structural analysis of SARS-CoV-2 spike protein around position 614. (A) Location and distribution of SARS-CoV-2 viral proteins. The full trimeric form of the spike protein results from a complex of three identical spike monomers (right panel). (B) Three-dimensional depiction of a spike protein monomer. The receptor-binding domain is colored purple and the location of the aspartate residue in position 614 is highlighted in green. (C) Inter-atomic contacts between aspartate 614 (green) in a reference spike monomer (blue) and four residues (pink) in its adjacent spike protein monomer chain (white). These four contacts are destabilizing and create a hydrophilic-hydrophobic repelling effect that is lost upon replacement of aspartate by glycine in the D614G mutation (see Table 1). (D) Spatial distribution of aspartate 614 residue (green) and an adjacent glycosylated asparagine residue in position 616 (orange). The two residues point in opposite directions and thus it is unlikely they share a meaningful interaction. The image (A) was drawn using Affinity Designer (v1.8) (https://affinity.serif.com/en-gb/designer/). The trimeric and monomeric structures of the Spike protein were generated using Illustrate, (https://ccsb.scripps.edu/illustrate/) by rendering a protein structure from the Protein Data Bank with ID 6vsb (https://www.rcsb.org/structure/6vsb). The image (BD) was generated using UCSF Chimera (v1.14) (https://www.cgl.ucsf.edu/chimera/) with monomeric protein structure rendered in Chimera .
Figure 4
Figure 4
SARS-CoV-2 PCR Cycle threshold (Ct) values of different clinical samples plotted according to variant D (black dots) and G (white squares) at the position 614 in the spike protein. Dots represent individual Ct values; horizontal lines represent the mean and standard deviation.

References

    1. Zhu N, et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. - DOI - PMC - PubMed
    1. Phan T. Genetic diversity and evolution of SARS-CoV-2. Infect. Genet. Evol. 2020;81:104260. doi: 10.1016/j.meegid.2020.104260. - DOI - PMC - PubMed
    1. Zhang Y-Z, Holmes EC. A genomic perspective on the origin and emergence of SARS-CoV-2. Cell. 2020;1:1–5. doi: 10.1016/j.cell.2020.03.035. - DOI - PMC - PubMed
    1. Andersen, K. G., Rambaut, A., Lipkin, W. I., Holmes, E. C. & Garry, R. F. The proximal origin of SARS-CoV-2. Nat. Med. 2–4 (2020) doi:10.1038/s41591-020-0820-9. - PMC - PubMed
    1. Benvenuto D, et al. The global spread of 2019-nCoV: a molecular evolutionary analysis. Pathog. Glob. Health. 2020;114:64–67. doi: 10.1080/20477724.2020.1725339. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances