Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb:312:114648.
doi: 10.1016/j.jviromet.2022.114648. Epub 2022 Nov 8.

SpikeSeq: A rapid, cost efficient and simple method to identify SARS-CoV-2 variants of concern by Sanger sequencing part of the spike protein gene

Affiliations

SpikeSeq: A rapid, cost efficient and simple method to identify SARS-CoV-2 variants of concern by Sanger sequencing part of the spike protein gene

Tue Sparholt Jørgensen et al. J Virol Methods. 2023 Feb.

Abstract

In 2020, the novel coronavirus, SARS-CoV-2, caused a pandemic, which is still raging at the time of writing this. Here, we present results from SpikeSeq, the first published Sanger sequencing-based method for the detection of Variants of Concern (VOC) and key mutations, using a 1 kb amplicon from the recognized ARTIC Network primers. The proposed setup relies entirely on materials and methods already in use in diagnostic RT-qPCR labs and on existing commercial infrastructure offering sequencing services. For data analysis, we provide an automated, open source, and browser-based mutation calling software (https://github.com/kblin/covid-spike-classification, https://ssi.biolib.com/covid-spike-classification). We validated the setup on 195 SARS-CoV-2 positive samples, and we were able to profile 85% of RT-qPCR positive samples, where the last 15% largely stemmed from samples with low viral count. We compared the SpikeSeq results to WGS results. SpikeSeq has been used as the primary variant identification tool on > 10.000 SARS-CoV-2 positive clinical samples during 2021. At approximately 4€ per sample in material cost, minimal hands-on time, little data handling, and a short turnaround time, the setup is simple enough to be implemented in any SARS-CoV-2 RT-qPCR diagnostic lab. Our protocol provides results that can be used to choose antibodies in a clinical setting and for the tracking and surveillance of all positive samples for new variants and known ones such as Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1) Delta (B.1.617.2), Omicron BA.1(B.1.1.529), BA.2, BA.4/5, BA.2.75.x, and many more, as of October 2022.

Keywords: BA.1; BA.2; BA.2.75.x; BA.4/5; COVID-19; CoV; Contact tracing; Coronavirus; Mutations; Omicron; Profiling; SARS-CoV-2; Sequencing; Spike protein; Surveillance; Variants of concern.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Profiling of selected SARS-CoV-2 variants based on key mutations from N440K to T716I in a single amplicon in the spike protein. Even without the near universal D614G mutation, all current and previous VOC variants have a unique mutation profile in this amplicon window, except for BA.4/5, which are commonly reported together. While not shown, the only mutation in the primer sequences in any VOC is on the likely inconsequential K417.
Fig. 2
Fig. 2
Workflow of SpikeSeq typing of SARS-CoV-2 variant mutations. Time estimates are based on a single 96 well PCR plate and no automation. The total hands-on time is one hour and 30 min, and only requires the set up of an RT-PCR from already extracted RNA from a diagnostics lab followed by transfer of 1.5 μl of the product to a new 96 well PCR plate. The plate is subsequently sent for Sanger sequencing at a commercial provider. The total cost per sample including plastware, enzymes, transport, and sequencing is approx. 4 €. After the raw data is received, data analysis takes less than five minutes for 96 samples. If Sanger sequencing is available in-house, the time to result could be less than six hours.
Fig. 3
Fig. 3
The sensitivity of the SpikeSeq method versus the Ct values of the diagnostic RT-qPCR test (n = 195). The number above each bar refers to the number of samples in that bin. Overall, Sanger sequencing data from 85% of samples were of sufficient quality for mapping. The non-template control is not depicted, as it did not produce a Sanger read that mapped to the reference genome. * : Ct values stem from a 2-step PCR where the first 7 cycles are not fluorescence registered, possibly meaning that the Sanger sequencing assay is even more sensitive than suggested by this figure.

References

    1. Andersen, C.Ø. et al. A major outbreak of COVID-19 at a residential care home. Dan. Med. J. 68(10):A03210227, 10 (2021). - PubMed
    1. Baker D.J., et al. CoronaHiT: high-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021;13:21. - PMC - PubMed
    1. Bezerra M.F., et al. A Sanger-based approach for scaling up screening of SARS-CoV-2 variants of interest and concern. Infect. Genet. Evol. 2021;92 - PMC - PubMed
    1. Bushnell B., Rood J., Singer E. BBMerge – accurate paired shotgun read merging via overlap. PLOS ONE. 2017;12 - PMC - PubMed
    1. Daniels R.S., et al. A Sanger sequencing protocol for SARS-CoV-2 S-gene. Influenza Other Respir. Virus. 2021;15:707–710. - PMC - PubMed

Publication types

Substances

Supplementary concepts