Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 4;21(1):863.
doi: 10.1186/s12864-020-07283-6.

A rapid, cost-effective tailed amplicon method for sequencing SARS-CoV-2

Affiliations

A rapid, cost-effective tailed amplicon method for sequencing SARS-CoV-2

Daryl M Gohl et al. BMC Genomics. .

Abstract

Background: The global COVID-19 pandemic has led to an urgent need for scalable methods for clinical diagnostics and viral tracking. Next generation sequencing technologies have enabled large-scale genomic surveillance of SARS-CoV-2 as thousands of isolates are being sequenced around the world and deposited in public data repositories. A number of methods using both short- and long-read technologies are currently being applied for SARS-CoV-2 sequencing, including amplicon approaches, metagenomic methods, and sequence capture or enrichment methods. Given the small genome size, the ability to sequence SARS-CoV-2 at scale is limited by the cost and labor associated with making sequencing libraries.

Results: Here we describe a low-cost, streamlined, all amplicon-based method for sequencing SARS-CoV-2, which bypasses costly and time-consuming library preparation steps. We benchmark this tailed amplicon method against both the ARTIC amplicon protocol and sequence capture approaches and show that an optimized tailed amplicon approach achieves comparable amplicon balance, coverage metrics, and variant calls to the ARTIC v3 approach.

Conclusions: The tailed amplicon method we describe represents a cost-effective and highly scalable method for SARS-CoV-2 sequencing.

Keywords: COVID-19; Genome sequencing; SARS-CoV-2; Viral surveillance.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Methods for SARS-CoV-2 genome sequencing compared in this study. a In Illumina’s Nextera DNA Flex Enrichment protocol cDNA is tagmented and made into barcoded sequencing libraries, which are then enriched using sequence capture with a respiratory virus panel containing probes against SARS-CoV-2. b In the ARTIC protocol, first strand cDNA is enriched by amplifying with two pools of primers to generate amplicons tiling the SARS-CoV-2 genome. These amplicons are then subjected to either Illumina or Oxford Nanopore library preparation, using methods that either directly add adapters to the ends of the amplicons or to fragment them to enable sequencing on a wider variety of Illumina instruments. c The tailed amplicon approach, developed here, enriches first strand cDNA using ARTIC v3 primers containing adapter tails. This allows functional sequencing libraries to be created through a second indexing PCR reaction that adds sample-specific barcodes and flow cell adapters
Fig. 2
Fig. 2
Comparison of sequence capture, ARTIC v3 amplicon, and tailed amplicon workflows on SARS-CoV-2 isolate. a Percentage of the BEI WA1 isolate genome coverage at 10x at different subsampled read depths when sequenced with the indicated approach. b Percent of the BEI WA1 isolate genome coverage at 100x at different subsampled read depths when sequenced with the indicated approach. c Observed read depth for each of the expected amplicons for the BEI WA1 isolate amplified with the ARTIC v3 protocol at a subsampled read depth of 100,000 raw reads. d Observed read depth for each of the expected amplicons for the BEI WA1 isolate amplified with the tailed amplicon v1 (2 pool amplification) protocol at a subsampled read depth of 100,000 raw reads. e Observed read depth for each of the expected amplicons for the BEI WA1 isolate amplified with the tailed amplicon v2 protocol (4 pool amplification) at a subsampled read depth of 100,000 raw reads. f Variants detected for the BEI WA1 isolate at a read depth of up to 1,000,000 raw reads (or the maximum read depth for the sample if below 1,000,000 reads). The positions of all variants detected in this study are shown and bases where the sample matches the Wuhan-Hu-1 reference are shown in grey
Fig. 3
Fig. 3
Comparison of sequence capture, ARTIC v3 amplicon, and tailed amplicon workflows on clinical specimens spanning a range of viral loads. a Samples with N1 and N2 Ct values ranging from approximately 20–35 chosen for testing of SARS-CoV-2 sequencing workflows. Samples are colored as in panels c-f. b Evenness of representation of amplicons for different workflows as a function of sample N1 Ct value. Percentage of genome coverage at 100x at different subsampled read depths for each sample when sequenced using the following approaches: c Illumina Nextera DNA Enrichment; d ARTIC v3 with TruSeq library preparation. e Tailed amplicon v1 (2 pool amplification); f Tailed amplicon v2 (4 pool amplification)
Fig. 4
Fig. 4
Variants detected using different sequencing workflows. Variants detected for the indicated sample and sequencing protocol at a read depth of up to 1,000,000 raw reads (or the maximum read depth for the sample if below 1,000,000 reads). The positions of all variants detected in this study are shown and bases where the sample matches the Wuhan-Hu-1 reference shown in grey

References

    1. Bedford T, Riley S, Barr IG, Broor S, Chadha M, Cox NJ, et al. Global circulation patterns of seasonal influenza viruses vary with antigenic drift. Nature. 2015;523:217–220. doi: 10.1038/nature14460. - DOI - PMC - PubMed
    1. Quick J, Grubaugh ND, Pullan ST, Claro IM, Smith AD, Gangavarapu K, et al. Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat Protoc. 2017;12:1261–1266. doi: 10.1038/nprot.2017.066. - DOI - PMC - PubMed
    1. Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, De Jesus JG, Main BJ, et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019;20:8. doi: 10.1186/s13059-018-1618-7. - DOI - PMC - PubMed
    1. Di Paola N, Sanchez-Lockhart M, Zeng X, Kuhn JH, Palacios G. Viral genomics in Ebola virus research. Nat Rev Microbiol. 2020:1–14. 10.1038/s41579-020-0354-7. - PMC - PubMed
    1. Wu F, Zhao S, Yu B, Chen YM, Wang W, Song ZG, et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. - DOI - PMC - PubMed