Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 May:531:141-148.
doi: 10.1016/j.virol.2019.03.006. Epub 2019 Mar 9.

A complete protocol for whole-genome sequencing of virus from clinical samples: Application to coronavirus OC43

Affiliations

A complete protocol for whole-genome sequencing of virus from clinical samples: Application to coronavirus OC43

Florence Maurier et al. Virology. 2019 May.

Abstract

Genome sequencing of virus has become a useful tool for better understanding of virus pathogenicity and epidemiological surveillance. Obtaining virus genome sequence directly from clinical samples is still a challenging task due to the low load of virus genetic material compared to the host DNA, and to the difficulty to get an accurate genome assembly. Here we introduce a complete sequencing and analyzing protocol called V-ASAP for Virus Amplicon Sequencing Assembly Pipeline. Our protocol is able to generate the viral dominant genome sequence starting from clinical samples. It is based on a multiplex PCR amplicon sequencing coupled with a reference-free analytical pipeline. This protocol was applied to 11 clinical samples infected with coronavirus OC43 (HcoV-OC43), and led to seven complete and two nearly complete genome assemblies. The protocol introduced here is shown to be robust, to produce a reliable sequence, and could be applied to other virus.

Keywords: Bioinformatics; Complete genome; Coronavirus; High-throughput sequencing.

PubMed Disclaimer

Figures

fx1
Graphical abstract
Fig. 1
Fig. 1
Schematic representation of the complete protocol for viral sequencing directly from clinical samples.
Fig. 2
Fig. 2
Difference observed between sequences generated from alignment combined with consensus extraction and full-amplicon based extraction.
Fig. 3
Fig. 3
Coronavirus genome amplification trials by RT-PCR for different fragment lengths. A: Capillary electrophoresis profiles of RT-PCR products of the obtained different fragments. B: Agarose gel-like profiles. L: ladder with the sizes alongside (in bp); a to j: RT-PCR product from ~500p to ~5000 bp with an increment-step of ~500 bp.
Fig. 4
Fig. 4
Study of the cluster size distribution for the 99 amplicons (PCR01 to PCR99) from the MDS4 sample. The top panel shows the number of reads for each amplicon (in black) and the number of reads in the biggest cluster (in gray). The bottom panel shows the proportion of each clusters for each amplicon. The proportion of the biggest cluster is in black and the proportion of clusters containing less than 1% of reads were summed up and are in gray.

References

    1. Archer J., Rambaut A., Taillon B.E., Harrigan P.R., Lewis M., Robertson D.L. The evolutionary analysis of emerging low frequency HIV-1 CXCR4 using variants through time--an ultra-deep approach. PLoS Comput. Biol. 2010;6:e1001022. doi: 10.1371/journal.pcbi.1001022. - DOI - PMC - PubMed
    1. Caboche S., Audebert C., Lemoine Y., Hot D. Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data. BMC Genom. 2014;15:264. - PMC - PubMed
    1. Cotten M., Lam T.T., Watson S.J., Palser A.L., Petrova V., Grant P., Pybus O.G., Rambaut A., Guan Y., Pillay D., Kellam P., Nastouli E. Full-genome deep sequencing and phylogenetic analysis of novel human betacoronavirus. Emerg. Infect. Dis. 2013;19 doi: 10.3201/eid1905.130057. (736–42B) - DOI - PMC - PubMed
    1. Dinwiddie D.L., Hardin O., Denson J.L., Kincaid J.C., Schwalm K.C., Stoner A.N., Abramo T.J., Thompson T.M., Putt C.M., Young S.A., Dehority W.N., Kennedy J.L. Complete genome sequences of four novel human coronavirus OC43 isolates associated with severe acute respiratory infection. Genome Announc. 2018;6 doi: 10.1128/genomeA.00452-18. (e00452-18) - DOI - PMC - PubMed
    1. Fu L., Niu B., Zhu Z., Wu S., Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28:3150–3152. - PMC - PubMed

Publication types

MeSH terms