The Easy-to-Use SARS-CoV-2 Assembler for Genome Sequencing: Development Study
- PMID: 35309411
- PMCID: PMC8924907
- DOI: 10.2196/31536
The Easy-to-Use SARS-CoV-2 Assembler for Genome Sequencing: Development Study
Abstract
Background: Early sequencing and quick analysis of the SARS-CoV-2 genome have contributed to the understanding of the dynamics of COVID-19 epidemics and in designing countermeasures at a global level.
Objective: Amplicon-based next-generation sequencing (NGS) methods are widely used to sequence the SARS-CoV-2 genome and to identify novel variants that are emerging in rapid succession as well as harboring multiple deletions and amino acid-changing mutations.
Methods: To facilitate the analysis of NGS sequencing data obtained from amplicon-based sequencing methods, here, we propose an easy-to-use SARS-CoV-2 genome assembler: the Easy-to-use SARS-CoV-2 Assembler (ESCA) pipeline.
Results: Our results have shown that ESCA could perform high-quality genome assembly from Ion Torrent and Illumina raw data and help the user in easily correct low-coverage regions. Moreover, ESCA includes the possibility of comparing assembled genomes of multisample runs through an easy table format.
Conclusions: In conclusion, ESCA automatically furnished a variant table output file, fundamental to rapidly recognizing variants of interest. Our pipeline could be a useful method for obtaining a complete, rapid, and accurate analysis even with minimal knowledge in bioinformatics.
Keywords: COVID-19; NGS data analysis; SARS-CoV-2 genome; algorithms; bioinformatic; bioinformatics tool; digital tools; genome; health informatics.
©Martina Rueca, Emanuela Giombini, Francesco Messina, Barbara Bartolini, Antonino Di Caro, Maria Rosaria Capobianchi, Cesare EM Gruber. Originally published in JMIR Bioinformatics and Biotechnology (https://bioinform.jmir.org), 14.03.2022.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
References
-
- World Health Organization (WHO) Genomic sequencing of SARS-CoV-2: a guide to implementation for maximum impact on public health. World Health Organization. 2021. Jan 08, [2022-02-22]. https://www.who.int/publications/i/item/9789240018440 .
-
- Greaney AJ, Loes AN, Crawford KH, Starr TN, Malone KD, Chu HY, Bloom JD. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe. 2021 Mar 10;29(3):463–476.e6. doi: 10.1016/j.chom.2021.02.003. http://europepmc.org/abstract/MED/33592168 S1931-3128(21)00082-2 - DOI - PMC - PubMed
-
- Smith GJD, Vijaykrishna D, Bahl J, Lycett SJ, Worobey M, Pybus OG, Ma SK, Cheung CL, Raghwani J, Bhatt S, Peiris JSM, Guan Y, Rambaut A. Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic. Nature. 2009 Jun 25;459(7250):1122–5. doi: 10.1038/nature08182.nature08182 - DOI - PubMed
-
- Revez J, Espinosa L, Albiger B, Leitmeyer KC, Struelens MJ, ECDC National Microbiology Focal Points and Experts Group Survey on the use of whole-genome sequencing for infectious diseases surveillance: rapid expansion of European National Capacities, 2015-2016. Front Public Health. 2017;5:347. doi: 10.3389/fpubh.2017.00347. doi: 10.3389/fpubh.2017.00347. - DOI - DOI - PMC - PubMed
-
- Nadon C, Van Walle I, Gerner-Smidt P, Campos J, Chinen I, Concepcion-Acevedo J, Gilpin B, Smith AM, Man Kam K, Perez E, Trees E, Kubota K, Takkinen J, Nielsen EM, Carleton H, FWD-NEXT Expert Panel PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance. Euro Surveill. 2017 Jun 08;22(23):1. doi: 10.2807/1560-7917.ES.2017.22.23.30544. http://www.eurosurveillance.org/ViewArticle.aspx?ArticleId=22807 30544 - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
