SGV-caller: SARS-CoV-2 genome variation caller
- PMID: 40040978
- PMCID: PMC11876889
- DOI: 10.1016/j.heliyon.2025.e42613
SGV-caller: SARS-CoV-2 genome variation caller
Abstract
Given the pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), continuous analysis of its genomic variations at the nucleotide level is imperative to monitor the emergence of novel variants of concern. The Global Initiative on Sharing All Influenza Data (GISAID) serves as the de facto standard database for the genomic information of SARS-CoV-2. However, limitations of its data-sharing policy hinder the comprehensive analysis of genomic variations. To address this problem, we developed SGV-caller, a bioinformatics pipeline for analyzing the frequently updated GISAID database. SGV-caller compares input datasets with pre-existing databases and generates local databases encompassing nucleotide, amino acid, and codon-level genomic variations for each SARS-CoV-2 genome. Furthermore, SGV-caller accommodates SARS-CoV-2 genomes from non-GISAID sources as well as other viral genomes. SGV-caller source code and test data are available at https://github.com/wujiaqi06/SGV-caller.
Keywords: Bioinformatics pipeline; GISAID; Genome surveillance; Mutations; Viral genomes.
© 2025 The Authors.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures

Similar articles
-
CoV-Seq, a New Tool for SARS-CoV-2 Genome Analysis and Visualization: Development and Usability Study.J Med Internet Res. 2020 Oct 2;22(10):e22299. doi: 10.2196/22299. J Med Internet Res. 2020. PMID: 32931441 Free PMC article.
-
CovidPhy: A tool for phylogeographic analysis of SARS-CoV-2 variation.Environ Res. 2022 Mar;204(Pt A):111909. doi: 10.1016/j.envres.2021.111909. Epub 2021 Aug 20. Environ Res. 2022. PMID: 34419470 Free PMC article.
-
SARS-CoV-2 HaploGraph: visualization of SARS-CoV-2 haplotype spread in Japan.Genes Genet Syst. 2023 Nov 21;98(5):221-237. doi: 10.1266/ggs.23-00085. Epub 2023 Oct 14. Genes Genet Syst. 2023. PMID: 37839865
-
Efficient compression of SARS-CoV-2 genome data using Nucleotide Archival Format.Patterns (N Y). 2022 Sep 9;3(9):100562. doi: 10.1016/j.patter.2022.100562. Epub 2022 Jul 7. Patterns (N Y). 2022. PMID: 35818472 Free PMC article. Review.
-
Global SARS-CoV-2 genomic surveillance: What we have learned (so far).Infect Genet Evol. 2023 Mar;108:105405. doi: 10.1016/j.meegid.2023.105405. Epub 2023 Jan 18. Infect Genet Evol. 2023. PMID: 36681102 Free PMC article. Review.
References
-
- Gangavarapu K., Latif A.A., Mullen J.L., Alkuzweny M., Hufbauer E., Tsueng G., et al. Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations. Nat. Methods. 2023;20(4):512–522. doi: 10.1038/s41592-023-01769-3. Epub 2023 Feb 23. - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous