Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 12:27:1081-1088.
doi: 10.1016/j.csbj.2025.03.010. eCollection 2025.

VirusWarn: A mutation-based early warning system to prioritize concerning SARS-CoV-2 and influenza virus variants from sequencing data

Affiliations

VirusWarn: A mutation-based early warning system to prioritize concerning SARS-CoV-2 and influenza virus variants from sequencing data

Christina Kirschbaum et al. Comput Struct Biotechnol J. .

Abstract

The rapid evolution of respiratory viruses is characterized by the emergence of variants with concerning phenotypes that are efficient in antibody escape or show high transmissibility. This necessitates timely identification of such variants by surveillance networks to assist public health interventions. Here, we introduce VirusWarn, a comprehensive system designed for detecting, prioritizing, and warning of emerging virus variants from large genomic datasets. VirusWarn uses both manually-curated rules and machine-learning (ML) classifiers to generate and rank pathogen sequences based on mutations of concern and regions of interest. Validation results for SARS-CoV-2 showed that VirusWarn successfully identifies variants of concern in both assessments, with manual- and ML-derived criteria from positive selection analyses. Although initially developed for SARS-CoV-2, VirusWarn was adapted to Influenza viruses and their dynamics, and provides a robust performance, integrating a scheme that accounts for fixed mutations from past seasons. HTML reports provide detailed results with searchable tables and visualizations, including mutation plots and heatmaps. Because VirusWarn is written in Nextflow, it can be easily adapted to other pathogens, demonstrating its flexibility and scalability for genomic surveillance efforts.

Keywords: Genomic surveillance; Influenza virus; SARS-CoV-2; Variant prioritization; Warning system.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
(a) VirusWarn steps: assembled genome sequences and metadata (sampling dates, geographical locations) are used as query samples, which are pre-processed to infer mutation profiles for the monitored proteins. This is achieved either by aligning the sequences to a reference genome (e.g., in covSonar [27]) or via direct database queries (e.g., in Nextclade [28]). The mutation profiles are then ranked according to their concern level and results for query samples are summarized in interactive HTML reports using Jinja2 or RMarkdown . (b) Frequency heatmaps show mutation frequencies over time (calendar weeks on the x axis). A histogram showing the number of sequences per calendar week is shown at the bottom.
Fig. 2
Fig. 2
Steps during annotation and ranking by VirusWarn. (a) Selected proteins to monitor (blue) for SARS-CoV-2 and Influenza. (b) Example annotations and genomic variation for a monitored protein. Annotations used for scoring: Mutation of Concern (MOC, yellow), Region of Interest (ROI, green), and Private Mutation (PM, blue). Genomic variation types (pink): Substitution, Deletion, or Insertion. (c) Example mutation profile: counts of variation types in the different annotation groups (MOC, ROI, PM). (d) Ranking to alert color levels: high impact (pink/red), medium impact (orange), low impact (yellow), and no alert (grey).
Fig. 3
Fig. 3
SARS-CoV-2 partial spike protein (the first 650 amino acids, incl. receptor-binding sites) The MOC, ROI, and PM color annotations appear as in Fig. 2), and are contrasted to sites under positive selection (PS, blue bars on top) and lineage-defining mutations for selected VOCs from the ECDC table (red bars).

Similar articles

References

    1. Gardy J.L., Loman N.J. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet. 2018;19(1):9–20. doi: 10.1038/nrg.2017.88. - DOI - PMC - PubMed
    1. Wouters O.J., Shadlen K.C., Salcher-Konrad M., Pollard A.J., Larson H.J., Teerawattananon Y., et al. Challenges in ensuring global access to COVID-19 vaccines: production, affordability, allocation, and deployment. Lancet. 2021;397(10278):1023–1034. doi: 10.1016/S0140-6736(21)00306-8. - DOI - PMC - PubMed
    1. https://data.undp.org/insights/vaccine-equity Global dashboard for vaccine equity |. Data Futures Exchange.
    1. World Health Organization. Global genomic surveillance strategy for pathogens with pandemic and epidemic potential, 2022–2032. World Health Assembly, Geneva. 2022. - PMC - PubMed
    1. Subissi L., von Gottberg A., Thukral L., Worp N., Oude Munnink B.B., Rathore S., et al. An early warning system for emerging SARS-CoV-2 variants. Nat Med. 2022;28(6):1110–1115. doi: 10.1038/s41591-022-01836-w. - DOI - PMC - PubMed

LinkOut - more resources