Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 25;16(1):61.
doi: 10.1186/s13073-024-01334-3.

INSaFLU-TELEVIR: an open web-based bioinformatics suite for viral metagenomic detection and routine genomic surveillance

Collaborators, Affiliations

INSaFLU-TELEVIR: an open web-based bioinformatics suite for viral metagenomic detection and routine genomic surveillance

João Dourado Santos et al. Genome Med. .

Abstract

Background: Implementation of clinical metagenomics and pathogen genomic surveillance can be particularly challenging due to the lack of bioinformatics tools and/or expertise. In order to face this challenge, we have previously developed INSaFLU, a free web-based bioinformatics platform for virus next-generation sequencing data analysis. Here, we considerably expanded its genomic surveillance component and developed a new module (TELEVIR) for metagenomic virus identification.

Results: The routine genomic surveillance component was strengthened with new workflows and functionalities, including (i) a reference-based genome assembly pipeline for Oxford Nanopore technologies (ONT) data; (ii) automated SARS-CoV-2 lineage classification; (iii) Nextclade analysis; (iv) Nextstrain phylogeographic and temporal analysis (SARS-CoV-2, human and avian influenza, monkeypox, respiratory syncytial virus (RSV A/B), as well as a "generic" build for other viruses); and (v) algn2pheno for screening mutations of interest. Both INSaFLU pipelines for reference-based consensus generation (Illumina and ONT) were benchmarked against commonly used command line bioinformatics workflows for SARS-CoV-2, and an INSaFLU snakemake version was released. In parallel, a new module (TELEVIR) for virus detection was developed, after extensive benchmarking of state-of-the-art metagenomics software and following up-to-date recommendations and practices in the field. TELEVIR allows running complex workflows, covering several combinations of steps (e.g., with/without viral enrichment or host depletion), classification software (e.g., Kaiju, Kraken2, Centrifuge, FastViromeExplorer), and databases (RefSeq viral genome, Virosaurus, etc.), while culminating in user- and diagnosis-oriented reports. Finally, to potentiate real-time virus detection during ONT runs, we developed findONTime, a tool aimed at reducing costs and the time between sample reception and diagnosis.

Conclusions: The accessibility, versatility, and functionality of INSaFLU-TELEVIR are expected to supply public and animal health laboratories and researchers with a user-oriented and pan-viral bioinformatics framework that promotes a strengthened and timely viral metagenomic detection and routine genomics surveillance. INSaFLU-TELEVIR is compatible with Illumina, Ion Torrent, and ONT data and is freely available at https://insaflu.insa.pt/ (online tool) and https://github.com/INSaFLU (code).

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Architecture of the INSaFLU-TELEVIR platform, summarizing the implemented analytical modules for viral metagenomic detection (TELEVIR) and routine genomic surveillance (INSaFLU and NextStrain)
Fig. 2
Fig. 2
Simplified illustration of the main steps of the modular INSaFLU-TELEVIR bioinformatics pipeline for metagenomics virus detection (TELEVIR). Documentation for each step is provided at the website (https://insaflu.insa.pt)
Fig. 3
Fig. 3
Simplified illustration of the benchmark of the virus identification pipeline (TELEVIR) module components, which is described in detail in Additional file 1. A Tree representation of module combinations. From left to right, sections represent pipeline steps (exemplified for Illumina) as followed at runtime: (1) Quality Control, (2) Viral Enrichment, (3) Assembly, (4) Contig Classification, (5) Read Classification. Nodes represent software, parameters, or databases compared. Color gradient corresponds to the product of four assessment statistics: mapped reads proportion, horizontal coverage, true positive rate, and completeness (proportion of hits with both read and contig evidence). Statistics were standardized by their respective maxima. B Heatmap representation of software benchmarked for Illumina samples, parameters not discriminated, color code at the bottom. C Table of individual statistics for each node, standardized across samples as in A. For panels A and C, darker colors = lower values, lighter colors = higher values
Fig. 4
Fig. 4
Snapshot of dashboard reports of the INSaFLU-TELEVIR bioinformatics module for metagenomics virus detection. Interactive examples are available at the https://insaflu.insa.pt/ [22] through an open “demo” account
Fig. 5
Fig. 5
Rapid, robust, and cost-efficient diagnostics using findONTime in combination with MinION sequencing. A simulated scenario of hypothesis-free ONT sequencing using data from a MPXV-positive sample, prepared without prior viral enrichment/host depletion. The plot shows the number of reads mapping to a MPXV reference genome and the percentage of horizontal coverage at increasing time points, during the sequencing run. Reference genome identified with over 50% coverage after 20 min. Contigs mapped at the 40-min mark. Strong evidence (mapped contigs; > 90% reference genome covered by at least one read) is achieved in under 2 h (1 h 20 min)

Similar articles

Cited by

  • Improving the reporting of metagenomic virome-scale data.
    Chang WS, Harvey E, Mahar JE, Firth C, Shi M, Simon-Loriere E, Geoghegan JL, Wille M. Chang WS, et al. Commun Biol. 2024 Dec 20;7(1):1687. doi: 10.1038/s42003-024-07212-3. Commun Biol. 2024. PMID: 39706917 Free PMC article.
  • Molecular and phylogenetic analysis of influenza A and B viruses circulating in Sri Lanka following the COVID-19 pandemic.
    Perera T, Bowatte A, Perera S, Rathnayaka D, Francis V, Arunasalam S, Abeywardana S, Noordeen F, Sumathipala S, Muthugala R. Perera T, et al. Virus Genes. 2025 Aug;61(4):444-454. doi: 10.1007/s11262-025-02161-3. Epub 2025 Apr 25. Virus Genes. 2025. PMID: 40279032
  • Preparedness, prevention and control related to zoonotic avian influenza.
    EFSA Panel on Animal Health and Animal Welfare (AHAW); ECDC; Alvarez J, Boklund A, Dippel S, Dórea F, Figuerola J, Herskin MS, Michel V, Miranda Chueca MÁ, Nannoni E, Nielsen SS, Nonno R, Riber AB, Stegeman JA, Ståhl K, Thulke HH, Tuyttens F, Winckler C, Brugerolles C, Wolff T, Parys A, Lindh E, Latorre-Margalef N, Rameix Welti MA, Dürrwald R, Trebbien R, Van der Werf S, Gisslén M, Monne I, Fusaro A, Guinat C, Bortolami A, Alexakis L, Enkirch T, Svartstrom O, Willgert K, Baldinelli F, Preite L, Grant M, Broglia A, Melidou A. EFSA Panel on Animal Health and Animal Welfare (AHAW), et al. EFSA J. 2025 Jan 29;23(1):e9191. doi: 10.2903/j.efsa.2025.9191. eCollection 2025 Jan. EFSA J. 2025. PMID: 39882189 Free PMC article.
  • Role of artificial intelligence in advancing immunology.
    Alanazi HH. Alanazi HH. Immunol Res. 2025 Apr 24;73(1):76. doi: 10.1007/s12026-025-09632-7. Immunol Res. 2025. PMID: 40272607 Review.
  • Viral genetics and transmission dynamics in the second wave of mpox outbreak in Portugal and forecasting public health scenarios.
    Cordeiro R, Caetano CP, Sobral D, Ferreira R, Coelho L, Pelerito A, de Carvalho IL, Namorado S, Loyens DB, Mexia R, Fernandes C, Neves JM, João AL, Rocha M, Duque LM, Correia I, Baptista T, Brazão C, Sousa D, Filipe P, Alpalhão M, Maltez F, Póvoas D, Pinto R, Caria J, Patrocínio de Jesus R, Pacheco P, Peruzzu F, Méndez J, Ferreira L, Mansinho K, Alves JV, Vasconcelos J, Domingos J, Casanova S, Duarte F, Gonçalves MJ, Salvador MB, Guimarães MA, Martins S, Oliveira MS, Santos D, Vieira L, Núncio MS, Borges V, Gomes JP. Cordeiro R, et al. Emerg Microbes Infect. 2024 Dec;13(1):2412635. doi: 10.1080/22221751.2024.2412635. Epub 2024 Oct 11. Emerg Microbes Infect. 2024. PMID: 39360827 Free PMC article.

References

    1. Struelens MJ, Brisse S. From molecular to genomic epidemiology: transforming surveillance and control of infectious diseases. Eurosurveillance. 2013;18:20386. doi: 10.2807/ese.18.04.20386-en. - DOI - PubMed
    1. European Centre for Disease Prevention and Control (ECDC) Expert opinion on whole genome sequencing for public health surveillance. Stockholm: ECDC; 2016.
    1. Eyre DW. Infection prevention and control insights from a decade of pathogen whole-genome sequencing. J Hosp Infect. 2022;122:180–186. doi: 10.1016/j.jhin.2022.01.024. - DOI - PMC - PubMed
    1. Chen Z, Azman AS, Chen X, Zou J, Tian Y, Sun R, et al. Global landscape of SARS-CoV-2 genomic surveillance and data sharing. Nat Genet. 2022;54:499–507. doi: 10.1038/s41588-022-01033-y. - DOI - PMC - PubMed
    1. Gardy JL, Loman NJ. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet. 2018;19:9–20. doi: 10.1038/nrg.2017.88. - DOI - PMC - PubMed

Publication types