A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples
- PMID: 24899342
- PMCID: PMC4079973
- DOI: 10.1101/gr.171934.113
A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples
Abstract
Unbiased next-generation sequencing (NGS) approaches enable comprehensive pathogen detection in the clinical microbiology laboratory and have numerous applications for public health surveillance, outbreak investigation, and the diagnosis of infectious diseases. However, practical deployment of the technology is hindered by the bioinformatics challenge of analyzing results accurately and in a clinically relevant timeframe. Here we describe SURPI ("sequence-based ultrarapid pathogen identification"), a computational pipeline for pathogen identification from complex metagenomic NGS data generated from clinical samples, and demonstrate use of the pipeline in the analysis of 237 clinical samples comprising more than 1.1 billion sequences. Deployable on both cloud-based and standalone servers, SURPI leverages two state-of-the-art aligners for accelerated analyses, SNAP and RAPSearch, which are as accurate as existing bioinformatics tools but orders of magnitude faster in performance. In fast mode, SURPI detects viruses and bacteria by scanning data sets of 7-500 million reads in 11 min to 5 h, while in comprehensive mode, all known microorganisms are identified, followed by de novo assembly and protein homology searches for divergent viruses in 50 min to 16 h. SURPI has also directly contributed to real-time microbial diagnosis in acutely ill patients, underscoring its potential key role in the development of unbiased NGS-based clinical assays in infectious diseases that demand rapid turnaround times.
© 2014 Naccache et al.; Published by Cold Spring Harbor Laboratory Press.
Figures







Similar articles
-
16SPIP: a comprehensive analysis pipeline for rapid pathogen detection in clinical samples based on 16S metagenomic sequencing.BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):568. doi: 10.1186/s12859-017-1975-3. BMC Bioinformatics. 2017. PMID: 29297318 Free PMC article.
-
VIP: an integrated pipeline for metagenomics of virus identification and discovery.Sci Rep. 2016 Mar 30;6:23774. doi: 10.1038/srep23774. Sci Rep. 2016. PMID: 27026381 Free PMC article.
-
PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples.Bioinformatics. 2018 Sep 1;34(17):i715-i721. doi: 10.1093/bioinformatics/bty595. Bioinformatics. 2018. PMID: 30423069 Free PMC article.
-
Next Generation Sequencing and Bioinformatics Methodologies for Infectious Disease Research and Public Health: Approaches, Applications, and Considerations for Development of Laboratory Capacity.J Infect Dis. 2020 Mar 28;221(Suppl 3):S292-S307. doi: 10.1093/infdis/jiz286. J Infect Dis. 2020. PMID: 31612214 Review.
-
Clinical Metagenomic Next-Generation Sequencing for Pathogen Detection.Annu Rev Pathol. 2019 Jan 24;14:319-338. doi: 10.1146/annurev-pathmechdis-012418-012751. Epub 2018 Oct 24. Annu Rev Pathol. 2019. PMID: 30355154 Free PMC article. Review.
Cited by
-
Taxonomer: an interactive metagenomics analysis portal for universal pathogen detection and host mRNA expression profiling.Genome Biol. 2016 May 26;17(1):111. doi: 10.1186/s13059-016-0969-1. Genome Biol. 2016. PMID: 27224977 Free PMC article.
-
Draft Genome Sequence of Mycobacterium heraklionense Strain Davo.Genome Announc. 2015 Jul 23;3(4):e00807-15. doi: 10.1128/genomeA.00807-15. Genome Announc. 2015. PMID: 26205863 Free PMC article.
-
viromeBrowser: A Shiny App for Browsing Virome Sequencing Analysis Results.Viruses. 2021 Mar 9;13(3):437. doi: 10.3390/v13030437. Viruses. 2021. PMID: 33803225 Free PMC article.
-
Draft Genome Sequence of Mycobacterium arupense Strain GUC1.Genome Announc. 2015 Jun 11;3(3):e00630-15. doi: 10.1128/genomeA.00630-15. Genome Announc. 2015. PMID: 26067970 Free PMC article.
-
Assuring the Quality of Next-Generation Sequencing in Clinical Microbiology and Public Health Laboratories.J Clin Microbiol. 2016 Dec;54(12):2857-2865. doi: 10.1128/JCM.00949-16. Epub 2016 Aug 10. J Clin Microbiol. 2016. PMID: 27510831 Free PMC article. Review.
References
-
- Akobeng AK 2007. Understanding diagnostic tests 3: receiver operating characteristic curves. Acta Paediatr 96: 644–647 - PubMed
-
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ 1990. Basic local alignment search tool. J Mol Biol 215: 403–410 - PubMed
-
- Bloch KC, Glaser C 2007. Diagnostic approaches for patients with suspected encephalitis. Curr Infect Dis Rep 9: 315–322 - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical