Optimizing next-generation sequencing efficiency in clinical settings: analysis of read length impact on cost and performance
- PMID: 39266944
- PMCID: PMC11396997
- DOI: 10.1186/s12864-024-10778-1
Optimizing next-generation sequencing efficiency in clinical settings: analysis of read length impact on cost and performance
Abstract
Background: The expansion of sequencing technologies as a result of the response to the COVID-19 pandemic enabled pathogen (meta)genomics to be deployed as a routine component of surveillance in many countries. Scaling genomic surveillance, however, comes with associated costs in both equipment and sequencing reagents, which should be optimized. Here, we evaluate the cost efficiency and performance of different read lengths in identifying pathogens in metagenomic samples. We carefully evaluated performance metrics, costs, and time requirements relative to choices of 75, 150 and 300 base pairs (bp) read lengths in pathogen identification.
Results: Our findings revealed that moving from 75 bp to 150 bp read length approximately doubles both the cost and sequencing time. Opting for 300 bp reads leads to approximately two- and three-fold increases, respectively, in cost and sequencing time compared to 75 bp reads. For viral pathogen detection, the sensitivity median ranged from 99% with 75 bp reads to 100% with 150-300 bp reads. However, bacterial pathogens detection was less effective with shorter reads: 87% with 75 bp, 95% with 150 bp, and 97% with 300 bp reads. These findings were consistent across different levels of taxa abundance. The precision of pathogen detection using shorter reads was comparable to that of longer reads across most viral and bacterial taxa.
Conclusions: During disease outbreak situations, when swift responses are required for pathogen identification, we suggest prioritizing 75 bp read lengths, especially if detection of viral pathogens is aimed. This practical approach allows better use of resources, enabling the sequencing of more samples using streamlined workflows, while maintaining a reliable response capability.
Keywords: Cost efficiency; Health surveillance; Metagenomics; Pathogen detection.
© 2024. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Metagenomic Next-Generation Sequencing of Nasopharyngeal Specimens Collected from Confirmed and Suspect COVID-19 Patients.mBio. 2020 Nov 20;11(6):e01969-20. doi: 10.1128/mBio.01969-20. mBio. 2020. PMID: 33219095 Free PMC article.
-
The impact of read length on quantification of differentially expressed genes and splice junction detection.Genome Biol. 2015 Jun 23;16(1):131. doi: 10.1186/s13059-015-0697-y. Genome Biol. 2015. PMID: 26100517 Free PMC article.
-
Optimized Sequencing Adaptors Enable Rapid and Real-Time Metagenomic Identification of Pathogens during Runtime of Sequencing.Clin Chem. 2022 Jun 1;68(6):826-836. doi: 10.1093/clinchem/hvac024. Clin Chem. 2022. PMID: 35290433
-
Hybrid-Capture Target Enrichment in Human Pathogens: Identification, Evolution, Biosurveillance, and Genomic Epidemiology.Pathogens. 2024 Mar 23;13(4):275. doi: 10.3390/pathogens13040275. Pathogens. 2024. PMID: 38668230 Free PMC article. Review.
-
How do emerging long-read sequencing technologies function in transforming the plant pathology research landscape?Plant Mol Biol. 2022 Dec;110(6):469-484. doi: 10.1007/s11103-022-01305-5. Epub 2022 Aug 13. Plant Mol Biol. 2022. PMID: 35962900 Review.
References
-
- Biswas N, Mallick P, Maity SK, Bhowmik D, Mitra AG, Saha S, et al. Genomic surveillance and phylodynamic analyses reveal the emergence of novel mutations and co-mutation patterns within SARS-CoV-2 variants prevalent in India. Front Microbiol. 2021;12:703933. 10.3389/fmicb.2021.703933 - DOI - PMC - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical