Diverse Facets of Nonhuman Sequences in Read Outputs of the Human Next-Generation Sequencing Data and Their Relevance with Viruses
- PMID: 40455162
- DOI: 10.1007/978-1-0716-4546-8_14
Diverse Facets of Nonhuman Sequences in Read Outputs of the Human Next-Generation Sequencing Data and Their Relevance with Viruses
Abstract
In human genomic studies, on average, 10% of next-generation sequencing (NGS) reads fail to align with the human reference genome. These unmapped reads vary across samples and have three main potential sources. First, they could represent contamination introduced during sample processing or from the sequencing technology itself. Second, these sequences might originate from microorganisms, like viruses, bacteria, and fungi, that have coevolved with humans and residing within humans. These natural inhabitants of the human body make up the human microbiota. During taking the human cell samples, the microbiota of the surroundings can infect the human samples. Third, these reads could come from active or dormant pathogens residing in the taken human cell samples, like viruses. Research shows that the composition of these microbial species changes with the health and condition of human tissues. In this study, author proposes that unmapped reads may serve as indicators of the pathological state of various tissues and cell types. An outline is marked for experimental approaches to test these ideas and explore the potential of these reads as diagnostic markers.
Keywords: Microbiome; Microbiota; Next-generation sequencing; Transcriptomics; Viriome; Virus infections.
© 2025. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.
Similar articles
-
From trash to treasure: detecting unexpected contamination in unmapped NGS data.BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):168. doi: 10.1186/s12859-019-2684-x. BMC Bioinformatics. 2019. PMID: 30999839 Free PMC article.
-
Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples.BMC Genomics. 2017 May 15;18(1):378. doi: 10.1186/s12864-017-3721-7. BMC Genomics. 2017. PMID: 28506246 Free PMC article.
-
k-mer-Based Metagenomics Tools Provide a Fast and Sensitive Approach for the Detection of Viral Contaminants in Biopharmaceutical and Vaccine Manufacturing Applications Using Next-Generation Sequencing.mSphere. 2021 Apr 21;6(2):e01336-20. doi: 10.1128/mSphere.01336-20. mSphere. 2021. PMID: 33883263 Free PMC article.
-
Next-Generation Sequencing Technologies.Cold Spring Harb Perspect Med. 2019 Nov 1;9(11):a036798. doi: 10.1101/cshperspect.a036798. Cold Spring Harb Perspect Med. 2019. PMID: 30478097 Free PMC article. Review.
-
The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing.Brief Funct Genomics. 2019 Feb 14;18(1):1-12. doi: 10.1093/bfgp/ely037. Brief Funct Genomics. 2019. PMID: 30462154 Review.
References
-
- Chrisman B et al (2022) The human “contaminome”: bacterial, viral, and computational contamination in whole genome sequences from 1000 families. Sci Rep 12:1–9 - DOI
-
- Afzaal M et al (2022) Human gut microbiota in health and disease: unveiling the relationship. Front Microbiol 13:1–14 - DOI
-
- Sangiovanni M, Granata I, Thind AS, Guarracino MR (2019) From trash to treasure : detecting unexpected contamination in unmapped NGS data. BMC Bioinform 20:1–12 - DOI
MeSH terms
LinkOut - more resources
Full Text Sources