MetaObtainer: A Tool for Obtaining Specified Species from Metagenomic Reads of Next-generation Sequencing
- PMID: 26293485
- DOI: 10.1007/s12539-015-0281-x
Abstract
Read classification is a fundamental problem in metagenomics. With the development of next-generation sequencing (NGS), metagenome samples can be generated at much lower cost and in much less time. However, the short reads produced by NGS make read classification much more difficult than before. None of the existing tools can accurately assign NGS short reads to each genome, which limits their use in real applications. Fortunately, in many applications it is unnecessary to separate all the species in a metagenome sample from one another, because we usually focus on only some specified species categories in the sample and do not care about the others. No existing tool is designed specifically for obtaining specified species from short metagenome reads generated by NGS. In this paper, we propose a tool named MetaObtainer to obtain the specified species from NGS short reads. The tool combines some of the newest techniques for processing short reads, so it can achieve better performance than other tools. It can (1) handle NGS reads shorter than 100 bp with very high accuracy (both precision and recall are above 90%); (2) find an unknown species using the reference genomes of species similar to it; (3) perform well when reads of the specified species are very few in the dataset; (4) handle genomes of similar abundance levels as well as different abundance levels (1:10); and (5) obtain multiple species categories from a metagenome sample.
Keywords: Classification; Metagenomics; Next-generation sequencing; Short reads.
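The abstract reports accuracy as precision and recall. As a minimal sketch of how these two measures are usually computed when reads of one specified species are pulled out of a mixed sample, the Python below compares a set of obtained read IDs with the set of reads that truly belong to the target species. The function name `precision_recall`, the read IDs, and the counts are hypothetical and chosen only for illustration; they are not taken from MetaObtainer or from the paper's results.

```python
def precision_recall(obtained, truth):
    """Return (precision, recall) for a set of obtained read IDs.

    obtained: read IDs the tool assigned to the specified species
    truth:    read IDs that really come from the specified species
    """
    obtained, truth = set(obtained), set(truth)
    true_positives = len(obtained & truth)
    # Precision: fraction of obtained reads that really belong to the target.
    precision = true_positives / len(obtained) if obtained else 0.0
    # Recall: fraction of the target's reads that were obtained.
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical toy data (not from the paper): 100 reads truly come from the target.
target_reads = {f"target_{i}" for i in range(100)}
# Suppose a tool returns 95 target reads plus 5 reads from other species.
obtained_reads = {f"target_{i}" for i in range(95)} | {f"other_{i}" for i in range(5)}

precision, recall = precision_recall(obtained_reads, target_reads)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under this definition, a result counts as "more than 90%" on both measures only if few reads from other species are pulled in (precision) and few reads of the target species are missed (recall).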