LotuS2: an ultrafast and highly accurate tool for amplicon sequencing analysis
- PMID: 36258257
- PMCID: PMC9580208
- DOI: 10.1186/s40168-022-01365-1
LotuS2: an ultrafast and highly accurate tool for amplicon sequencing analysis
Abstract
Background: Amplicon sequencing is an established and cost-efficient method for profiling microbiomes. However, many available tools to process this data require both bioinformatics skills and high computational power to process big datasets. Furthermore, there are only few tools that allow for long read amplicon data analysis. To bridge this gap, we developed the LotuS2 (less OTU scripts 2) pipeline, enabling user-friendly, resource friendly, and versatile analysis of raw amplicon sequences.
Results: In LotuS2, six different sequence clustering algorithms as well as extensive pre- and post-processing options allow for flexible data analysis by both experts, where parameters can be fully adjusted, and novices, where defaults are provided for different scenarios. We benchmarked three independent gut and soil datasets, where LotuS2 was on average 29 times faster compared to other pipelines, yet could better reproduce the alpha- and beta-diversity of technical replicate samples. Further benchmarking a mock community with known taxon composition showed that, compared to the other pipelines, LotuS2 recovered a higher fraction of correctly identified taxa and a higher fraction of reads assigned to true taxa (48% and 57% at species; 83% and 98% at genus level, respectively). At ASV/OTU level, precision and F-score were highest for LotuS2, as was the fraction of correctly reported 16S sequences.
Conclusion: LotuS2 is a lightweight and user-friendly pipeline that is fast, precise, and streamlined, using extensive pre- and post-ASV/OTU clustering steps to further increase data quality. High data usage rates and reliability enable high-throughput microbiome analysis in minutes.
Availability: LotuS2 is available from GitHub, conda, or via a Galaxy web interface, documented at http://lotus2.earlham.ac.uk/ . Video Abstract.
Keywords: 16S rRNA; Amplicon data analysis; Amplicon sequencing; ITS; Long read; Microbiome; Short read.
© 2022. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.
Figures




Similar articles
-
A comprehensive evaluation of the sl1p pipeline for 16S rRNA gene sequencing analysis.Microbiome. 2017 Aug 14;5(1):100. doi: 10.1186/s40168-017-0314-2. Microbiome. 2017. PMID: 28807046 Free PMC article.
-
A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.BMC Microbiol. 2017 Sep 13;17(1):194. doi: 10.1186/s12866-017-1101-8. BMC Microbiol. 2017. PMID: 28903732 Free PMC article.
-
Impact of DNA extraction, PCR amplification, sequencing, and bioinformatic analysis on food-associated mock communities using PacBio long-read amplicon sequencing.BMC Microbiol. 2024 Dec 6;24(1):521. doi: 10.1186/s12866-024-03677-8. BMC Microbiol. 2024. PMID: 39643893 Free PMC article.
-
CDSnake: Snakemake pipeline for retrieval of annotated OTUs from paired-end reads using CD-HIT utilities.BMC Bioinformatics. 2020 Jul 24;21(Suppl 12):303. doi: 10.1186/s12859-020-03591-6. BMC Bioinformatics. 2020. PMID: 32703166 Free PMC article.
-
Improved OTU-picking using long-read 16S rRNA gene amplicon sequencing and generic hierarchical clustering.Microbiome. 2015 Oct 5;3:43. doi: 10.1186/s40168-015-0105-6. Microbiome. 2015. PMID: 26434730 Free PMC article.
Cited by
-
Host dispersal relaxes selective pressures in rafting microbiomes and triggers successional changes.Nat Commun. 2024 Dec 30;15(1):10759. doi: 10.1038/s41467-024-54954-z. Nat Commun. 2024. PMID: 39737966 Free PMC article.
-
A pile of pipelines: An overview of the bioinformatics software for metabarcoding data analyses.Mol Ecol Resour. 2024 Jul;24(5):e13847. doi: 10.1111/1755-0998.13847. Epub 2023 Aug 7. Mol Ecol Resour. 2024. PMID: 37548515 Free PMC article. Review.
-
Global trends in research of high-throughput sequencing technology associated with chronic wounds from 2002 to 2022: A bibliometric and visualized study.Front Surg. 2023 Feb 22;10:1089203. doi: 10.3389/fsurg.2023.1089203. eCollection 2023. Front Surg. 2023. PMID: 36911623 Free PMC article.
-
Intragenomic diversity of the V9 hypervariable domain in eukaryotes has little effect on metabarcoding.iScience. 2023 Jul 12;26(8):107291. doi: 10.1016/j.isci.2023.107291. eCollection 2023 Aug 18. iScience. 2023. PMID: 37554448 Free PMC article.
-
Patterns in soil microbial diversity across Europe.Nat Commun. 2023 Jun 8;14(1):3311. doi: 10.1038/s41467-023-37937-4. Nat Commun. 2023. PMID: 37291086 Free PMC article.
References
-
- Bahram M, Hildebrand F, Forslund SK, Anderson JL, Soudzilovskaia NA, Bodegom PM, et al. Structure and function of the global topsoil microbiome. Nature. 2018;560:233–237. - PubMed
-
- Tedersoo L, Anslan S, Bahram M, Põlme S, Riit T, Liiv I, et al. Shotgun metagenomes and multiple primer pair-barcode combinations of amplicons reveal biases in metabarcoding analyses of fungi. MycoKeys. 2015;10:1–43.
Publication types
MeSH terms
Substances
Grants and funding
- BBS/E/T/000PR9817/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BB/R012490/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BBS/E/T/000PR9814/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BB/CCG1720/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BBS/E/F/000PR10355/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
LinkOut - more resources
Full Text Sources
Miscellaneous