This is a preprint.
Taxonize-gb: A tool for filtering GenBank non-redundant databases based on taxonomy
- PMID: 38585727
- PMCID: PMC10996545
- DOI: 10.1101/2024.03.22.586347
Abstract
Analyzing taxonomic diversity and identifying taxa in diverse ecological samples has become routine in various research and industrial fields. While DNA-barcoding marker-gene approaches were once prevalent, the decreasing cost of next-generation sequencing has made metagenomic shotgun sequencing more popular and feasible. In contrast to DNA barcoding, metagenomic shotgun sequencing allows in-depth characterization of structural and functional diversity. However, analysis of such data is still considered a hurdle due to the absence of taxa-specific databases. Here we present taxonize-gb, a command-line software tool that extracts the subsets of the GenBank non-redundant nucleotide and protein databases related to one or more input taxonomy identifiers. Our tool allows the creation of taxa-specific reference databases tailored to specific research questions, which reduces search times and therefore represents a practical solution for researchers who analyze large metagenomic datasets on a regular basis. Taxonize-gb is an open-source, Python-based command-line tool freely available for installation at https://pypi.org/project/taxonize-gb/ and on GitHub at https://github.com/msabrysarhan/taxonize_genbank. It is released under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
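The general idea of taxonomy-based filtering can be illustrated with a minimal Python sketch. This is not the taxonize-gb implementation; it only assumes that a child-to-parent taxonomy table and an accession-to-taxid mapping are available in memory. The function names, the toy taxonomy (with a deliberately simplified lineage), and the placeholder accessions are all hypothetical.

```python
from collections import defaultdict, deque


def collect_descendants(parent_of, root_taxids):
    """Return the given taxids plus every taxid beneath them in the tree."""
    children = defaultdict(list)
    for child, parent in parent_of.items():
        if child != parent:  # the root node lists itself as its own parent
            children[parent].append(child)

    wanted = set()
    queue = deque(root_taxids)
    while queue:  # breadth-first walk down the taxonomy
        taxid = queue.popleft()
        if taxid in wanted:
            continue
        wanted.add(taxid)
        queue.extend(children[taxid])
    return wanted


def filter_records(records, accession_to_taxid, wanted):
    """Yield (accession, sequence) pairs whose taxid is in `wanted`."""
    for accession, sequence in records:
        if accession_to_taxid.get(accession) in wanted:
            yield accession, sequence


# Toy child -> parent table (assumption: lineage heavily simplified).
PARENT_OF = {1: 1, 2: 1, 1224: 2, 562: 1224, 2759: 1, 9606: 2759}

# Placeholder accessions and sequences, invented for illustration only.
ACCESSION_TO_TAXID = {"ACC_A": 562, "ACC_B": 9606}
RECORDS = [("ACC_A", "ATGC"), ("ACC_B", "GGCC")]

bacteria = collect_descendants(PARENT_OF, [2])
kept = list(filter_records(RECORDS, ACCESSION_TO_TAXID, bacteria))
```

Here `kept` retains only the record assigned to a taxon under taxid 2. In practice the two tables would be far too large for naive in-memory handling of every record, which is one reason a dedicated tool is useful.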
Conflict of interest statement
The authors declare there is no conflict of interest.