Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 29;40(4):btae152.
doi: 10.1093/bioinformatics/btae152.

GINSA: an accumulator for paired locality and next-generation small ribosomal subunit sequence data

Affiliations

GINSA: an accumulator for paired locality and next-generation small ribosomal subunit sequence data

Eric Odle et al. Bioinformatics. .

Abstract

Motivation: Motivated by the challenges of decentralized genetic data spread across multiple international organizations, GINSA leverages the Global Biodiversity Information Facility infrastructure to automatically retrieve and link small ribosomal subunit sequences with locality information.

Results: Testing on taxa from major organism groups demonstrates broad applicability across taxonomic levels and dataset sizes.

Availability and implementation: GINSA is a freely accessible Python program under the MIT License and can be installed from PyPI via pip.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Chart visualizing the GINSA workflow. User input is taken as a GBIF search taxon. Occurrences are then linked with their source sequences archived on ENA. Output CSV and FASTA files link GBIF occurrence IDs, localities, and sequences.

References

    1. Adl SM, Bass D, Lane CE. et al. Revisions to the classification, nomenclature, and diversity of eukaryotes. J Eukaryot Microbiol 2019;66:4–119. - PMC - PubMed
    1. Benson D, Lipman DJ, Ostell J.. Genbank. Nucleic Acids Res 1993;21:2963–5. - PMC - PubMed
    1. Burgin J, Ahamed A, Cummins C. et al. The European nucleotide archive in 2022. Nucleic Acids Res 2023;51:D121–5. - PMC - PubMed
    1. Câmara PE, Bones FLV, Lopes FAC. et al. DNA metabarcoding reveals cryptic diversity in Forest soils on the isolated Brazilian Trindade Island, South Atlantic. Microb Ecol 2023;85:1056–71. - PubMed
    1. Choudhary S. pysradb: a python package to query next-generation sequencing metadata and data from NCBI sequence read archive. F1000Res 2019;8:532. - PMC - PubMed

Publication types