Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep;1(9):e242.
doi: 10.1002/cpz1.242.

GALAXY Workflow for Bacterial Next-Generation Sequencing De Novo Assembly and Annotation

Affiliations

GALAXY Workflow for Bacterial Next-Generation Sequencing De Novo Assembly and Annotation

Soon Keong Wee et al. Curr Protoc. 2021 Sep.

Abstract

Whole-genome sequencing of prokaryotes is now readily available and affordable on next-generation sequencing platforms. However, the process of de novo assembly can be complicated and tedious for those without a background in computational biology, bioinformatics, or UNIX. Licenses for commercial bioinformatics software may be costly and limited in flexibility. GALAXY is a powerful graphical open-source code-free bioinformatics platform that is freely available on multiple public and private servers. Here, we describe a bacterial de novo assembly workflow using GALAXY. It performs de novo genome assembly using short reads, long reads, or a hybrid method using both short and long reads. Genome annotation, prediction of antimicrobial resistance genes, and multi-locus sequence typing are subsequently performed to characterize the draft genome. Performing genome assembly and annotation on this pipeline allows documentation, parameterization, and sharing, facilitating replication, reuse, and reproducibility of both data and methods. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Quality check of NGS reads Basic Protocol 2: De novo assembly using Unicycler Basic Protocol 3: Assembly quality check using QUAST and Bandage Basic Protocol 4: Genome annotation using Prokka Basic Protocol 5: Prediction of antimicrobial resistance genes (ARGs) Basic Protocol 6: Multi-locus sequence typing (MLST).

Keywords: De novo assembly; GALAXY; bacterial genomics; next generation sequencing; whole-genome sequencing.

PubMed Disclaimer

References

Literature Cited

References
    1. Afgan, E., Baker, D., Batut, B., van den Beek, M., Bouvier, D., Čech, M., … Blankenberg, D. (2018). The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Research, 46(W1), W537-W544. doi: 10.1093/nar/gky379.
    1. Andrews, S. (2010). FastQC a quality control tool for high throughput sequence data. Available at https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
    1. Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114-2120. doi: 10.1093/bioinformatics/btu170.
    1. Carattoli, A., Zankari, E., Garcia-Fernandez, A., Voldby Larsen, M., Lund, O., Villa, L., … Hasman, H. (2014). In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrobial Agents and Chemotherapy, 58(7), 3895-3903. doi: 10.1128/aac.02412-14.
    1. Gurevich, A., Saveliev, V., Vyahhi, N., & Tesler, G. (2013). QUAST: Quality assessment tool for genome assemblies. Bioinformatics (Oxford, England), 29(8), 1072-1075. doi: 10.1093/bioinformatics/btt086.

LinkOut - more resources