Gigascience. 2020 Oct 17;9(10):giaa105.
doi: 10.1093/gigascience/giaa105.

NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy

Willem de Koning et al. Gigascience. 2020.

Abstract

Background: Long-read sequencing can be applied to generate very long contigs and even completely assembled genomes at relatively low cost and with minimal sample preparation. As a result, long-read sequencing platforms are becoming more popular. In this respect, the Oxford Nanopore Technologies-based long-read sequencing "nanopore" platform is becoming a widely used tool with a broad range of applications and end-users. However, the need to explore and manipulate the complex data generated by long-read sequencing platforms necessitates accompanying specialized bioinformatics platforms and tools to process the long-read data correctly. Importantly, such tools should additionally help democratize bioinformatics analysis by enabling easy access and ease-of-use solutions for researchers.

Results: The Galaxy platform provides a user-friendly interface to computational command line-based tools, handles the software dependencies, and provides refined workflows. The users do not have to possess programming experience or extended computer skills. The interface enables researchers to perform powerful bioinformatics analysis, including the assembly and analysis of short- or long-read sequence data. The newly developed "NanoGalaxy" is a Galaxy-based toolkit for analysing long-read sequencing data, which is suitable for diverse applications, including de novo genome assembly from genomic, metagenomic, and plasmid sequence reads.

Conclusions: A range of best-practice tools and workflows for long-read sequence genome assembly has been integrated into a NanoGalaxy platform to facilitate easy access and use of bioinformatics tools for researchers. NanoGalaxy is freely available at the European Galaxy server https://nanopore.usegalaxy.eu with supporting self-learning training material available at https://training.galaxyproject.org.

Keywords: Galaxy; Nanopore; long-read sequencing; reproducibility; workflows.

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1: Representation of the output of Wick et al. [16]. The plasmid assembly graphs created by Bandage [31] are shown to confirm that the workflow functions as expected. The length distribution, total yield, and N50 of the Oxford Nanopore Technologies (ONT) reads of each Klebsiella pneumoniae represent the input data. Mb: megabase pairs.
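
The caption reports total yield and N50 for the input reads without defining them. As a minimal sketch, assuming the usual definition of read N50 (the length L such that reads of length L or longer contain at least half of all sequenced bases), the two statistics could be computed as follows. The function name `read_n50` and the read lengths are hypothetical and for illustration only; they are not taken from the paper or from any NanoGalaxy tool.

```python
def read_n50(lengths):
    """Return the N50 of a collection of read lengths (in bases).

    Assumes the common definition: the largest length L such that reads
    of length >= L together hold at least half of the total bases.
    """
    if not lengths:
        raise ValueError("need at least one read length")
    total = sum(lengths)
    running = 0
    # Walk from the longest read down until half of the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical read lengths in bases (illustrative, not data from the paper).
example_lengths = [2_000, 3_000, 5_000, 8_000, 12_000]

# Total yield in megabase pairs (Mb), the unit used in the caption.
total_yield_mb = sum(example_lengths) / 1_000_000
n50 = read_n50(example_lengths)
print(f"total yield: {total_yield_mb:.3f} Mb, N50: {n50} bases")
```

With these five hypothetical reads, the two longest (12,000 and 8,000 bases) already cover 20,000 of the 30,000 total bases, so the N50 is 8,000 bases.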

References

    1. Gilissen C, Hoischen A, Brunner HG, et al. Unlocking Mendelian disease using exome sequencing. Genome Biol. 2011;12(9):228.
    2. de Koning AJ, Gu W, Castoe TA, et al. Repetitive elements may comprise over two-thirds of the human genome. PLoS Genet. 2011;7(12):e1002384.
    3. Goodwin S, McPherson JD, McCombie WR. Coming of age: Ten years of next-generation sequencing technologies. Nat Rev Genet. 2016;17(6):333.
    4. Feuk L, Carson AR, Scherer SW. Structural variation in the human genome. Nat Rev Genet. 2006;7(2):85.
    5. Jain M, Olsen HE, Paten B, et al. The Oxford Nanopore MinION: Delivery of nanopore sequencing to the genomics community. Genome Biol. 2016;17(1):239.
