Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May 15;30(10):1471-2.
doi: 10.1093/bioinformatics/btu036. Epub 2014 Jan 26.

NGSANE: a lightweight production informatics framework for high-throughput data analysis

Affiliations

NGSANE: a lightweight production informatics framework for high-throughput data analysis

Fabian A Buske et al. Bioinformatics. .

Abstract

Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software 'pipelines'. However, individual components depreciate rapidly because of the evolving technology and analysis methods, often rendering entire versions of production informatics pipelines obsolete. Constructing pipelines from Linux bash commands enables the use of hot swappable modular components as opposed to the more rigid program call wrapping by higher level languages, as implemented in comparable published pipelining systems. Here we present Next Generation Sequencing ANalysis for Enterprises (NGSANE), a Linux-based, high-performance-computing-enabled framework that minimizes overhead for set up and processing of new projects, yet maintains full flexibility of custom scripting when processing raw sequence data.

Availability and implementation: Ngsane is implemented in bash and publicly available under BSD (3-Clause) licence via GitHub at https://github.com/BauerLab/ngsane.

Contact: Denis.Bauer@csiro.au

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
(a) Separation of project data from NGSANE core. (b) Workflow of NGSANE. (c) Example of automatically created project summary

References

    1. Anders S, et al. Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nat. Protoc. 2013;8:1765–1786. - PubMed
    1. Auer PL, Doerge RW. Statistical design and analysis of RNA sequencing data. Genetics. 2010;185:405–416. - PMC - PubMed
    1. Goecks J, et al. Galaxy: a comprehensive approach for supporting accessible, reproducible and transparent computational research in the life sciences. Genome Biol. 2010;11:R86. - PMC - PubMed
    1. Köster J, Rahmann S. Snakemake – a scalable bioinformatics workflow engine. Bioinformatics. 2012;28:2520–2522. - PubMed
    1. McCoy CO, et al. Nestly – a framework for running software with nested parameter choices and aggregating results. Bioinformatics. 2013;29:387–388. - PMC - PubMed

Publication types