Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 20:4:50.
doi: 10.12688/f1000research.6157.2. eCollection 2015.

ViennaNGS: A toolbox for building efficient next- generation sequencing analysis pipelines

Affiliations

ViennaNGS: A toolbox for building efficient next- generation sequencing analysis pipelines

Michael T Wolfinger et al. F1000Res. .

Abstract

Recent achievements in next-generation sequencing (NGS) technologies lead to a high demand for reuseable software components to easily compile customized analysis workflows for big genomics data. We present ViennaNGS, an integrated collection of Perl modules focused on building efficient pipelines for NGS data processing. It comes with functionality for extracting and converting features from common NGS file formats, computation and evaluation of read mapping statistics, as well as normalization of RNA abundance. Moreover, ViennaNGS provides software components for identification and characterization of splice junctions from RNA-seq data, parsing and condensing sequence motif data, automated construction of Assembly and Track Hubs for the UCSC genome browser, as well as wrapper routines for a set of commonly used NGS command line tools.

Keywords: Perl; RNA-seq; next generation sequencing; pipelines; read mapping.

PubMed Disclaimer

Conflict of interest statement

Competing interests: No competing interests were disclosed.

Figures

Figure 1.
Figure 1.. Schematic overview of ViennaNGS components.
Core modules can be combined within a data analysis script in a flexible manner to meet individual analysis objectives and experimental setup.
Figure 2.
Figure 2.. Class diagram illustrating the relations among generic Moose classes which are used as abstract representations of genomic intervals (only attributes are shown).
Figure 3.
Figure 3.. Schematic representation of genomic interval classes in terms of their corresponding feature annotation.
Simple intervals (“features”) are characterized by ViennaNGS::Feature objects (bottom box). At the next level, ViennaNGS::FeatureChain bundles these, thereby maintaining individual annotation chains for e.g. UTRs, exons, introns, splice junctions, etc. (middle box). The topmost level is given by ViennaNGS::FeatureLine objects, representing individual transcripts.

References

    1. Förstner KU, Vogel J, Sharma CM: READemption-a tool for the computational analysis of deep-sequencing-based transcriptome data. Bioinformatics. 2014;30(23):3421–3. 10.1093/bioinformatics/btu533 - DOI - PubMed
    1. Breese MR, Liu Y: NGSUtils: a software suite for analyzing and manipulating next-generation sequencing datasets. Bioinformatics. 2013;29(4):494–6. 10.1093/bioinformatics/bts731 - DOI - PMC - PubMed
    1. Heinz S, Benner C, Spann N, et al. : Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38(4):576–89. 10.1016/j.molcel.2010.05.004 - DOI - PMC - PubMed
    1. Goecks J, Nekrutenko A, Taylor J, et al. : Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010;11(8):R86. 10.1186/gb-2010-11-8-r86 - DOI - PMC - PubMed
    1. Anders S, Pyl PT, Huber W: HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31(2):166–9. 10.1093/bioinformatics/btu638 - DOI - PMC - PubMed