Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Feb 4:17:66.
doi: 10.1186/s12859-016-0923-y.

SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis

Affiliations

SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis

Benjamin K Johnson et al. BMC Bioinformatics. .

Abstract

Background: Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In spite of the numerous tools developed for each component of an RNA-seq analysis workflow, easy-to-use bacterially oriented workflow applications to combine multiple tools and automate the process are lacking. With many tools to choose from for each step, the task of identifying a specific tool, adapting the input/output options to the specific use-case, and integrating the tools into a coherent analysis pipeline is not a trivial endeavor, particularly for microbiologists with limited bioinformatics experience.

Results: To make bacterial RNA-seq data analysis more accessible, we developed a Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis (SPARTA). SPARTA is a reference-based bacterial RNA-seq analysis workflow application for single-end Illumina reads. SPARTA is turnkey software that simplifies the process of analyzing RNA-seq data sets, making bacterial RNA-seq analysis a routine process that can be undertaken on a personal computer or in the classroom. The easy-to-install, complete workflow processes whole transcriptome shotgun sequencing data files by trimming reads and removing adapters, mapping reads to a reference, counting gene features, calculating differential gene expression, and, importantly, checking for potential batch effects within the data set. SPARTA outputs quality analysis reports, gene feature counts and differential gene expression tables and scatterplots.

Conclusions: SPARTA provides an easy-to-use bacterial RNA-seq transcriptional profiling workflow to identify differentially expressed genes between experimental conditions. This software will enable microbiologists with limited bioinformatics experience to analyze their data and integrate next generation sequencing (NGS) technologies into the classroom. The SPARTA software and tutorial are available at sparta.readthedocs.org.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
SPARTA workflow diagram. Single-end Illumina FASTQ files, a FASTA formatted reference genome, and genome feature file (gff or gtf) are given as inputs to the workflow. Trimmomatic and FastQC perform trimming of adapters and low quality bases/reads and quality assessment reports, respectively. Bowtie maps the trimmed reads to the reference. HTSeq quantifies transcript abundance. R/edgeR tests for statistically significant genes and warns the user of potential batch effects present in the analyzed data set
Fig. 2
Fig. 2
Data analysis execution time comparison between SPARTA and Rockhopper2. The two programs were compared for execution time when processing one, two, or three experimental conditions as compared to a reference condition. Both SPARTA (1.0) and Rockhopper2 (2.03) were installed and tested on an off-the-shelf iMac (2.7 GHz i5, 8 GB memory, OSX 10.11.2). Dependencies: Java (1.6.0_65), Python (2.7.9), and R (3.2.2). Data are the mean of three software executions and error bars represent the standard deviation. Data files (100,000 reads/file) utilized were the example data bundled with SPARTA

Similar articles

Cited by

References

    1. Golosova O, Henderson R, Vaskin Y, Gabrielian A, Grekhov G, Nagarajan V, et al. Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses. PeerJ. 2014;2 doi: 10.7717/peerj.644. - DOI - PMC - PubMed
    1. Michalovova M, Kubat Z, Hobza R, Vyskot B, Kejnovsky E. Fully automated pipeline for detection of sex linked genes using RNA-Seq data. BMC Bioinformatics. 2015;16(1):78. doi: 10.1186/s12859-015-0509-0. - DOI - PMC - PubMed
    1. D’Antonio M, D’Onorio De Meo P, Pallocca M, Picardi E, D’Erchia AM, Calogero RA, et al. RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application. BMC Genomics. 2015;16:S3. doi: 10.1186/1471-2164-16-S6-S3. - DOI - PMC - PubMed
    1. McClure R, Balasubramanian D, Sun Y, Bobrovskyy M, Sumby P, Genco CA, et al. Computational analysis of bacterial RNA-Seq data. Nucleic Acids Res. 2013;41(14) doi: 10.1093/nar/gkt444. - DOI - PMC - PubMed
    1. Tjaden B. De novo assembly of bacterial transcriptomes from RNA-seq data. Genome Biol. 2015;16:1. doi: 10.1186/s13059-014-0572-2. - DOI - PMC - PubMed

Publication types

MeSH terms