Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jun 1;31(11):1824-6.
doi: 10.1093/bioinformatics/btv056. Epub 2015 Jan 30.

RAMPART: a workflow management system for de novo genome assembly

Affiliations

RAMPART: a workflow management system for de novo genome assembly

Daniel Mapleson et al. Bioinformatics. .

Abstract

Motivation: The de novo assembly of genomes from whole- genome shotgun sequence data is a computationally intensive, multi-stage task and it is not known a priori which methods and parameter settings will produce optimal results. In current de novo assembly projects, a popular strategy involves trying many approaches, using different tools and settings, and then comparing and contrasting the results in order to select a final assembly for publication.

Results: Herein, we present RAMPART, a configurable workflow management system for de novo genome assembly, which helps the user identify combinations of third-party tools and settings that provide good results for their particular genome and sequenced reads. RAMPART is designed to exploit High performance computing environments, such as clusters and shared memory systems, where available.

Availability and implementation: RAMPART is available under the GPLv3 license at: https://github.com/TGAC/RAMPART.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
A simplified representation of RAMPART’s architecture. Although user’s workflow must conform to the linear structure depicted here, each stage is optional and highly configurable. Most stages allow the user to select which third-party tool(s) and parameters are used, although primary input and output parameters to all tools are managed automatically. The most important pipeline stage, MASS, allows the user to execute multiple assemblers, with varying parameters. In the subsequent step, the resultant assemblies are analyzed before a single assembly is selected for use in the second half of the pipeline. Input to the MASS and AMP stages can be selected from any raw input library or from any modified libraries generated during the MECQ stage

References

    1. Abbas M.M., et al. (2014) Assessment of de novo assemblers for draft genomes: a case study with fungal genomes. BMC Genomics , 15(Suppl 9), S10. - PMC - PubMed
    1. Gnerre S., et al. (2011) High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc. Natl Acad. Sci. , 108, 1513–1518. - PMC - PubMed
    1. Koren S., et al. (2014) Automated ensemble assembly and validation of microbial genomes. BMC Bioinformatics , 15, 126. - PMC - PubMed
    1. Seemann T. (2014) Prokka: rapid prokaryotic genome annotation. Bioinformatics , 30, 2068–2069. - PubMed
    1. Tritt A., et al. (2012) An integrated pipeline for de novo assembly of microbial genomes. PLoS One, 7, e42304. - PMC - PubMed

Publication types