Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Aug;20(4):383-90.
doi: 10.1093/dnares/dst017. Epub 2013 May 8.

DDBJ read annotation pipeline: a cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data

Affiliations

DDBJ read annotation pipeline: a cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data

Hideki Nagasaki et al. DNA Res. 2013 Aug.

Abstract

High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analytical pipeline, the DDBJ Read Annotation Pipeline (DDBJ Pipeline), for a high-throughput annotation of NGS reads. The DDBJ Pipeline offers a user-friendly graphical web interface and processes massive NGS datasets using decentralized processing by NIG supercomputers currently free of charge. The proposed pipeline consists of two analysis components: basic analysis for reference genome mapping and de novo assembly and subsequent high-level analysis of structural and functional annotations. Users may smoothly switch between the two components in the pipeline, facilitating web-based operations on a supercomputer for high-throughput data analysis. Moreover, public NGS reads of the DDBJ Sequence Read Archive located on the same supercomputer can be imported into the pipeline through the input of only an accession number. This proposed pipeline will facilitate research by utilizing unified analytical workflows applied to the NGS data. The DDBJ Pipeline is accessible at http://p.ddbj.nig.ac.jp/.

Keywords: analytical pipeline; cloud computing; genome analysis; next-generation sequencing; sequence read archive.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Interface for modifying the settings of analysis tools in basic analysis of the DDBJ Pipeline.
Figure 2.
Figure 2.
Job status list in basic analysis of the DDBJ Pipeline. Jobs executed in the DDBJ Pipeline are shown in lists, and users may manage the jobs, for example, by downloading results or by halting the jobs. The bars at the right end of the list indicate elapsed times.

References

    1. Schuster S.C. Next-generation sequencing transforms today's biology. Nat. Methods. 2008;5:16–8. - PubMed
    1. Chi K.R. The year of sequencing. Nat. Methods. 2008;5:11–4. - PubMed
    1. Mardis E.R. The impact of next-generation sequencing technology on genetics. Trends Genet. 2008;24:133–41. - PubMed
    1. Lister R., Gregory B.D., Ecker J.R. Next is now: new technologies for sequencing of genomes, transcriptomes, and beyond. Curr. Opin. Plant Biol. 2009;12:107–18. - PMC - PubMed
    1. Metzker M.L. Sequencing technologies—the next generation. Nat. Rev. Genet. 2010;11:31–46. - PubMed

Publication types