Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Oct;46(5):774-81.
doi: 10.1016/j.jbi.2013.07.001. Epub 2013 Jul 18.

'Big data', Hadoop and cloud computing in genomics

Affiliations
Free article

'Big data', Hadoop and cloud computing in genomics

Aisling O'Driscoll et al. J Biomed Inform. 2013 Oct.
Free article

Abstract

Since the completion of the Human Genome project at the turn of the Century, there has been an unprecedented proliferation of genomic sequence data. A consequence of this is that the medical discoveries of the future will largely depend on our ability to process and analyse large genomic data sets, which continue to expand as the cost of sequencing decreases. Herein, we provide an overview of cloud computing and big data technologies, and discuss how such expertise can be used to deal with biology's big data sets. In particular, big data technologies such as the Apache Hadoop project, which provides distributed and parallelised data processing and analysis of petabyte (PB) scale data sets will be discussed, together with an overview of the current usage of Hadoop within the bioinformatics community.

Keywords: Big data; Bioinformatics; Cloud computing; Genomics; Hadoop.

PubMed Disclaimer

Publication types

LinkOut - more resources