Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;28(7):691-3.
doi: 10.1038/nbt0710-691.

Cloud computing and the DNA data race

Affiliations

Cloud computing and the DNA data race

Michael C Schatz et al. Nat Biotechnol. 2010 Jul.
No abstract available

PubMed Disclaimer

Figures

Figure 1
Figure 1. Map-Shuffle-Scan framework used by Crossbow
Users begin by uploading the sequencing reads into the cloud storage. Hadoop, running on a cluster of virtual machines in the cloud, then maps the unaligned reads to the reference genome using many parallel instances of Bowtie. Hadoop then automatically shuffles the alignments into sorted bins determined by chromosome region. Finally, many parallel instances of SOAPsnp scan the sorted alignments in each bin. The final output is a stream of SNP calls stored within the cloud that can be downloaded back to the user's local computer.

References

    1. Stein LD. The case for cloud computing in genome informatics. Genome Biol. 2010;11:207. - PMC - PubMed
    1. Moore GE. Cramming more components onto integrated circuits. Electronics. 1965;38:4–1965.
    1. Dongarra JJ, Otto SW, Snir M, Walker D. A message passing standard for MPP and workstations. Commun. ACM. 1996;39:84–1996.
    1. Litzkow M, Livny M, Mutka M. Condor: A Hunter of Idle Workstations. 8th International Conference of Distributed Computing Systems.1988.
    1. Dagum L, Menon R. OpenMP: An Industry-Standard API for Shared-Memory Programming. IEEE Comput. Sci. Eng. 1998;5:46–1998.

Publication types