Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug;59(8):72-80.
doi: 10.1145/2957324.

Computational Biology in the 21st Century: Scaling with Compressive Algorithms

Affiliations

Computational Biology in the 21st Century: Scaling with Compressive Algorithms

Bonnie Berger et al. Commun ACM. 2016 Aug.
No abstract available

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) Moore’s and (b) Kryder’s laws contrasted with genomic sequence data.
Figure 2
Figure 2
The next-generation sequencing (NGS) pipeline.
Figure 3
Figure 3
Cartoon depiction of points in an arbitrary high-dimensional space, as might arise from genomes generated by mutation and selection during the course of evolution. Although high dimensional locally, at the global scale of covering spheres, the data cloud looks nearly 1-dimensional, which enables entropy scaling of similarity search. Clusters cover the data points but do not cover unoccupied regions of space. The green triangle represents a query, with two concentric search radii (red circles) around it. Thanks to low fractal dimension, the large circle does not contain vastly more points than the small circle.

References

    1. 1000 Genomes Project Consortium et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56–65. - PMC - PubMed
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of Molecular Biology. 1990;215(3):403–410. - PubMed
    1. Berger B, Peng J, Singh M. Computational solutions for omics data. Nature Reviews Genetics. 2013;14(5):333–346. - PMC - PubMed
    1. Bonfield JK, Mahoney MV. Compression of FASTQ and SAM format sequencing data. PLoS ONE. 2013;8(3):e59190. - PMC - PubMed
    1. Bredel M, Jacoby E. Chemogenomics: An emerging strategy for rapid target and drug discovery. Nature Reviews Genetics. 2004;5(4):262–275. - PubMed

LinkOut - more resources