Review
Brief Bioinform. 2014 May;15(3):390-406.
doi: 10.1093/bib/bbt088. Epub 2013 Dec 17.

Compressive biological sequence analysis and archival in the era of high-throughput sequencing technologies

Raffaele Giancarlo et al. Brief Bioinform. 2014 May.

Abstract

High-throughput sequencing technologies produce large collections of data, mainly DNA sequences with additional information, requiring the design of efficient and effective methodologies for both their compression and storage. In this context, we first provide a classification of the main techniques that have been proposed, according to three specific research directions that have emerged from the literature and, for each, we provide an overview of the current techniques. Finally, to make this review useful to researchers and technicians applying the existing software and tools, we include a synopsis of the main characteristics of the described approaches, including details on their implementation and availability. Performance of the various methods is also highlighted, although the state of the art does not lend itself to a consistent and coherent comparison among all the methods presented here.

Keywords: analysis of large biological sequence collections; compressive sequence analysis; data compression in bioinformatics; data compression of large sequence collections; storage and management of HTS data; succinct data structures for bioinformatics.
