Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 1;35(22):4791-4793.
doi: 10.1093/bioinformatics/btz508.

GTShark: genotype compression in large projects

Affiliations

GTShark: genotype compression in large projects

Sebastian Deorowicz et al. Bioinformatics. .

Abstract

Summary: Nowadays large sequencing projects handle tens of thousands of individuals. The huge files summarizing the findings definitely require compression. We propose a tool able to compress large collections of genotypes almost 30% better than the best tool to date, i.e. squeezing human genotype to less than 62 KB. Moreover, it can also compress single samples in reference to the existing database achieving comparable results.

Availability and implementation: https://github.com/refresh-bio/GTShark.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Publication types

LinkOut - more resources