Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 9;37(17):2753-2754.
doi: 10.1093/bioinformatics/btab087.

Crypt4GH: a file format standard enabling native access to encrypted data

Affiliations

Crypt4GH: a file format standard enabling native access to encrypted data

Alexander Senf et al. Bioinformatics. .

Abstract

Motivation: The majority of genome analysis tools and pipelines require data to be decrypted for access. This potentially leaves sensitive genetic data exposed, either because the unencrypted data is not removed after analysis, or because the data leaves traces on the permanent storage medium.

Results: : We defined a file container specification enabling direct byte-level compatible random access to encrypted genetic data stored in community standards such as SAM/BAM/CRAM/VCF/BCF. By standardizing this format, we show how it can be added as a native file format to genomic libraries, enabling direct analysis of encrypted data without the need to create a decrypted copy.

Availability and implementation: The Crypt4GH specification can be found at: http://samtools.github.io/hts-specs/crypt4gh.pdf.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

References

    1. Kelleher J. et al.; GA4GH Streaming Task Team. (2019) htsget: a protocol for securely streaming genomic data. Bioinformatics, 35, 119–121. - PMC - PubMed
    1. Kim M. et al. (2020) Ultra-fast homomorphic encryption models enable secure outsourcing of genotype imputation. bioRxiv 2020.07.02.183459, doi:10.1101/2020.07.02.183459. - PMC - PubMed
    1. Morteza H. et al. (2019) CRYFA: a secure encryption tool for genomic data. Bioinformatics, 35, 146–148. - PMC - PubMed
    1. Rescorla E. (2018) The Transport Layer Security (TLS) Protocol Version 1.3. RFC Editor, doi: 10.17487/RFC8446.
    1. Zhicong H. et al. (2016) A privacy-preserving solution for compressed storage and selective retrieval of genomic data. Genome Res., 26, 1687–1696. - PMC - PubMed