Beacon Reconstruction Attack: Reconstruction of genomes in genomic data-sharing beacons using summary statistics
- PMID: 40388204
- PMCID: PMC12133290
- DOI: 10.1093/bioinformatics/btaf273
Beacon Reconstruction Attack: Reconstruction of genomes in genomic data-sharing beacons using summary statistics
Abstract
Motivation: Genomic data-sharing beacon protocol, developed by the Global Alliance for Genomics and Health, offers a privacy-preserving mechanism for querying genomic datasets while restricting direct data access. Despite their design, beacons remain vulnerable to privacy attacks. This study introduces a novel privacy vulnerability of the protocol: one can reconstruct large portions of the genomes of all beacon participants by only using the summary statistics reported by the protocol.
Results: We introduce a novel optimization-based algorithm that leverages beacon responses and SNP correlations for reconstruction. By optimizing for the SNP correlations and allele frequencies, the proposed approach achieves genome reconstruction with a substantially higher F1-score (70%) compared to baseline methods (45%) on beacons generated using individuals from the HapMap and OpenSNP datasets. We show that reconstructed genomes can be used by downstream applications such as in membership inference attacks against other beacons. Our findings reveal that beacons releasing allele frequencies substantially increase the reconstruction risk, underscoring the need for enhanced privacy-preserving mechanisms to protect genomic data.
Availability and implementation: Our implementation is available at https://github.com/ASAP-Bilkent/Beacon-Reconstruction-Attack.
© The Author(s) 2025. Published by Oxford University Press.
Figures


References
-
- Cho H, Simmons S, Kim R et al. Privacy-preserving biomedical database queries with optimal privacy-utility trade-offs. Cell Syst 2020;10:408–16.e9. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources