Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 6;4(1):vbae017.
doi: 10.1093/bioadv/vbae017. eCollection 2024.

ZygosityPredictor

Affiliations

ZygosityPredictor

Marco Rheinnecker et al. Bioinform Adv. .

Abstract

Summary: ZygosityPredictor provides functionality to evaluate how many copies of a gene are affected by mutations in next generation sequencing data. In cancer samples, the tool processes both somatic and germline mutations. In particular, ZygosityPredictor computes the number of affected copies for single nucleotide variants and small insertions and deletions (Indels). In addition, the tool integrates information at gene level via phasing of several variants and subsequent logic to derive how strongly a gene is affected by mutations and provides a measure of confidence. This information is of particular interest in precision oncology, e.g. when assessing whether unmutated copies of tumor-suppressor genes remain.

Availability and implementation: ZygosityPredictor was implemented as an R-package and is available via Bioconductor at https://bioconductor.org/packages/ZygosityPredictor. Detailed documentation is provided in the vignette including application to an example genome.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Schematic for determination of the number of affected copies for a somatic or germline mutation in a tumor sample. Tumor cells are depicted in red, admixed healthy cells are depicted in blue. The relevant parameters are color-coded. Calculation of affected copies are depicted in grey boxes.
Figure 2.
Figure 2.
Phasing. (A) Schematic for read-level phasing in a diploid genome with two variants in a region of interest. On the left, the variants are located on different alleles; on the right, variants are located on the same allele and therefore leave one wt allele intact. Below, repercussions on NGS reads/read-pairs are shown. (B) Example of a genomic region with 7 mutations/variants to be phased. Variants are depicted in red and SNPs (germline variation including polymorphisms) in blue. AIP, allelic imbalance phasing. (C) Phasing-matrix of the same example as in (B) with detailed indirect reasoning for all pairwise variant combinations that cannot be determined by direct phasing. Colors of phased combinations correspond to read-pair colors in (B).

References

    1. Campbell P, Getz G, Korbel J, et al.Pan-cancer analysis of whole genomes. Nature (London) 2020;578:129–36. - PubMed
    1. Garcia-Prieto CA, Martínez-Jiménez F, Valencia A. et al. Detection of oncogenic and clinically actionable mutations in cancer genomes critically depends on variant calling tools. Bioinformatics 2022;38:3181–91. - PMC - PubMed
    1. Horak P, Heining C, Kreutzfeldt S. et al. Comprehensive genomic and transcriptomic analysis for guiding therapeutic decisions in patients with rare cancers. Cancer Discov 2021;11:2780–95. - PubMed
    1. Hübschmann D, Schlesner M.. Evaluation of whole genome sequencing data. Methods Mol Biol 2019;1956:321–36. - PubMed
    1. Kleinheinz K, Bludau I, Hübschmann D. et al. ACEseq - Allele-specific Copy Number Estimation from Whole Genome Sequencing and its Application in Heterogeneity and Integrative Analyses of Cancer. BioRxiv, Cold Spring Harbor, NY, USA: Cold Spring Harbor Laboratory, 2017.