Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Jun 13:2024.06.12.593113.
doi: 10.1101/2024.06.12.593113.

Exploring penetrance of clinically relevant variants in over 800,000 humans from the Genome Aggregation Database

Affiliations

Exploring penetrance of clinically relevant variants in over 800,000 humans from the Genome Aggregation Database

Sanna Gudmundsson et al. bioRxiv. .

Update in

Abstract

Incomplete penetrance, or absence of disease phenotype in an individual with a disease-associated variant, is a major challenge in variant interpretation. Studying individuals with apparent incomplete penetrance can shed light on underlying drivers of altered phenotype penetrance. Here, we investigate clinically relevant variants from ClinVar in 807,162 individuals from the Genome Aggregation Database (gnomAD), demonstrating improved representation in gnomAD version 4. We then conduct a comprehensive case-by-case assessment of 734 predicted loss of function variants (pLoF) in 77 genes associated with severe, early-onset, highly penetrant haploinsufficient disease. We identified explanations for the presumed lack of disease manifestation in 701 of the variants (95%). Individuals with unexplained lack of disease manifestation in this set of disorders rarely occur, underscoring the need and power of deep case-by-case assessment presented here to minimize false assignments of disease risk, particularly in unaffected individuals with higher rates of secondary properties that result in rescue.

PubMed Disclaimer

Conflict of interest statement

COMPETING INTERESTS A.O-D.L. is on the scientific advisory board for Congenica, receives research funding in the form of reagents from Pacific Biosciences, and is a paid advisor to Addition Therapeutics and former advisor to Tome Biosciences and Ono Pharma USA. D.G.M. is a paid adviser to GlaxoSmithKline, Insitro, and Overtone Therapeutics, and receives research funding from Microsoft Corporation. H.L.R. receives research funding from Microsoft. T.L. is an advisor and has equity in Variant Bio.

Figures

Fig. 1:
Fig. 1:
Representation of ClinVar variants in 807,162 individuals in gnomAD v4. (A) Variant count of ClinVar variants in ClinVar (grey) vs. gnomAD (blue) in each classification category pathogenic/likely pathogenic (P/LP), variant of uncertain significance (VUS), benign/likely benign (B/LB) or with conflicting classifications. (B) Percentage of ClinVar variants reported in gnomAD in at least one individual. (C) The proportion of variants by variant type within each clinical significance classification and, (D) within each allele frequency bin, both for variants in gnomAD. (E) Total number of P/LP variants within each allele count bin (including variants absent from gnomAD for comparison). (F) The inheritance pattern of the gene harboring the P/LP variants in gnomAD versus all variants in ClinVar.
Fig. 2:
Fig. 2:. The p.Gly45Glu GJB2 variant associated with severe pediatric disease is rescued by a modifying variant in the East Asian genetic ancestry group.
(A) Schematic of the rescue mechanisms where the modifying variant, a stop gained p.Tyr136Ter in cis with the dominant disease-causing p.Gly45Glu, results in incomplete penetrance by mono-allelic expression of the reference allele only (green). (B) Paired individual-level read data of the two variants occurring in cis. (C) East Asian (n=34) and Admixed American (n=1) individuals carrying both the pathogenic p.Gly45Glu variant (left panel) and the p.Tyr136Ter modifying variant (right panel).
Fig. 3:
Fig. 3:. Deep assessment of 734 predicted high-confidence (HC) loss-of-function (pLoF) variants found in 77 haploinsufficient (HI) genes in 76,215 genomes.
(A) Schematic of how modifying variants (black) may result in lack of a phenotype. (B) Filtering approach. 1KG: 1000 Genomes Project, AB: Allele balance, CH: clonal hematopoesis, MANE: Matched Annotation from NCBI and EMBL-EBI. (C) Variant count per HI gene colored by (also for E-G): explained (blue), uncertain (grey), unexplained (red). The number indicates unexplained variants (red). (D) The explanation for evading LoF in 511 of 734 (69.6%) of variants, MNV: multi-nucleotide variant, pext: per-base-expression score, tx: transcript, cons: conservation. (E) Comparison of outcome in different gene sets, this set (left), heterozygous pLoF variants in autosomal recessive (AR) disease genes in gnomAD v2 (middle), all homozygous (hom) pLoF variants in gnomAD v2 (right). (F) Allele frequency (AF) of variants in this set by LoF curation outcome (log10 scale). (G) The proportion of pLoF variants explained, uncertain, or unexplained within each ClinVar clinical classification category pathogenic/likely pathogenic (P/LP), uncertain significance (VUS), benign/likely benign (B/LB).
Fig. 4:
Fig. 4:. Splicing quantitative trait loci (sQTL) analysis.
(A) Schematic of alternative splicing as a mechanism for incomplete penetrance. (B) Labels for each of the nine predicted loss of function (pLoF) variants. Bars are colored by variant count per gene. Tx: transcript, pext: per base expression score, AB: allele balance. (C) Overview of pLoF variants in MEF2C in ClinVar and in gnomAD v4 (all variants, including exomes). Black box marks region of reduced mean pext score (from v2, blue bar, for one exon the score is not available [N/A]) and enrichment of MEF2C pLoF variants in gnomAD.

References

    1. Tarailo-Graovac M., Zhu J. Y. A., Matthews A., van Karnebeek C. D. M. & Wasserman W. W. Assessment of the ExAC data set for the presence of individuals with pathogenic genotypes implicated in severe Mendelian pediatric disorders. Genet. Med. 19, 1300–1308 (2017). - PMC - PubMed
    1. Carlston C. M. et al. Pathogenic ASXL1 somatic variants in reference databases complicate germline variant interpretation for Bohring-Opitz Syndrome. Hum. Mutat. 38, 517–523 (2017). - PMC - PubMed
    1. Lek M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016). - PMC - PubMed
    1. Karczewski K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020). - PMC - PubMed
    1. Wright C. F. et al. Assessing the Pathogenicity, Penetrance, and Expressivity of Putative Disease-Causing Variants in a Population Setting. Am. J. Hum. Genet. 104, 275–286 (2019). - PMC - PubMed

Publication types