Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Aug;43(8):1012-1030.
doi: 10.1002/humu.24309. Epub 2021 Dec 16.

Variant interpretation using population databases: Lessons from gnomAD

Affiliations
Review

Variant interpretation using population databases: Lessons from gnomAD

Sanna Gudmundsson et al. Hum Mutat. 2022 Aug.

Abstract

Reference population databases are an essential tool in variant and gene interpretation. Their use guides the identification of pathogenic variants amidst the sea of benign variation present in every human genome, and supports the discovery of new disease-gene relationships. The Genome Aggregation Database (gnomAD) is currently the largest and most widely used publicly available collection of population variation from harmonized sequencing data. The data is available through the online gnomAD browser (https://gnomad.broadinstitute.org/) that enables rapid and intuitive variant analysis. This review provides guidance on the content of the gnomAD browser, and its usage for variant and gene interpretation. We introduce key features including allele frequency, per-base expression levels, constraint scores, and variant co-occurrence, alongside guidance on how to use these in analysis, with a focus on the interpretation of candidate variants and novel genes in rare disease.

Keywords: allele frequency; constraint; database; gnomAD; reference population; variant interpretation.

PubMed Disclaimer

Conflict of interest statement

D.G.M. is a founder with equity of Goldfinch Bio, and serves as a paid advisor to GSK, Variant Bio, Insitro, and Foresite Labs. A.O.‐D.L. is on the Scientific Advisory Board for Congenica.

Figures

Figure 1
Figure 1
The gnomAD database aids variant interpretation world‐wide. (a) Weekly page views of gnomAD (dark blue) and ExAC (light blue) from release in October 2014 to mid‐2021. (b) Number of unique gnomAD page views in each country the past 12 months (since 2020‐06‐14) colored by none (grey), 5–10,000 (pink), and more than 10,000 (purple). (c) Schematic of the distribution and overlap of more than 195,000 unique individuals in gnomAD v2 exomes (orange), v2 genomes (green) and v3 genomes (violet)
Figure 2
Figure 2
(a) Mean count of coding very rare variants (allele frequency < 0.1%), and (b) mean count of unique coding variants (across v2 and v3) grouped by population; black bar represents the 95% confidence interval. (c) Comparison of genome‐wide distribution of loss of function (LoF) constraint scores in 19,197 genes, colored by LOEUF decile; a continuous distribution for LOEUF score and a dichotomous‐like distribution for pLI scores. Dotted line marks suggested thresholds for LoF constrained genes at pLI ≥ 0.9 and LOEUF < 0.35 in gene interpretation. LOEUF, LoF observed/expected upper bound fraction; PLI, probability of being LoF intolerant
Figure 3
Figure 3
The gnomAD gene page, displaying NSD1 as an example. Includes gene‐level information of metrics and variant distribution, and allows customized filtering. Some highlighted features are: (1–2) navigating datasets; (3) exome and (4) genome gene coverage; (5) direction of the gene (NSD1 on forward strand); (6–8) transcript and expression information; (9–10) constraint table; (11) proportion expressed across transcripts (pext) score and (12) example of a region with low pext; (13) filtering options for ClinVar variants and (14) expansion of ClinVar variant view; (15–16) gnomAD variant tracks and (17) filter gnomAD variants by consequence; (18) variant search bar; (19) variant table; (20) filter by sequencing method, variant type, and option to include low‐quality filtered variants; (21) customize variant table; (22) download variant table
Figure 4
Figure 4
The gnomAD variant page, displaying the NSD1 missense variant 5‐176562246‐A‐G, p.Met48Val (NM_022455.5:c.142A>G) as an example. Includes variant level information and site specific metrics. Some highlighted features are: (1) external resources and (2) variant feedback forms; (3) allele frequency summary table with filtering allele frequency; (4) population frequency table and (5) visualization of subcontinental populations; (6) navigating datasets; (7) liftover link for gnomAD v3 and (8) visualization of v3 non‐v2 dataset; (9) age data; (10) genotype/depth/allele balance for heterozygotes and (11) site quality metrics; (12) read data and (13) the option to load read data for additional individuals

References

    1. 1000 Genomes Project Consortium Abecasis, G. R. , Altshuler, D. , Auton, A. , Brooks, L. D. , Durbin, R. M. , & McVean, G. A. (2010). A map of human genome variation from population‐scale sequencing. Nature, 467(7319), 1061–1073. - PMC - PubMed
    1. Abou Tayoun, A. N. , Pesaran, T. , DiStefano, M. T. , Oza, A. , Rehm, H. L. , & Biesecker, L. G. , ClinGen Sequence Variant Interpretation Working Group (ClinGen SVI) . (2018). Recommendations for interpreting the loss of function PVS1 ACMG/AMP variant criterion. Human Mutation, 39(11), 1517–1524. - PMC - PubMed
    1. Van der Auwera, G. A. , Carneiro, M. O. , Hartl, C. , Poplin, R. , Del Angel, G. , Levy‐Moonshine, A. , DePristo, … , & M. A. (2013). From FastQ data to high confidence variant calls: The Genome Analysis Toolkit best practices pipeline. Current Protocols in Bioinformatics/Editoral Board, Andreas D. Baxevanis … [et Al.], 43, 11.10.1–11.10.33. - PMC - PubMed
    1. Bamshad, M. J. , Nickerson, D. A. , & Chong, J. X. (2019). Mendelian gene discovery: Fast and furious with no end in sight. American Journal of Human Genetics, 105(3), 448–455. - PMC - PubMed
    1. Carlston, C. M. , O'Donnell‐Luria, A. H. , Underhill, H. R. , Cummings, B. B. , Weisburd, B. , Minikel, E. V. , & Mao, R. (2017). Pathogenic ASXL1 somatic variants in reference databases complicate germline variant interpretation for Bohring‐Opitz Syndrome. Human Mutation, 38(5), 517–523. - PMC - PubMed

Publication types