Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov;6(11):1577-1586.
doi: 10.1038/s41562-022-01408-5. Epub 2022 Aug 4.

Rare genetic variants explain missing heritability in smoking

Seon-Kyeong Jang  1 Luke Evans  2   3 Allison Fialkowski  1 Donna K Arnett  4 Allison E Ashley-Koch  5 Kathleen C Barnes  6 Diane M Becker  7 Joshua C Bis  8 John Blangero  9 Eugene R Bleecker  10 Meher Preethi Boorgula  6 Donald W Bowden  11 Jennifer A Brody  8 Brian E Cade  12 Brenda W Campbell Jenkins  13 April P Carson  14 Sameer Chavan  6 L Adrienne Cupples  15 Brian Custer  16 Scott M Damrauer  17   18 Sean P David  19   20 Mariza de Andrade  21 Carla L Dinardo  22 Tasha E Fingerlin  23   24 Myriam Fornage  25 Barry I Freedman  26 Melanie E Garrett  5 Sina A Gharib  8   27 David C Glahn  28 Jeffrey Haessler  29 Susan R Heckbert  30   31 John E Hokanson  32 Lifang Hou  33 Shih-Jen Hwang  34 Matthew C Hyman  35 Renae Judy  17 Anne E Justice  36 Robert C Kaplan  29   37 Sharon L R Kardia  38 Shannon Kelly  39 Wonji Kim  40 Charles Kooperberg  29 Daniel Levy  34   41 Donald M Lloyd-Jones  33 Ruth J F Loos  42   43 Ani W Manichaikul  44 Mark T Gladwin  45 Lisa Warsinger Martin  46 Mehdi Nouraie  45 Olle Melander  47   48 Deborah A Meyers  10 Courtney G Montgomery  49 Kari E North  50 Elizabeth C Oelsner  51 Nicholette D Palmer  11 Marinelle Payton  52 Anna L Peljto  53 Patricia A Peyser  38 Michael Preuss  42   43 Bruce M Psaty  54 Dandi Qiao  40 Daniel J Rader  35   55 Nicholas Rafaels  6 Susan Redline  12 Robert M Reed  56 Alexander P Reiner  29 Stephen S Rich  44 Jerome I Rotter  57 David A Schwartz  58   59 Aladdin H Shadyab  60 Edwin K Silverman  40 Nicholas L Smith  30   31 J Gustav Smith  61   62 Albert V Smith  63 Jennifer A Smith  38 Weihong Tang  64 Kent D Taylor  57 Marilyn J Telen  5 Ramachandran S Vasan  65   66 Victor R Gordeuk  67 Zhe Wang  42   43 Kerri L Wiggins  8 Lisa R Yanek  7 Ivana V Yang  53 Kendra A Young  32 Kristin L Young  50 Yingze Zhang  45 Dajiang J Liu  68 Matthew C Keller  2 Scott Vrieze  69
Affiliations

Rare genetic variants explain missing heritability in smoking

Seon-Kyeong Jang et al. Nat Hum Behav. 2022 Nov.

Abstract

Common genetic variants explain less variation in complex phenotypes than inferred from family-based studies, and there is a debate on the source of this 'missing heritability'. We investigated the contribution of rare genetic variants to tobacco use with whole-genome sequences from up to 26,257 unrelated individuals of European ancestries and 11,743 individuals of African ancestries. Across four smoking traits, single-nucleotide-polymorphism-based heritability ([Formula: see text]) was estimated from 0.13 to 0.28 (s.e., 0.10-0.13) in European ancestries, with 35-74% of it attributable to rare variants with minor allele frequencies between 0.01% and 1%. These heritability estimates are 1.5-4 times higher than past estimates based on common variants alone and accounted for 60% to 100% of our pedigree-based estimates of narrow-sense heritability ([Formula: see text], 0.18-0.34). In the African ancestry samples, [Formula: see text] was estimated from 0.03 to 0.33 (s.e., 0.09-0.14) across the four smoking traits. These results suggest that rare variants are important contributors to the heritability of smoking.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement

Psaty serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. EKS has received grant support from GSK and Bayer Research support to University of Pennsylvania from RenalytixAI and personal fees from Calico Labs, both outside the current work. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. Other authors declare no competing interests.

Figures

Figure 1.
Figure 1.
SNP-based heritability estimates in the European ancestry sample for each of the six MAF/LD bins, and sums across bins. Error bars represent standard errors. The “Rare” bin is the sum of the MAF 0.1–1% and MAF 0.01–0.1%. “Common” is the sum of the other MAF bins. “Total” is the sum of “Rare” and “Common”. HI and LO each indicate high and low LD. All estimates were adjusted for by demographic variables and 20 PCs (half of them from rare variants) as fixed effects along with random effect of cohort except for CigDay which was adjusted for 5 common PCs to allow model convergence.
Figure 2.
Figure 2.
SNP-based heritability estimates in the European ancestry sample from sensitivity analyses Error bars represent standard errors. The figure shows SNP-based heritability estimates from different sensitivity conditions. Heritability was estimated after adjusting for 20 common and 20 rare variant PCs (“40 PCs”), 50 common and 50 rare variant PCs (“100 PCs”), after removing individuals who share IBD segments more than 2.5% of the total genome length (“long IBD”), after adjusting for the top 20 PCs from the IBD-based GRM matrix (“IBD PCs”), and after adjusting for recruitment site as a random effect (“Site”).
Figure 3.
Figure 3.
Comparison of heritability estimates between current and published studies. The figure shows SNP heritability estimates across different studies. Error bars denote standard errors. “Pedigree” and “Pedigree-FHS” refer to h^2ped from whole TOPMed pedigree samples and FHS only. “WGS_EUR” and “WGS_AFR” refer to WGS-based SNP heritability estimates in individuals of European and African ancestries. Note that WGS_AFR is based on common variants only for all phenotypes except for SmkInit which includes the contribution from MAF 0.1–1% variants additionally. “Evans_imputed” and “Liu_LDSC” each refer to SNP heritability estimates from Evans et al. (MAF:1–50%, relatedness threshold=.0279) and LDSC analysis from a recent meta-analysis of tobacco use. The red dotted line indicates the heritability estimate of smoking from a recent large meta-analysis of twin studies.

References

    1. Johnson T & Barton N Theoretical models of selection and mutation on quantitative traits. Philosophical Transactions of the Royal Society B: Biological Sciences 360, 1411–1425 (2005). - PMC - PubMed
    1. Keinan A & Clark AG Recent Explosive Human Population Growth Has Resulted in an Excess of Rare Genetic Variants. Science 336, 740–743 (2012). - PMC - PubMed
    1. Yengo L et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum Mol Genet 27, 3641–3649 (2018). - PMC - PubMed
    1. Graham SE et al. The power of genetic diversity in genome-wide association studies of lipids. Nature 600, 675–679 (2021). - PMC - PubMed
    1. Vujkovic M et al. Discovery of 318 new risk loci for type 2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis. Nat Genet 52, 680–691 (2020). - PMC - PubMed

Publication types

Grants and funding