Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Oct 16;16(1):9194.
doi: 10.1038/s41467-025-64236-x.

Genetic determinants and genomic consequences of non-leukemogenic somatic point mutations

Affiliations

Genetic determinants and genomic consequences of non-leukemogenic somatic point mutations

Joshua S Weinstock et al. Nat Commun. .

Abstract

Clonal hematopoiesis (CH) is defined by the expansion of a lineage of genetically identical cells in blood. Genetic lesions that confer a fitness advantage, such as leukemogenic point mutations or mosaic chromosomal alterations (mCAs), are frequent mediators of CH. However, recent analyses of both single cell-derived colonies of hematopoietic cells and population sequencing cohorts have revealed CH frequently occurs in the absence of known driver genetic lesions. To characterize CH without known driver genetic lesions, we use 51,399 deeply sequenced whole genomes from the NHLBI TOPMed sequencing initiative to perform simultaneous germline and somatic mutation analyses among individuals without leukemogenic point mutations (LPM), which we term CH-LPMneg. We quantify CH by estimating the total mutation burden. Because estimating somatic mutation burden without a paired-tissue sample is challenging, we develop a novel statistical method, the Genomic and Epigenomic informed Mutation (GEM) rate, that uses external genomic and epigenomic data sources to distinguish artifactual signals from true somatic mutations. We perform a genome-wide association study of GEM to discover the germline determinants of CH-LPMneg. We identify seven genes associated with CH-LPMneg (TCL1A, TERT, SMC4, NRIP1, PRDM16, MSRA, SCARB1).Functional analyses of SMC4 and NRIP1 implicated altered hematopoietic stem cell self-renewal and proliferation as the primary mediator of mutation burden in blood. We then perform comprehensive multi-tissue transcriptomic analyses, finding that the expression levels of 404 genes are associated with GEM. Finally, we perform phenotypic association meta-analyses across four cohorts, finding that GEM is associated with increased white blood cell count, but is not significantly associated with incident stroke or coronary disease events. Overall, we develop GEM for quantifying mutation burden from WGS and use GEM to discover the genetic, genomic, and phenotypic correlates of CH-LPMneg.

PubMed Disclaimer

Conflict of interest statement

Competing interests: L.M.R. is a consultant for the TOPMed Administrative Coordinating Center (through Westat). B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. J.Y. reports grant support from Bayer. M.C. reports grant support from Bayer and GSK, Consulting and speaking fees from Illumina and AstraZeneca. A.G.B., P.N., and S.J. are cofounders, equity holders, and on the scientific advisory board of TenSixteen Bio. G.R.A. is an employee of Regeneron Pharmaceuticals and receives a salary, stock and stock options as compensation. A.B. is a co-founder and equity holder of CellCipher, Inc., a stockholder in Alphabet, Inc. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Study overview.
Schematic, describing the development of GEM (top panel), the use of GEM to discover the genetic determinants of mutation burden in blood (middle panel), and the use of GEM to identify the transcriptomic and clinical correlates of mutation burden in blood (bottom panel).
Fig. 2
Fig. 2. Development of GEM.
A The Spearman correlation between mutation burden and chronological age stratified by chromHMM annotations in CD34 + cells. B The Spearman correlation between mutation burden and chronological age stratified by functional consequence as annotated by the variant effect predictor (VEP). C The Spearman correlation between mutation burden and chronological age, stratified by quintiles of CADD scores. D Plate annotation for the GEM statistical model. θ_0 and are intercepts; θ_1reflects the association between the log2-transformed value of ∑z_ij and chronological age Y_i; z_ij denotes the probability that the jth mutation in the ith individual is a true somatic mutation. X is a matrix of annotations.
Fig. 3
Fig. 3. The genetic determinants of GEM.
A The GWAS of GEM. Summary statistics were estimated with SAIGE, λ^gc = 1.0. B Fine-mapping of the TCL1A locus. Note rs11846938 is 10 bp from rs2887399. Fine-mapping was performed with SuSIE.
Fig. 4
Fig. 4. The consequence of SMC4 and NRIP1 knock down on HSC self-renewal.
A SMC4 and NRIP1 were knocked-down with shRNA, and the proportion of CD34 + cells was quantified with FACs. Quantities were compared referent to a non-targeting control. B proportion of CD34 + CD38- was quantified with FACS. C Number of colonies formed in a colony-forming unit (CFU) assay. D Number of colonies in a burst-forming unit assay. A linear model was used to estimate effect sizes across technical replicates (n = 4). Error bars indicate 95% confidence intervals.
Fig. 5
Fig. 5. The sex specific genetic determinants of mutation burden.
A Regressions were performed for each quantile-transformed somatic principal component (sPC) on study and sex as covariates. The partial variance explained by sex is displayed on the y-axis. B Circular Manhattan plot. The outer-most ring is the GWAS of GEM on all individuals, the middle ring is the GWAS of GEM on males (λ^gc = 0.99), and the inner-most ring is the GWAS of GEM in females (λ^gc = 0.99). Inset, a scatter plot of the two sex-specific GWAS plotting all SNPs with p-values < 1 × 10-8 in either GWAS. Asymptotic confidence intervals are plotted with a width corresponding to genome-wide significance.
Fig. 6
Fig. 6. The transcriptomic correlates of GEM.
A Association analyses were performed between GEM and gene expression in whole blood, including age, sex, genotype PCs 1-5, and expression PCs 1-20 as covariates. B Enrichment analyses were performed using pathfindR and KEGG pathways as reference. C Association statistics among CHIP GWAS genes in whole blood. D Association statistics among mCA GWAS genes in whole blood. Analyses were performed using 2248 samples in whole blood, 1215 for PBMC, 347 for T cells, 333 for monocytes, and 323 for nasal epithelial.
Fig. 7
Fig. 7. The phenotype correlates of GEM.
A Cox proportional-hazard regressions were performed, regressing incident events on a spline of age, sex, smoking status, and germline PCs. Individuals with prevalent disease were excluded. CAD coronary artery disease, PAD peripheral artery disease, CABG coronary artery bypass graft, MI myocardial infarction. CAD events were defined as at least one of an MI, CABG, angina, or angioplasty during the follow-up period. A random effects meta-analysis was performed. GEM was inverse normal transformed. Sex was excluded from the WHI regression, and smoking was excluded from the COPD regression. B A linear regression of the inverse normal transformed biomarker, including a spline of age, sex, smoking status, and germline PCs as covariates. GEM was inverse normal transformed. Sex was excluded from the WHI regression, and smoking was excluding from the COPD regression.

Update of

  • The Genetic Determinants and Genomic Consequences of Non-Leukemogenic Somatic Point Mutations.
    Weinstock JS, Chaudhry SA, Ioannou M, Viskadourou M, Reventun P, Jakubek YA, Liggett LA, Laurie C, Broome JG, Khan A, Taylor KD, Guo X, Peyser PA, Boerwinkle E, Chami N, Kenny EE, Loos RJ, Psaty BM, Russell TP, Brody JA, Yun JH, Cho MH, Vasan RS, Kardia SL, Smith JA, Raffield LM, Bidulescu A, O'Brien E, de Andrade M, Rotter JI, Rich SS, Tracy RP, Chen YI, Gu CC, Hsiung CA, Kooperberg C, Haring B, Nassir R, Mathias R, Reiner A, Sankaran V, Lowenstein CJ, Blackwell TW, Abecasis GR, Smith AV, Kang HM, Natarajan P, Jaiswal S, Bick A, Post WS, Scheet P, Auer P, Karantanos T, Battle A, Arvanitis M. Weinstock JS, et al. medRxiv [Preprint]. 2024 Aug 26:2024.08.22.24312319. doi: 10.1101/2024.08.22.24312319. medRxiv. 2024. Update in: Nat Commun. 2025 Oct 16;16(1):9194. doi: 10.1038/s41467-025-64236-x. PMID: 39228737 Free PMC article. Updated. Preprint.

References

    1. Jaiswal, S. et al. Age-related clonal hematopoiesis associated with adverse outcomes. N. Engl. J. Med26, 2488–2498 (2014). - PMC - PubMed
    1. Genovese, G. et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl. J. Med.371, 2477–2487 (2014). - PMC - PubMed
    1. Xie, M. et al. Age-related mutations associated with clonal hematopoietic expansion and malignancies. Nat. Med.20, 1472–1478 (2014). - PMC - PubMed
    1. Jaiswal, S. et al. Clonal hematopoiesis and risk of atherosclerotic cardiovascular disease. N. Engl. J. Med. 377, 111–121 (2017). - PMC - PubMed
    1. Desai, P. et al. Somatic mutations precede acute myeloid leukemia years before diagnosis. Nat. Med.24, 1015–1023 (2018). - PMC - PubMed

LinkOut - more resources