Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2026 Feb 27;22(2):e1012037.
doi: 10.1371/journal.pgen.1012037. eCollection 2026 Feb.

Genetic correlation-guided mega-analysis of DO mice provides mechanistic insight and candidate genes for age-related pathologies

Affiliations

Genetic correlation-guided mega-analysis of DO mice provides mechanistic insight and candidate genes for age-related pathologies

Martin N Mullis et al. PLoS Genet. .

Abstract

Diversity Outbred (DO) mice are a powerful model system for mapping complex traits due to their high genetic diversity and mapping resolution. However, while there are extensive tools available for standard genetic analysis in DO mice, fewer techniques have been implemented to facilitate integrated, cross-study analysis. Here, we implement Haseman-Elston regression to estimate genetic correlations among 7,233 phenotypes measured across eleven independent DO mouse studies. We used this network of genetic correlations to cluster phenotypes according to shared genetics, which enhanced the power to detect quantitative trait loci (QTL). This approach empowered the detection of 884 QTL for 383 meta-phenotypes, explaining an average of 40.36% of the total genetic variance per mega-analysis. We leveraged this network for insights into specific areas of biology, including lifespan, frailty, immune composition, histological and functional lung phenotypes, and histological phenotypes of the aorta. We found the genetics of lifespan to share limited correlation with the genetics of frailty but stronger correlation with the genetics of immune cell composition. Additionally, mega-analyses driven by genetic correlations identified candidate genes (e.g., Cdkn2b) associated with degraded extracellular matrix in the aorta. Finally, an ensemble of genetic analyses implicated pulmonary neuroendocrine cell signaling and/or differentiation as a key driver of multiple lung pathophenotypes.

PubMed Disclaimer

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: the research was funded by Calico Life Sciences LLC, South San Francisco, CA, where all authors were employees at the time the study was conducted. The authors declare no other competing financial interests.

Figures

Fig 1
Fig 1. Performance of Haseman-Elston regression implementation and estimation of rg among traits measured across diet and time point.
A, Comparison of h2 estimates for 36 DO mouse traits derived from our implementation of Haseman-Elston regression to those from an orthogonal method, Residual Maximum Likelihood (REML). The line of identity is denoted by a solid line; the best-fit line is denoted by a dashed line. The Pearson correlation coefficient and accompanying p-value are reported in the upper-left of the plot area. Traits are colored by type: ‘AS’ - Acoustic Startle. ‘DX’ - DEXA scan. ‘EC’ - Echocardiogram. ‘GS’ - Grip Strength. ‘RR’ - Rotarod. ‘WR’ - Wheel Running. B, Comparison of rg estimates among all pairs of 36 DO mouse traits derived from Haseman-Elston regression to those from REML. C, Mean intratrait rg,t across time points for individual traits measured in the DRiDO study. Black bars denote the average mean intratrait rg for traits of a particular type, and gray bars represent the mean intratrait rg + /- the standard error of the mean. ‘AS’ - acoustic startle. ‘BW’ - body weight. ‘CBC’ - complete blood count. ‘FACS’ - immune cell composition via fluorescence activated cell sorting. ‘PIXI’ - body composition (lean, fat, bone density). D, Mean intratrait rg,d dietary groups for individual traits measured in the DRiDO study. E, Mean intratrait rg across diet and time point for each category of trait, with accompanying standard errors. F, Mean genetic correlations among traits measured at different time points in the Dietary Restriction in DO Mice (DRiDO) study. Size of the squares are inversely proportional to the standard error of each mean genetic correlation (left). G, Mean genetic correlations among traits measured in different dietary groups in the DRiDO study.
Fig 2
Fig 2. An atlas of rg and rpg estimates among complex phenotypes in DO mice.
A, The distribution of h2, rg, and rpg, estimates for DO mouse phenotypes used in downstream analysis. The median, 25th percentile, and 75th percentile of the distribution are denoted via box and whisker plots. B, The relationship between the pairwise genetic correlation (rg) and Pearson correlation of genetic correlations (rpg) for each pair of traits in the dataset. Pearson and Spearman correlation coefficients are reported. C, A hierarchically clustered heatmap of rg (lower triangle) and rpg (upper triangle) estimates among 1,898 DO phenotypes. Phenotypes measured in individual dietary groups from the DRiDO study are excluded. The accompanying dendrogram shows euclidean distances among traits based on their rpg estimates. D, The distribution of rpg values among traits measured in the same study (intra-study, blue) or in different studies (inter-study, red). The median, 25th percentile, and 75th percentile of the distribution are denoted via box and whisker plots. E-F, Force-directed network diagrams of the 1,898 DO mouse phenotypes used in downstream analysis, with nodes corresponding to individual traits. Networks are colored by study (E) and trait type (F). Cal. Int. - Calico internal phenotypes. Sven - Svenson high fat diet study. Pazdro - Heart study. Ches - Chesler striatum study. Meta - meta analysis of lifespan. Attie - Pancreas and insulin secretion study. BW - bodyweight. DFI & BCS - Digital frailty index and body condition score. G, Plot of mean h2 ratios among clusters of traits at different levels of hierarchical clustering (k). H, The location and LOD scores of the 884 QTL detected in meta-analyses of 383 clusters of genetically correlated traits (meta-traits). I, The number of QTL detected at LOD ≥ 6 in each of the 383 meta-traits plotted against the h2 of each meta-trait. Points are colored by the percent of heritable trait variation explained by the detected QTL.
Fig 3
Fig 3. Genetic correlations involving measurements of lifespan in DO mice.
A, Heatmap of genetic correlations and heritability estimates among three independent measurements of lifespan in DO mice. The diagonal of the heatmap reports the h2 of each lifespan trait. The lower triangle of the heatmap shows rg among the lifespan traits and the upper triangle shows the rpg among lifespan traits. Asterisks denote statistically significant h2, rg, or rpg estimates. The dendrogram shows the results of hierarchical clustering of the lifespan traits based on euclidean distance derived from the rpg values among each pair of traits. Clusters of traits that result in a maximal mean h2 ratio when combined are represented by colors in the dendrogram as well as black boxes in the heatmap. B, Scatterplots of rpg estimates between lifespan traits and the 380 other non-lifespan meta-traits. C, The strongest positive and negative rpg estimates between meta-traits and combined lifespan data. Error bars correspond to the standard error of each rpg estimate. Meta-trait number (cluster number) is shown on the left hand side of the plot and a brief summary of the phenotypes included in each meta-trait is shown on the right. D, Examples of trait clusters that, when aggregated into meta-traits, have high rpg estimates with combined lifespan data. Call 53 (top) comprises midlife (year 2) traits reflecting relative abundances of neutrophils and lymphocytes in the blood and includes frailty index and body condition scores. Call 251 (bottom) comprises CD11 + B and chronic lymphoid leukemia-like (CLL) cell abundance traits from early and midlife (years 1 and 2). E, Manhattan plot of an additive whole-genome scan of a mega-analysis of lifespan consisting of data from the DRiDO, Harrison, and Shock lifespan studies. The red line indicates a genome-wide significance threshold of ⍺ = 0.05 based on 1,000 permutations of the data (see Methods). The dashed red line indicates a FDR-based significance threshold of LOD >= 6 (corresponding to an FDR of ~9%). QTL mapped at genome-wide significance in the mega-analysis or one of the individual studies comprising the analysis are denoted by asterisks. F, Volcano plot depicting the correlation coefficient and -log10(p) of haplotype effects at lifespan loci with overlapping QTL from meta-analyses of other trait clusters. G, Expression of genes within the 2LOD support interval of the lifespan QTL on chr. 18 in human Natural Killer (NK) cells. H, Expression of genes within the 2LOD support interval of the lifespan QTL on chr. 12 in human NK, B, and T cells.
Fig 4
Fig 4. Genetic correlations among frailty and lifespan phenotypes.
A, Pairwise (rg, bottom triangle) and Pearson (rpg, upper triangle), genetic correlations among Manual Frailty Index (MFI), Digital Frailty Index (DFI), and lifespan. Statistically significant genetic correlations are denoted with an asterisk. Box color represents the magnitude of the genetic correlation and box size is inversely correlated with the size of the standard error. B-C, Heritability and standard error of individual (B) and aggregate (C) frailty phenotypes. The dashed red line at h2 = 0.1 is the threshold used to determine if phenotypes were included in downstream analysis. D, Hierarchically clustered heatmap showing rg (lower triangle) and rpg (upper triangle) among frailty phenotypes. The accompanying dendrogram shows euclidean distances among traits based on their rpg estimates. Clusters of frailty traits (F1 - F10) that result in a maximal mean h2 ratio when combined are represented by colors in the dendrogram as well as black boxes in the heatmap. rg and rpg among frailty and unclustered lifespan phenotypes are shown in the adjacent panels of the heatmap. E, Heritability and standard error of the meta-traits comprising clusters of genetically correlated frailty phenotypes. F, rpg and standard error among frailty meta-traits and a meta-analysis of lifespan spanning three studies.
Fig 5
Fig 5. Genetic correlations among extracellular matrix traits in the aorta.
A, Hierarchically clustered heatmap showing rg (lower triangle) and rpg (upper triangle) among aorta phenotypes. The accompanying dendrogram shows euclidean distances among traits based on their rpg estimates. Clusters of aorta phenotypes that result in a maximal mean h2 ratio when combined are represented by colors in the dendrogram as well as black boxes in the heatmap. B, Manhattan plot of an additive whole-genome scan on aggregated aortic elastin thickness and pct. area phenotypes. The red line indicates a genome-wide significance threshold of ⍺ = 0.05 based on 1,000 permutations of the data (see Methods). The dashed red line indicates a nominal significance threshold of LOD >= 6. C, Manhattan plot of an additive whole-genome scan on aggregated aortic elastin breakage phenotypes. D, Manhattan plot of an additive whole-genome scan on aggregated aortic elastin layers phenotypes. E–I, Correlations of haplotype effects between the peak markers at QTL influencing elastin phenotypes and cis-eQTL in an external DO cardiac tissue transcriptomic dataset. Candidate genes are highlighted with red text.
Fig 6
Fig 6. Genetic correlations among lung structural and functional phenotypes.
A, Hierarchically clustered heatmap showing rg (lower triangle) and rpg (upper triangle) among lung phenotypes. The accompanying dendrogram shows euclidean distances among traits based on their rpg estimates. Clusters of lung phenotypes that result in a maximal mean h2 ratio when combined are represented by colors in the dendrogram as well as black boxes in the heatmap. B, Manhattan plot of an additive whole-genome scan on aggregated fibrosis volume phenotypes. The red line indicates a genome-wide significance threshold of ⍺ = 0.05 based on 1,000 permutations of the data (see Methods). The dashed red line indicates a nominal significance threshold of LOD >= 6. C–E, Allelic effects (BLUPs) and corresponding standard errors of the QTL influencing fibrosis volume in the lung (left), and variant association mapping of the 2 LOD support intervals around each QTL (right). Genes within the confidence intervals are shown in the bottom panel, and candidate genes are shown above variant association results. F, Manhattan plot of an additive whole-genome scan on aggregated fibrosis intensity phenotypes. G–I, Allelic effects (BLUPs) and corresponding standard errors of the QTL influencing fibrosis intensity in the lung (left), and variant association mapping of the 2 LOD support intervals around each QTL (right). J, Model depicting the cellular functions of pulmonary neuroendocrine cells (PNECs) and showcasing the roles of candidate genes may play in contributing to fibrotic µCT readouts in the lung. Candidate genes are colored by the whole-genome scan in which they were detected.

References

    1. Churchill GA, Gatti DM, Munger SC, Svenson KL. The Diversity Outbred mouse population. Mamm Genome. 2012;23(9–10):713–8. doi: 10.1007/s00335-012-9414-2 - DOI - PMC - PubMed
    1. Gatti DM, Svenson KL, Shabalin A, Wu L-Y, Valdar W, Simecek P, et al. Quantitative trait locus mapping methods for diversity outbred mice. G3 (Bethesda). 2014;4(9):1623–33. doi: 10.1534/g3.114.013748 - DOI - PMC - PubMed
    1. Svenson KL, Gatti DM, Valdar W, Welsh CE, Cheng R, Chesler EJ, et al. High-resolution genetic mapping using the Mouse Diversity outbred population. Genetics. 2012;190(2):437–47. doi: 10.1534/genetics.111.132597 - DOI - PMC - PubMed
    1. Nadon NL, Strong R, Miller RA, Harrison DE. NIA Interventions Testing Program: Investigating Putative Aging Intervention Agents in a Genetically Heterogeneous Mouse Model. EBioMedicine. 2017;21:3–4. doi: 10.1016/j.ebiom.2016.11.038 - DOI - PMC - PubMed
    1. Vincent M, Gerdes Gyuricza I, Keele GR, Gatti DM, Keller MP, Broman KW, et al. QTLViewer: an interactive webtool for genetic analysis in the Collaborative Cross and Diversity Outbred mouse populations. G3 (Bethesda). 2022;12(8):jkac146. doi: 10.1093/g3journal/jkac146 - DOI - PMC - PubMed

LinkOut - more resources