Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2026 Jan;649(8096):406-415.
doi: 10.1038/s41586-025-09820-3. Epub 2025 Dec 10.

Mapping the genetic landscape across 14 psychiatric disorders

Andrew D Grotzinger #  1   2 Josefin Werme #  3 Wouter J Peyrot #  3   4 Oleksandr Frei #  5   6 Christiaan de Leeuw  3 Lucy K Bicks  7 Qiuyu Guo  7   8 Michael P Margolis  9   10 Brandon J Coombes  11 Anthony Batzler  11 Vanessa Pazdernik  11 Joanna M Biernacka  11   12 Ole A Andreassen  6   13 Verneri Anttila  14   15   16 Anders D Børglum  17   18   19 Gerome Breen  20 Na Cai  21   22   23   24 Ditte Demontis  17   18   19   25 Howard J Edenberg  26   27 Stephen V Faraone  28   29 Barbara Franke  30   31   32 Michael J Gandal  33   34 Joel Gelernter  35   36 Alexander S Hatoum  37 John M Hettema  38 Emma C Johnson  37 Katherine G Jonas  39 James A Knowles  40   41 Karestan C Koenen  42   43   44 Adam X Maihofer  45   46   47 Travis T Mallard  48   49   50 Manuel Mattheisen  18   51   52   53 Karen S Mitchell  54   55 Benjamin M Neale  16   56 Caroline M Nievergelt  45   46   47 John I Nurnberger  57 Kevin S O'Connell  6 Roseann E Peterson  58 Elise B Robinson  16   59 Sandra S Sanchez-Roige  46   60   61 Susan L Santangelo  62   63 Jeremiah M Scharf  50   64   65   66 Hreinn Stefansson  67 Kari Stefansson  67   68 Murray B Stein  46   69   70 Nora I Strom  51   53   71 Laura M Thornton  72 Elliot M Tucker-Drob  73   74   75 Brad Verhulst  38 Irwin D Waldman  76 G Bragi Walters  67   68 Naomi R Wray  77   78 Dongmei Yu  50   66 Anxiety Disorders Working Group of the Psychiatric Genomics ConsortiumAttention-Deficit/Hyperactivity Disorder (ADHD) Working Group of the Psychiatric Genomics ConsortiumAutism Spectrum Disorders Working Group of the Psychiatric Genomics ConsortiumBipolar Disorder Working Group of the Psychiatric Genomics ConsortiumEating Disorders Working Group of the Psychiatric Genomics ConsortiumMajor Depressive Disorder Working Group of the Psychiatric Genomics ConsortiumNicotine Dependence GenOmics (iNDiGO) ConsortiumObsessive-Compulsive Disorder and Tourette Syndrome Working Group of the Psychiatric Genomics ConsortiumPost-Traumatic Stress Disorder Working Group of the Psychiatric Genomics ConsortiumSchizophrenia Working Group of the Psychiatric Genomics ConsortiumSubstance Use Disorders Working Group of the Psychiatric Genomics ConsortiumPhil H Lee  50 Kenneth S Kendler  79   80 Jordan W Smoller  81   82   83
Collaborators, Affiliations

Mapping the genetic landscape across 14 psychiatric disorders

Andrew D Grotzinger et al. Nature. 2026 Jan.

Abstract

Psychiatric disorders display high levels of comorbidity and genetic overlap1,2, challenging current diagnostic boundaries. For disorders for which diagnostic separation has been most debated, such as schizophrenia and bipolar disorder3, genomic methods have revealed that the majority of genetic signal is shared4. While over a hundred pleiotropic loci have been identified by recent cross-disorder analyses5, the full scope of shared and disorder-specific genetic influences remains poorly defined. Here we addressed this gap by triangulating across a suite of cutting-edge statistical and functional genomic analyses applied to 14 childhood- and adult-onset psychiatric disorders (1,056,201 cases). Using genetic association data from common variants, we identified and characterized five underlying genomic factors that explained the majority of the genetic variance of the individual disorders (around 66% on average) and were associated with 238 pleiotropic loci. The two factors defined by (1) Schizophrenia and bipolar disorders (SB factor); and (2) major depression, PTSD and anxiety (Internalizing factor) showed high levels of polygenic overlap6 and local genetic correlation and very few disorder-specific loci. The genetic signal shared across all 14 disorders was enriched for broad biological processes (for example, transcriptional regulation), while more specific pathways were shared at the level of the individual factors. The shared genetic signal across the SB factor was substantially enriched in genes expressed in excitatory neurons, whereas the Internalizing factor was associated with oligodendrocyte biology. These observations may inform a more neurobiologically valid psychiatric nosology and implicate targets for therapeutic development designed to treat commonly occurring comorbid presentations.

PubMed Disclaimer

Conflict of interest statement

Competing interests: J.W.S. is a member of the scientific advisory board of Sensorium Therapeutics (with stock options) and has received grant support from Biogen. K.G.J. is a consultant for Allia Health. A.D.B. has received a speaker fee from Lundbeck. In the past year, S.V.F. received income, potential income, travel expenses continuing education support and/or research support from Aardvark, Aardwolf, AIMH, Akili, Atentiv, Axsome, Genomind, Ironshore, Johnson & Johnson/Kenvue, Kanjo, KemPharm/Corium, Noven, Otsuka, Sky Therapeutics, Sandoz, Supernus, Tris and Vallon. With his institution, S.V.F. has US patent US20130217707 A1 for the use of sodium-hydrogen exchange inhibitors in the treatment of ADHD. S.V.F. also receives royalties from books published by Guilford Press (Straight Talk about Your Child’s Mental Health), Oxford University Press (Schizophrenia: The Facts) and Elsevier (ADHD: Non-Pharmacologic Interventions) and is program director of www.ADHDEvidence.org and www.ADHDinAdults.com . The other authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Genome-wide structural models.
a, Heatmap of rgs across the 14 disorders as estimated using LDSC on the lower diagonal and the correlations among the psychiatric factors as estimated using GenomicSEM above the diagonal. Two-sided P values were derived from the Z-statistics, calculated as the point estimate of the rg divided by its s.e. Cells depicted with an asterisk reflect values that were significant at a Bonferroni-corrected threshold for multiple comparisons. Exact values are reported in Supplementary Table 1. Disorders that load on the same factor are shown in the same colour. Per the legend at the bottom of the panel, darker blue shading indicates larger, positive rgs. LDSC estimates were used as the input to genomic SEM to produce the results in b and c. b, Estimates from the five-factor model along with standard errors in parentheses. Estimates are standardized relative to SNP-based heritabilities, where this is equal to the sum of the squared factor loading (the single-headed arrow(s) from the factor to the disorder) and the residual variance (the values on the double-headed arrows on the single-colour circles with text labels that begin with u). Disorders are shown as pie charts; the proportion of residual variance is shaded in purple and the variance explained by the psychiatric factors is shaded in the colour of the corresponding factor. c, Standardized estimates from the p-factor model. The disorders are colour coded as in b, and the first-order factors (F1–F5) are also colour coded to show variance explained by the second-order p-factor in yellow.
Fig. 2
Fig. 2. External trait genetic correlations for psychiatric factors.
Point estimates for the rgs between 14 external traits and the 5 psychiatric factors from the correlated factors model and the p-factor from the hierarchical model. These traits were selected as they were significantly correlated with at least one factor at >0.35 or <−0.35. Bars depicted with a dashed outline were significant for the QTrait heterogeneity statistic, which indicates that the pattern of rgs for that trait did not fit the factor structure. Bars depicted with an asterisk reflect values that were significant at a Bonferroni-corrected threshold for multiple comparisons, that were also not significant at this same Bonferroni corrected threshold for QTrait. This is with the exception that the p-factor is depicted with an asterisk even if it is significant for the QTrait, provided that the same trait was significantly correlated with the majority (at least three) of the five other factors. The two-sided P values used to evaluate significance were derived from the Z-statistics, calculated as the point estimate of the rg divided by its s.e. Error bars are ±1.96 s.e., centred around the point estimate of the rgs. Traits are ordered by the point estimate for the p-factor. The implied sample size for the psychiatric factors was: Compulsive (nˆ = 54,100), SB (nˆ = 127,202), Neurodevelopmental (nˆ  = 84,760), Internalizing (nˆ = 1,637,337), SUD (nˆ = 313,395) and p-factor (nˆ = 2,168,621). Sample sizes for the external traits are reported in Supplementary Table 12 and exact P values are reported in Supplementary Table 13.
Fig. 3
Fig. 3. Local genetic correlations.
a, An overview of the average patterns of local rgs across the genome for all pairs of disorders, shown as a heatmap (below diagonal) and a network plot (above diagonal). The colours of the heatmap represent the average local rgs across all evaluated loci, with darker red and blue shading indicating more negative and positive rg, respectively; the dot size reflects the strengths of average associations; and the numbers indicate how many of the local rgs were significant. These results are mirrored in the network plot, where the width or the edges reflect the number of significant associations, meaning that only disorders with at least one significant local rg are connected, and the edge opacity reflects the strength of the average local rg across tested loci. Note that label colours are concordant with the genomic SEM factor structure from Fig. 1 and, as shown, disorders of similar colours also tend to be proximally located within the network. b, The local rg structure within the top rg hotspot on chromosome (chr.) 11 (112755447–114742317, GRCh37 reference genome), that is, the region where the greatest number of significant rgs were found across all disorder pairs. Here, the network plot illustrates all significant rgs detected in this region, with both edge width and opacity reflecting the strength of the association. The region plot in the middle displays the genes contained within the hotspot, and the table below shows the rg estimates (Rho), 95% confidence intervals (CIlower, CIupper), variance explained (R2) and P values for all significant pairwise local rgs in this region. Label colours are again concordant with those used for the genomic SEM factor structure in Fig. 1.
Fig. 4
Fig. 4. Locus-level results.
a, Heatmap of CC-GWAS loci below the diagonal across pairwise combinations of disorders; the darker orange shading indicates a higher number of CC-GWAS hits. CC-GWAS results are not shown for the Internalizing disorders as their rgs were too high, or for nicotine dependence as this is a continuously measured trait. Genomic SEM results (number of hits and mean χ2 for each factor and factor-specific QSNP estimate) are reported above the diagonal. Results for the p-factor are shown above the plot along with a Venn diagram of the overlap between p-factor, p-factor QSNP and overall CC-GWAS hits. The disorders are ordered and coloured according to the genomic SEM factor structure from Fig. 1. b,c, The Miami and QQ-plots for the p-factor (b) and SBs factors (c), respectively. These panels show the results for the −log10-transformed two-tailed P values for the factor on the top half of the Miami plot and the log10-transformed one-tailed P values for QSNP on the bottom half. Factor hits that were within 100 kb of univariate hits are shown as black triangles, novel hits for the factors that were not within 100 kb of a univariate or QSNP hit are shown as red triangles and QSNP hits are shown as purple diamonds. d, The two-tailed −log10[P] in a Manhattan plot for the CC-GWAS comparison across MD and SCZ, which produced the most hits (orange diamonds), as well as the scatterplot of standardized case–control effect sizes of MD (x axis) versus SCZ (y axis), with CC-GWAS significant SNPs labelled in red. For bd, the grey dashed lines indicate the significance threshold, which was defined using Bonferroni correction for multiple comparisons.
Fig. 5
Fig. 5. Functional annotation of factor variants.
a, GO enrichment analysis of predicted target genes with transdiagnostic associations (that is, variants associated with the p-factor), or those target genes associated with the SB factor that were not overlapping with p-factor target genes. Depicted −log10-transformed P values are one-sided, calculated using a χ2 test; false-discovery rate (FDR) correction was applied for multiple comparisons. b, The averaged and normalized expression levels of target genes of the indicated classes along the temporal trajectory of human brain development. Shading around the lines reflects 95% CIs. pcw10, post-conception week 10. c,d, Average log10[P] values across EWCE and MAGMA enrichment for genes associated with the indicated factors in fetal brain cell types using two independent single-cell RNA-sequencing (scRNA-seq) datasets, (c) or adult brain cell types using three independent single-nucleus RNA-seq (snRNA-seq) datasets (d). The P values from EWCE and MAGMA were two-sided and each had an FDR correction applied for multiple comparisons before averaging the two sets of results. EWCE P values were empirically derived using a permutation test; MAGMA P values were calculated using an F-test. Int, Internalizing disorders factor. The implied sample size for the three depicted psychiatric factors was: SB (nˆ = 127,202), Internalizing (nˆ = 1,637,337) and p-factor (nˆ = 2,168,621). CycProg, cycling progenitor; Endo/BBB, endothelial/blood brain barrier; ExNeu, excitatory neuron; InNeu, interneurons; IP, intermediate progenitor; OPC, oligodendrocyte progenitor cell; RG, radial glia; Astro, astrocyte; MSN, medium spiny neuron; ODC/Oligo, oligodendrocyte.
Extended Data Fig. 1
Extended Data Fig. 1. Univariate MiXeR Results.
Power curves estimating the sample size of a GWAS study are needed to saturate the yield of genome-wide significant loci. The legend shows the current effective sample size of today’s GWAS, followed by the projected effective sample size needed for the GWAS yield to saturate.
Extended Data Fig. 2
Extended Data Fig. 2. External trait genetic correlations: Comparison across psychiatric factors.
Bar graphs depict genetic correlations with the 31 complex traits for the five psychiatric factors from the correlated factors model and the second-order, p-factor from the hierarchical model. Panels are separated by the different groupings of traits (e.g., cognitive; socioeconomic). Bars depicted with a dashed outline were significant at a Bonferroni-corrected threshold for the QTrait heterogeneity metric that flags traits whose patterns of genetic correlations from LDSC do not conform to those implied by the factor model. Error bars are +/− 1.96 SE that are centred around the point estimate of the genetic correlations. Bar depicted with a * reflect values that were significant at a Bonferroni corrected threshold for multiple comparisons, that were also not significant at this same Bonferroni corrected threshold for QTrait. This is with exception of the p-factor, which is depicted with a ‘*’ even if it is significant for the QTrait, as long as that same trait was significantly correlated with the majority (at least three) of the five other factors. The two-sided P-values used to evaluate significance were derived from the Z-statistics, calculated as the point estimate of the genetic correlation divided by its standard error. Correlations are ordered according to the point estimate for the p-factor. The implied sample size for the psychiatric factors was: Compulsive (nˆ = 54,100); Schizophrenia/Bipolar (nˆ = 127,202); Neurodevelopmental (nˆ  = 84,760); Internalizing (nˆ = 1,637,337); Substance Use (nˆ = 313,395); p-factor (nˆ = 2,168,621). See Suppl. Table 12 for sample sizes for the external traits and Suppl. Table 13 for exact P-values.
Extended Data Fig. 3
Extended Data Fig. 3. External trait genetic correlations: Comparison within factors.
Bar graphs depict genetic correlations with the 31 complex traits that are ordered by magnitude within each factor for the five psychiatric factors from the correlated factors model and the second-order, p-factor from the hierarchical model. Bars depicted with a dashed outline for the QTrait heterogeneity metric. Bar depicted with a * reflect values that were significant at a Bonferroni corrected threshold for multiple comparisons, that were also not significant at this same Bonferroni corrected threshold for QTrait. This is with exception of the p-factor, which is depicted with a ‘*’ even if it is significant for the QTrait, as long as that same trait was significantly correlated with the majority (at least three) of the five other factors. The two-sided P-values used to evaluate significance were derived from the Z-statistics, calculated as the point estimate of the genetic correlation divided by its standard error. Error bars are +/− 1.96 SE that are centred around the point estimate of the genetic correlations. The implied sample size for the psychiatric factors was: Compulsive (nˆ = 54,100); Schizophrenia/Bipolar (nˆ = 127,202); Neurodevelopmental (nˆ  = 84,760); Internalizing (nˆ = 1,637,337); Substance Use (nˆ = 313,395); p-factor (nˆ = 2,168,621). See Suppl. Table 12 for sample sizes for the external traits and Suppl. Table 13 for exact P-values.
Extended Data Fig. 4
Extended Data Fig. 4. Stratified Genomic SEM results.
Bar graph depicts the enrichment results for different brain cell types, protein-truncating variant intolerant (PI) genes, and the intersection across PI genes and brain cell types. Results are shown only for the SB, Internalizing, and p-factor due to the limited signal for the other factors. Enrichment for height is depicted in purple to benchmark results and evaluate specificity in signal for the psychiatric factors relative to another human complex trait. Error bars are +/- 1.96 SE that are centred around the enrichment point estimate. Enrichment estimates that were significant at a strict Bonferroni corrected threshold for multiple comparisons are shown with a *. The one-sided P-values used to evaluate significance were derived from the Z-statistics, calculated as the enrichment point estimate divided by its standard error. Exact P-values are reported in Suppl. Table 50. The implied sample size for the psychiatric factors was: Compulsive (nˆ = 54,100); Schizophrenia/Bipolar (nˆ = 127,202); Neurodevelopmental (nˆ  = 84,760); Internalizing (nˆ = 1,637,337); Substance Use (nˆ = 313,395); p-factor (nˆ = 2,168,621).

References

    1. Kessler, R. C., Chiu, W. T., Demler, O. & Walters, E. E. Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. Arch. Gen. Psychiatry62, 617–627 (2005). - DOI - PMC - PubMed
    1. Smoller, J. W. et al. Psychiatric genetics and the structure of psychopathology. Mol. Psychiatry24, 409–420 (2019). - DOI - PMC - PubMed
    1. Möller, H.-J. Bipolar disorder and schizophrenia: distinct illnesses or a continuum? J. Clin. Psychiatry64, 23–27 (2003). discussion 28. - PubMed
    1. Bulik-Sullivan, B. B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet.47, 1236–1241 (2015). - DOI - PMC - PubMed
    1. Lee, P. H. et al. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell179, 1469–1482 (2019). - DOI - PMC - PubMed

Grants and funding