Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2025 Aug;57(8):1835-1847.
doi: 10.1038/s41588-025-02267-2. Epub 2025 Jul 28.

Large-scale genome-wide analyses of stuttering

Collaborators, Affiliations
Meta-Analysis

Large-scale genome-wide analyses of stuttering

Hannah G Polikowsky et al. Nat Genet. 2025 Aug.

Abstract

Developmental stuttering is a highly heritable, common speech condition characterized by prolongations, blocks and repetitions of speech. Although stuttering is highly heritable and enriched within families, the genetic architecture is largely understudied. We reasoned that there are both shared and distinct genetic variants impacting stuttering risk within sex and ancestry groups. To test this idea, we performed eight primary genome-wide association analyses of self-reported stuttering that were stratified by sex and ancestry, as well as secondary meta-analyses of more than one million individuals (99,776 cases and 1,023,243 controls), identifying 57 unique loci. We validated the genetic risk of self-reported stuttering in two independent datasets. We further show genetic similarity of stuttering with autism, depression and impaired musical rhythm across sexes, with follow-up analyses highlighting potentially causal relationships among these traits. Our findings provide well-powered insights into genetic factors underlying stuttering.

PubMed Disclaimer

Conflict of interest statement

Competing interests: S.M. and members of the 23andMe Research Team were or are employed by and hold stock or stock options in 23andMe, Inc. D.M.S. is currently employed by AstraZeneca. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Miami plot of EUR female and EUR male GWAS.
The EUR female association study (top panel) included 570,071 total samples (40,137 self-reported stuttering cases) and 29,449,463 autosomal variants. Nine loci reached genome-wide significance (dotted line, P < 5.00 × 10−8) through logistic regression (see Methods). The EUR male association study (bottom panel) included 374,279 total samples (38,257 self-reported stuttering cases) and 29,409,446 autosomal variants. Ten loci reached genome-wide significance (dotted line, P < 5.00 × 10−8) through logistic regression (see Methods). The x axis represents chromosome base pair coordinates in human genome build 37, and the y axis represents observed −log10(P) for each analysis. Annotated genes for each GWAS are the predicted functional gene for each locus (when available) according to the Open Targets Genetics V2G pipeline (see Methods).
Fig. 2
Fig. 2. Forest plot showing genetic correlations for stuttering and previously reported comorbid traits.
a, Ancestry-specific and sex-specific genetic correlations performed for each indicated trait with self-reported stuttering in EUR males and EUR females. Each trait is color-coded according to descriptive category (behavioral, circadian rhythm, immune, metabolic, motor, neurological, speech and language). b, Ancestry-specific and sex-specific genetic correlation estimates (EUR male and EUR female) and sex-combined, ancestry-specific (sex-combined EUR) genetic correlation estimates for beat synchronization and autism. Male-specific correlations are designated by triangles, female-specific correlations are designated by circles and sex-combined correlations are designated by squares. Data points represent the correlation coefficients; error bars, SE. Asterisks denote significant genetic correlations by LDSC. See Supplementary Table 15 for full information on traits used for analyses.
Fig. 3
Fig. 3. Forest plot showing results of Mendelian randomization analysis for stuttering and previously reported comorbid traits.
a, Debiased inverse-variance weighted Mendelian randomization analysis estimating causal inference (comorbid trait → stuttering). b, Debiased inverse-variance weighted Mendelian randomization analysis estimating causal inference (stuttering → comorbid trait). Data are represented as estimates; error bars, SE. Filled circles are statistically significant (debiased inverse-variance weighted estimator P < 3.33 × 10−3 after Bonferroni correction for testing 15 associations that were genetically correlated with stuttering). Full information on traits used for analyses can be found in Supplementary Table 15. Full results along with other Mendelian randomization methods can be found in Supplementary Table 17. ASD, autism spectrum disorder.
Fig. 4
Fig. 4. Performance of self-reported stuttering PRS model in independent EUR stuttering datasets.
PRS were developed using EUR male or EUR female GWAS results and applied to clinically validated ISP and self-report Add Health subjects, and demonstrate increased stuttering liability within stuttering cases. The model was developed and trained using the default auto-phi shrinkage parameter through PRScs. LD panels were constructed using 1000 Genomes Project phase 3 EUR reference data. a, AUC model performance in the ISP cohort. b, AUC model performance in the Add Health cohort.
Extended Data Fig. 1
Extended Data Fig. 1. Study design.
a, Overview of primary and secondary GWAS analyses. b, Flow chart of study design and downstream analyses. Primary analyses are indicated in yellow. Secondary analyses are indicated in blue. Green boxes represent analyses that consist of using data from both the primary and secondary analyses. ‘*’ indicate analyses performed in European ancestry only.
Extended Data Fig. 2
Extended Data Fig. 2. Manhattan and Q-Q plots for primary genome-wide association analyses.
Manhattan plots for genome-wide association analyses performed via logistic regression (see Methods). The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axes represent expected −log10(P-value), and the y-axes represent observed −log10(P-value) with 95% CI indicated by shaded region. a, The AFR female association study included 24,783 total samples (3,008 self-reported stuttering cases) and 29,555,214 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0165. b, The AFR male association study included 13,977 total samples (2,480 self-reported stuttering cases) and 29,443,778 autosomal variants. Three loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0156. c, The AMR female association study included 74,187 total samples (7,209 self-reported stuttering cases) and 29,814,878 autosomal variants. One locus reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0340. d, The AMR male association study included 45,206 total samples (6,346 self-reported stuttering cases) and 29,759,321 autosomal variants. One locus reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0177. e, The EAS female association study included 13,347 total samples (1,205 self-reported stuttering cases) and 10,595,969 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0081. f, The EAS male association study included 7,169 total samples (1,134 self-reported stuttering cases) and 10,586,108 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0182. g, Q-Q plot for the EUR female GWAS; λgc = 1.0855. h, Q-Q plot for the EUR male GWAS; λgc = 1.0834.
Extended Data Fig. 3
Extended Data Fig. 3. Forest plots showing significant differences in effect size heterogeneity.
Wald tests were performed to test for heterogeneity of effect estimates for the 24 sentinel variants that reached genome-wide significance in the eight ancestry- and sex-specific GWAS compared to the other seven primary GWAS. Plot displays the eight sentinel variants that demonstrated significant differences in effect size by ancestry and/or sex. Data points represent effect size +/− standard error. Each genetic ancestry is color coded. Female-specific effect estimates are shown using circles. Male-specific effect estimates are shown using triangles. Effect estimates that are not shown indicate that the variant was not found. The symbol ‘+’ represents the effect of the result in the Discovery GWAS. The symbol ‘*’ denotes significant differences in effect size estimates compared to the Discovery GWAS after Bonferroni correction. Full GWAS samples sizes can be found in Table 1.
Extended Data Fig. 4
Extended Data Fig. 4. Manhattan and Q-Q plots for the sex-combined meta-analyses.
Manhattan plots for sex-combined meta-analyses performed using the inverse variance weighted option in METAL (see Methods). The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axes represent expected −log10(P-value), and y-axes represent observed −log10(P-value) with 95% CI indicated by shaded region. a, The meta-analysis includes 944,350 total samples (78,394 self-reported stuttering cases) and 29,566,899 autosomal variants observed across EUR male and EUR female GWAS; λgc = 1.0586. b, The meta-analysis includes 119,393 total samples (13,555 self-reported stuttering cases) and 29,948,846 autosomal variants observed across AMR male and AMR female GWAS; λgc = 1.0070. c, The meta-analysis includes 20,516 total samples (2,339 self-reported stuttering cases) and 10,656,266 autosomal variants observed across EAS male and EAS female GWAS; λgc = 1.0023. d, The meta-analysis includes 38,760 total samples (5,488 self-reported stuttering cases) and 29,705,842 autosomal variants observed across AFR male and AFR female GWAS; λgc = 1.0019.
Extended Data Fig. 5
Extended Data Fig. 5. Miami plot for female-specific meta-analysis and male-specific meta-analysis.
Ancestry-combined, sex-specific meta-analyses were performed using multi-ancestry meta-regression via MR-MEGA (see Methods). a, The multi-ancestry female meta-analysis (top panel in orange) included 682,388 total samples (51,559 self-reported stuttering cases) and 41,960,804 autosomal variants (see Methods). Five loci reached genome-wide significance (dotted line P < 5 × 10−8). The multi-ancestry male meta-analysis (bottom panel in blue) included 440,631 total samples (48,217 self-reported stuttering cases) and 41,937,718 autosomal variants (see Methods). Three loci reached genome-wide significance (dotted line P < 5 × 10−8). The x-axis represents chromosome base pair coordinates in human genome build 37, and the y-axis represents observed −log10(P-value) for each GWAS. Annotated genes for each GWAS includes the predicted functional gene for each loci (when available) according to Open Targets V2G pipeline. b, Q-Q plot for the female-specific meta-analysis. c, Q-Q plot for the male-specific meta-analysis. Q-Q plot x-axes represent expected −log10(P-value), and y-axes represent observed −log10(P-value), with 95% CI indicated by shaded region. Note: P-values were adjusted for genomic control.
Extended Data Fig. 6
Extended Data Fig. 6. Manhattan and Q-Q plot for the ancestry- and sex-combined meta-analysis.
The multi-ancestry meta-regression meta-analysis performed via MR-MEGA (see Methods) included 1,023,243 total samples (99,776 self-reported stuttering cases) and 42,031,853 autosomal variants observed across all eight independent sex and ancestries-specific GWAS. The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axis represents expected −log10(P-value), and y-axis represents observed −log10(P-value) with 95% CI indicated by shaded region. Note: P-values were adjusted for genomic control.
Extended Data Fig. 7
Extended Data Fig. 7. Partitioned heritability estimates for broad functional categories.
Partitioned heritability was performed using the 52 broad functional annotations defined by Finucane et al. Enrichment is the proportion of h2/proportion of SNPs. The horizontal, dotted line indicates no enrichment. Error bars indicate enrichment estimates +/− standard error. Asterisks indicate enrichment estimates are significant based on a Bonferroni P-value of 9.6 × 10−4 (52 gene-sets). a, Broad functional categories partitioned heritability estimates in EUR female self-reported stuttering. b, Broad functional categories partitioned heritability estimates in EUR male self-reported stuttering. c, Broad functional categories partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 8–10.
Extended Data Fig. 8
Extended Data Fig. 8. Partitioned heritability estimates for brain cell types.
Partitioned heritability was performed using the three cell-type annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 1.7 × 10−2, three tests). a, Brain cell type partitioned heritability estimates in EUR female self-reported stuttering. b, Brain cell type partitioned heritability estimates in EUR male self-reported stuttering. c, Brain cell type partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.
Extended Data Fig. 9
Extended Data Fig. 9. Partitioned heritability estimates for tissue-specific gene expression.
Partitioned heritability was performed using the tissue-specific expression annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 6.25 × 10−3, eight tests). a, Tissue-specific gene expression partitioned heritability estimates in EUR female self-reported stuttering. b, Tissue-specific gene expression partitioned heritability estimates in EUR male self-reported stuttering. c, Tissue-specific gene expression partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.
Extended Data Fig. 10
Extended Data Fig. 10. Partitioned heritability estimates for tissue-specific chromatin marks.
Partitioned heritability was performed using the tissue-specific chromatin annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 2.5 × 10−3, 20 tests). a, Tissue-specific chromatin partitioned heritability estimates in EUR female self-reported stuttering. b, Tissue-specific chromatin partitioned heritability estimates in EUR male self-reported stuttering. c, Chromatin partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.

References

    1. Yairi, E. & Ambrose, N. Epidemiology of stuttering: 21st century advances. J. Fluen. Disord.38, 66–87 (2013). - PMC - PubMed
    1. Chang, S. E. et al. Stuttering: our current knowledge, research opportunities, and ways to address critical gaps. Neurobiol. Lang. (Camb.)6, nol_a_00162 (2025). - PMC - PubMed
    1. Andrews, G. & Harris, M. The Syndrome of Stuttering. Clinics in Developmental Medicine No. 17 (William Heinemann Medical Books, 1964).
    1. Månsson, H. Childhood stuttering. J. Fluen. Disord.25, 47–57 (2000).
    1. Yairi, E. & Ambrose, N. A longitudinal study of stuttering in children: a preliminary report. J. Speech Hear. Res.35, 755–760 (1992). - PubMed

Publication types

LinkOut - more resources