Abstract
Developmental stuttering is a highly heritable, common speech condition characterized by prolongations, blocks and repetitions of speech. Although stuttering is highly heritable and enriched within families, the genetic architecture is largely understudied. We reasoned that there are both shared and distinct genetic variants impacting stuttering risk within sex and ancestry groups. To test this idea, we performed eight primary genome-wide association analyses of self-reported stuttering that were stratified by sex and ancestry, as well as secondary meta-analyses of more than one million individuals (99,776 cases and 1,023,243 controls), identifying 57 unique loci. We validated the genetic risk of self-reported stuttering in two independent datasets. We further show genetic similarity of stuttering with autism, depression and impaired musical rhythm across sexes, with follow-up analyses highlighting potentially causal relationships among these traits. Our findings provide well-powered insights into genetic factors underlying stuttering.
Similar content being viewed by others
Main
Developmental stuttering is the most common fluency disorder, with more than 400 million people affected worldwide and a lifetime prevalence of 5–8% among global populations1. Stuttering is characterized, in part, by syllable and word repetitions, sound prolongations and involuntary breaks between words, called blocks, that disrupt the forward movement of speech2. The onset of developmental stuttering typically occurs during childhood between ages 2 and 5, and an estimated 80% of children who stutter will spontaneously recover, with or without the aid of speech therapy3,4,5. Although many individuals who stutter seek therapies, including speech interventions, behavior modification, cognitive interventions and technology-based feedback interventions to manage various aspects of stuttering6, outside of spontaneous recovery, stuttering does not have a known cure.
For those who experience persistent stuttering into adolescence and adulthood, the impact can be profound and life-long. People who stutter often describe negative perceptions of identity and self-worth, and reduced overall quality of life7. Young people who stutter experience increased bullying and decreased classroom participation, and they report a more negative educational experience; stuttering is associated with depression and suicidal ideation in this population7,8,9. For adults, stuttering can negatively impact employability, perceived job performance, socioeconomic status and mental and social wellbeing7,8,10,11.
Developmental stuttering is also sexually dimorphic. At stuttering onset, the male-to-female ratio is approximately even (between 1:1 and 2:1)4, but stuttering is substantially more common in males than females in adolescent and adult populations (approximately 4:1)1,12 owing to differences in the rate of recovery by sex13,14. However, the mechanisms leading to differences in prevalence and the rate of recovery by sex are not known.
Studies of stuttering within families, twins and population isolates provide overwhelming evidence for a strong genetic influence on stuttering risk, with heritability estimates ranging from 0.42 to 0.84 (refs. 15,16,17,18,19,20,21,22,23,24,25,26). To date, family studies have identified six candidate causal stuttering genes: GNPTAB, GNPTG and NAGPA19,27; DRD2 (ref. 20); AP4E1 (ref. 24); and CYP17A1 (ref. 22); however, these findings have not replicated in other families and explain little of the genetic heritability of stuttering in populations28,29. Two prior studies identified two genome-wide significant loci that confer stuttering risk at the population level30,31. These previous investigations, leveraging both family data and global outbred populations, demonstrated that stuttering genetic risk factors are complex and involve both rare, familial and common variation. However, larger sample sizes are needed to elucidate ancestry-specific genetic risk factors for this common complex trait, especially to examine sexual dimorphism. Furthermore, models that leverage genetic risk markers (that is, polygenic risk scores (PRS), genetic correlation analysis and causal inference models) may illuminate the broader clinical impact of the genetic risk of stuttering.
Here, to identify both shared and distinct signals impacting stuttering risk, we report the results of eight primary ancestry-specific and sex-specific genome-wide association studies (GWAS) of stuttering and secondary meta-analyses in samples totaling more than 1.1 million individuals (99,776 cases). These analyses are well-powered to detect stuttering risk alleles with modest effect size and explore shared and distinct genetic effects across genetic ancestry-specific and sex-specific groups through cross-strata look-ups and meta-analyses.
Overall, this study reveals the complex genetic architecture of stuttering, identifying 24 signals in the primary ancestry-specific and sex-specific analyses and 63 signals in secondary meta-analyses for self-reported stuttering, mapping to 57 distinct loci. These loci met a two-tiered multiple test correction threshold. First, we applied a false discovery rate threshold of 5% across all eight primary analyses, and second, a traditional P value threshold of 5 × 10−8. We validated the observed genetic effects in two independent datasets, including an international clinically ascertained stuttering cohort called the International Stuttering Project (ISP)30 and a cohort of self-reported stuttering in the National Longitudinal Study of Adolescent to Adult Health (Add Health)32. We then leveraged our results to explore genetic correlations between stuttering and its comorbidities. Together, these advances inform our understanding of the molecular etiology of stuttering and lay groundwork for the future of precision care in developmental speech disorders.
Results
Study overview
We performed eight primary non-overlapping GWAS of self-reported stuttering that were stratified by sex and genetic ancestry in samples from 23andMe, Inc. Stuttering status was based on 23andMe participant responses to the survey question ‘Have you ever had a stammer or stutter?’ The decision to stratify by sex and genetic ancestry group was driven by several factors (see the Supplementary Note for a detailed description of how genetic ancestry and sex groups were defined). In addition to these primary analyses, we report secondary results of meta-analyses across sex by ancestry, meta-analyses across ancestry by sex and a meta-analysis of all eight primary GWAS (see Extended Data Fig. 1 for an overview of analyses).
GWAS design
In aggregate, the datasets included 99,776 participants responding ‘yes’ to the question ‘Have you ever had a stammer or stutter?’ (cases, 48,217 males; 51,559 females) and 1,023,243 participants responding ‘no’ (controls, 392,414 males; 630,829 females; Table 1). Overall, the study cohort is 60.76% female. Male stuttering prevalence in this dataset is 12.29% and female stuttering prevalence is 8.17%. Distribution by age group is shown in Supplementary Table 1, with age defined as the current age at the time the analysis was conducted. We divided our sample broadly into four continental ancestry groups: African ancestry (AFR), East Asian ancestry (EAS), European ancestry (EUR) and Latino/Admixed American ancestry (AMR), which were defined through genetic analysis33 (see Supplementary Table 1 for sample sizes by ancestries and Supplementary Note for description of genetic ancestry determination).
Genetic correlations across ancestry-specific and sex-specific GWAS
In the primary analyses of each ancestry-specific and sex-specific GWAS (eight total), we considered autosomal and X chromosome variants that were successfully imputed across all platforms and reached our quality control thresholds (see Methods). We estimated the genetic correlation between the EUR male and EUR female analyses and EAS male and EAS female analyses using linkage disequilibrium score regression (LDSC)34,35 (Supplementary Table 2). Statistically significant genetic correlation between sexes was only observed for the EUR population (rg = 0.8952, P < 10−50). Despite this finding, we observed no overlap in genome-wide significant hits between the EUR male and EUR female analyses (Table 2). Comparisons of results across ancestry through concordance analysis can be found in the Supplementary Note.
Ancestry- and sex-specific GWAS
From the eight genetic ancestry-specific and sex-specific GWAS, we identified 24 loci (Supplementary Figs. 1–24) associated with stuttering that surpassed a false discovery rate threshold of 5% across all eight primary analyses as well as a conventional genome-wide significance threshold of P < 5 × 10−8. The identified loci included nine loci from the EUR female study, ten loci from the EUR male study, three loci from the AFR male study and one locus each from the AMR female and AMR male studies (Fig. 1, Table 2 and Extended Data Fig. 2a–h). No loci reached genome-wide significance in the AFR female GWAS, the EAS female GWAS or the EAS male GWAS. Sentinel hits from the EUR male GWAS implicated VRK2, CAMTA1, MYO16, MMAB, CTNND2, SEMA6D, IRS2 and COL14A1 as the most likely impacted functional genes (Table 2 and Supplementary Figs. 1–10). Sentinel hits from the EUR female GWAS implicated SLC39A8, DCC, SRPK2, NMUR2, TSHZ2 and MITF as the most likely impacted functional genes (Table 2 and Supplementary Figs. 11–19). Sentinel hits from the AFR male GWAS implicated PTPRQ, SHISA2 and CYTH4 as likely functional genes; AMR female GWAS implicated SORCS1; and AMR male GWAS implicated RGCC (Table 2 and Supplementary Figs. 20–24). Across all eight ancestry-specific and sex-specific GWAS, no loci on the X chromosome surpassed genome-wide significance. We also present findings from secondary sex-combined meta-analyses, ancestry-combined meta-analyses and a meta-analysis of all sex and ancestry groups (see Extended Data Fig. 1 for study overview and Extended Data Fig. 3, Methods and Supplementary Note for heterogeneity of effect size estimates in the eight primary GWAS).
The EUR female association study (top panel) included 570,071 total samples (40,137 self-reported stuttering cases) and 29,449,463 autosomal variants. Nine loci reached genome-wide significance (dotted line, P < 5.00 × 10−8) through logistic regression (see Methods). The EUR male association study (bottom panel) included 374,279 total samples (38,257 self-reported stuttering cases) and 29,409,446 autosomal variants. Ten loci reached genome-wide significance (dotted line, P < 5.00 × 10−8) through logistic regression (see Methods). The x axis represents chromosome base pair coordinates in human genome build 37, and the y axis represents observed −log10(P) for each analysis. Annotated genes for each GWAS are the predicted functional gene for each locus (when available) according to the Open Targets Genetics V2G pipeline (see Methods).
Sex-combined, ancestry-specific meta-analyses
To better understand ancestry-specific effects across sexes, we performed sex-combined, ancestry-specific meta-analyses in METAL36. We identified 36 loci associated with stuttering at the conventional significance threshold of P < 5 × 10−8 (Supplementary Figs. 25–60). Specifically, 28 loci were associated with stuttering in the sex-combined EUR analysis (Table 3, Extended Data Fig. 4a and Supplementary Figs. 25–52), seven loci were associated with stuttering in the sex-combined AMR analysis (Table 3, Extended Data Fig. 4b and Supplementary Figs. 53–59) and one locus was associated with stuttering in the sex-combined EAS analysis (Table 3, Extended Data Fig. 4c and Supplementary Fig. 60). No loci reached the conventional significance threshold in the sex-combined AFR analysis (Extended Data Fig. 4d).
Sentinel hits from the sex-combined EUR meta-analysis implicated SLC39A8, VRK2, CAMTA1, LRP1B, PTBP2, GRM5, CTNND2, CREB3L4, SEMA6D, KCTD10, UBAP2, LYSMD4, MAP2K6, ARMC2, SOX5, ZNF567, KCNH8, ADGRL3, TMEM71, SGCD, MET and SRPK2 as the most likely impacted functional genes (Table 3). Sentinel hits from the sex-combined AMR meta-analysis implicated LRRC69, PACRGL, CLRN2 and C16orf95 as the most likely impacted functional genes (Table 3). Sentinel hits from the sex-combined EAS meta-analysis implicated ADCY5 as a likely functional gene (Table 3).
Ancestry-combined, sex-specific meta-analyses
We also aggregated association summary statistics by sex across ancestry groups by multi-ancestry meta-regression, using MR-MEGA37, and identified five loci in our female-specific meta-analysis (Table 4, Extended Data Fig. 5a,b and Supplementary Figs. 61–65) and three loci in our male-specific meta-analysis (Table 4, Extended Data Fig. 5a,c and Supplementary Figs. 66–68).
Sentinel hits from the female-specific meta-analysis implicated SRPK2, ZFP64, CBLN4 and DCC as the most likely impacted functional genes (Table 4). Sentinel hits from the male-specific meta-analysis implicated CAMTA1, VRK2 and KCTD10 as the most likely impacted functional genes (Table 4).
Ancestry-combined and sex-combined meta-analysis
Last, we performed an ancestry-combined and sex-combined meta-analysis, using all ancestry-specific and sex-specific preliminary GWAS analyses, through multi-ancestry meta-regression, again using MR-MEGA37. This analysis identified 19 loci reaching the conventional significance threshold (Table 4, Extended Data Fig. 6 and Supplementary Figs. 69–87). Sentinel hits from the ancestry-combined and sex-combined meta-analysis were mapped to SLC39A8, CAMTA1, CBLN4, VRK2, CREB3L4, KCTD10, CTSC, SOX5 and SCN8A as the most likely impacted functional genes (Table 4).
From these sentinel hits, we report 57 distinct loci (defined as non-overlapping within ±1 Mb of the sentinel single-nucleotide polymorphism (SNP)) associated with self-reported stuttering across the primary ancestry-specific and sex-specific GWAS and the secondary sex-combined, ancestry-specific meta-analyses, ancestry-combined, sex-specific meta-analyses, and ancestry-combined, sex-combined meta-analysis. However, sex-specific participation bias can result in spurious autosomal associations38. Hence, we compared the 57 stuttering-associated loci to 158 autosomal loci spuriously associated with sex in prior work38. We found six of the 57 stuttering-associated loci were located within the SNP windows for loci previously associated with sex-specific participation bias (Table 4). For consistency of association between non-overlapping samples within this study as well as with previously identified stuttering loci in the literature, please see the Supplementary Note.
Genetic heritability
We calculated SNP-based liability-scale heritability for our male and female EUR studies using LDSC34,35. As with the genetic correlation analyses, for AFR and AMR, LDSC and other common approaches are biased in the presence of admixture. For EAS, the sample size prevented reliable heritability estimates39. Hence, heritability analyses were not performed for the AFR, AMR and EAS ancestries. Liability-scaled heritability was estimated to be 9.11% (SE = 0.0054) for EUR females and 9.62% (SE = 0.0052) for EUR males, assuming a stuttering population prevalence of 10%. LDSC intercept and observed scale heritability estimates are reported in Supplementary Table 7.
Partitioned SNP-based heritability of stuttering by broad functional annotation35 showed significant enrichments of conserved regions, as well as a chromatin mark for enhancers, H3K4me1, in EUR male, EUR female and sex-combined EUR stuttering results (Extended Data Fig. 7 and Supplementary Tables 8–10). EUR male and sex-combined EUR stuttering results were enriched for weak enhancers, repressed marks and a marker for active chromatin, H3K9ac (Extended Data Fig. 7 and Supplementary Tables 8–10; P < 9.6 × 10−4). Sex-combined EUR stuttering results were enriched for fetal and adult DNase hypersensitive sites, introns and marks of active chromatin sites H3K27ac and H3K4me3 (Extended Data Fig. 7 and Supplementary Table 8; P < 9.60 × 10−4).
Furthermore, we used LDSC to explore whether genes expressed in specific cell or tissue types are enriched for stuttering-associated variants40. For brain cell types, we found that our EUR female and sex-combined EUR stuttering results were enriched for neurons (Extended Data Fig. 8 and Supplementary Tables 11 and 12; P < 0.017). We then tested for enrichment of effects of stuttering-associated variants in brain tissues previously associated with stuttering in imaging studies (full results shown in Extended Data Fig. 9, Supplementary Tables 12 and 13 and Supplementary Note)41,42,43,44,45,46,47,48, and enrichment was further investigated by examining tissue-specific annotations for active chromatin and enhancers (specifically, known histone marks H3K27ac, H3K9ac, H3K4me1, H3K4me3 and H3K36me3) (see Extended Data Fig. 10, Supplementary Tables 11–13 and Supplementary Note for full results).
Genetic correlation
To explore whether the genetic architecture of stuttering is shared with previously identified comorbidities, we performed genetic correlation analyses with traits associated with stuttering in the literature (Supplementary Tables 14 and 15)49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76. For traits with sex-stratified summary statistics, we observed significant positive genetic correlations, after Bonferroni correction, within both our EUR male and EUR female studies for depression (EUR male, P = 6.82 × 10−5; EUR female, P = 4.53 × 10−8; Fig. 2a and Supplementary Table 16). We also observed a number of significant positive genetic correlations in EUR females only: for hearing loss (P = 6.50 × 10−6), asthma (P = 2.91 × 10−8), daytime sleepiness (P = 6.96 × 10−8), attention deficit hyperactivity disorder (ADHD) (P = 4.66 × 10−6) and body mass index (BMI) (P = 4.08 × 10−17) as well as a significant negative genetic correlation for frequency of alcohol consumption (P = 5.0 × 10−4) and walking pace (P = 8.81 × 10−11; Fig. 2a and Supplementary Table 16). No traits were significantly genetically correlated with stuttering in EUR males exclusively.
a, Ancestry-specific and sex-specific genetic correlations performed for each indicated trait with self-reported stuttering in EUR males and EUR females. Each trait is color-coded according to descriptive category (behavioral, circadian rhythm, immune, metabolic, motor, neurological, speech and language). b, Ancestry-specific and sex-specific genetic correlation estimates (EUR male and EUR female) and sex-combined, ancestry-specific (sex-combined EUR) genetic correlation estimates for beat synchronization and autism. Male-specific correlations are designated by triangles, female-specific correlations are designated by circles and sex-combined correlations are designated by squares. Data points represent the correlation coefficients; error bars, SE. Asterisks denote significant genetic correlations by LDSC. See Supplementary Table 15 for full information on traits used for analyses.
For traits without sex-stratified summary statistics, we observed significant positive genetic correlations, after Bonferroni correction, with autism (EUR male, P = 1.11 × 10−6; EUR female, P = 2.05 × 10−11; sex-combined EUR, P = 4.51 × 10−13) and significant negative genetic correlations with beat synchronization (EUR male, P = 4.00 × 10−4; EUR female, P = 3.33 × 10−7; sex-combined EUR, P = 1.13 × 10−7; Fig. 2b and Supplementary Table 16).
Mendelian randomization
We performed debiased inverse-variance weighted Mendelian randomization analyses, which handle balanced horizontal pleiotropy and allow for weak instruments77,78, to assess the causal relationships between stuttering and previously reported co-occurring traits with significant genetic correlations. Analyses were performed for 11 sex-specific traits and two sex-combined traits genetically correlated with stuttering (after Bonferroni correction) to estimate causal effects between these traits and the self-reported stuttering phenotype captured in our EUR male GWAS, EUR female GWAS or sex-combined EUR meta-analysis. We estimated a significant causal effect of slower walking pace, autism and impaired rhythm on EUR female stuttering (Fig. 3a and Supplementary Table 17). Additionally, we found evidence of bi-directional causal relationships between both higher BMI and depression and EUR female stuttering (Fig. 3a,b and Supplementary Table 17). We also observed a bi-directional relationship between impaired beat synchronization and stuttering in the EUR male and sex-combined EUR analyses (Fig. 3a,b and Supplementary Table 17). Furthermore, we observed a bi-directional relationship between autism and stuttering in the sex-combined EUR analysis (Fig. 3b and Supplementary Table 17). Lastly, in EUR females, we observed significant causal effect estimates of female stuttering on ADHD risk (Fig. 3b and Supplementary Table 17). Results of other Mendelian randomization methods can be found in Supplementary Table 17.
a, Debiased inverse-variance weighted Mendelian randomization analysis estimating causal inference (comorbid trait → stuttering). b, Debiased inverse-variance weighted Mendelian randomization analysis estimating causal inference (stuttering → comorbid trait). Data are represented as estimates; error bars, SE. Filled circles are statistically significant (debiased inverse-variance weighted estimator P < 3.33 × 10−3 after Bonferroni correction for testing 15 associations that were genetically correlated with stuttering). Full information on traits used for analyses can be found in Supplementary Table 15. Full results along with other Mendelian randomization methods can be found in Supplementary Table 17. ASD, autism spectrum disorder.
PRS analyses
Stuttering PRS were derived from our EUR female and EUR male GWAS, and applied to EUR participants in two independent studies of developmental stuttering: ISP (893 EUR cases and 6,052 EUR controls)30 and Add Health (588 EUR cases and 6,621 EUR controls)32. Overall, male-specific PRS models out-performed female-specific PRS models (Fig. 4). In particular, the male-specific PRS model derived from the EUR male GWAS demonstrated significant predictive value for both male and female EUR in the ISP (area under the curve (AUC), 0.6108 for male, 0.6065 for female; Fig. 4a and Supplementary Fig. 88) and Add Health (AUC, 0.5373 for male, 0.5529 for female; Fig. 4b and Supplementary Fig. 88). PRS values for cases and controls within the ISP cohort and Add Health study participants can be found in Supplementary Table 18. The lack of large genetic data resources from clinically ascertained participants matching other ancestries prevented validation of PRS derived from non-EUR analyses; however, despite low power, we present preliminary results of cross-ancestry testing of the EUR male and EUR female PRS models in AFR ISP and AFR Add Health study participants (Supplementary Fig. 89).
PRS were developed using EUR male or EUR female GWAS results and applied to clinically validated ISP and self-report Add Health subjects, and demonstrate increased stuttering liability within stuttering cases. The model was developed and trained using the default auto-phi shrinkage parameter through PRScs. LD panels were constructed using 1000 Genomes Project phase 3 EUR reference data. a, AUC model performance in the ISP cohort. b, AUC model performance in the Add Health cohort.
Discussion
Our GWAS of self-reported stuttering in males and females is the largest to date, comprising nearly 100,000 cases and over one million controls. Although some complex traits with differences in prevalence by sex or ancestry show similar genetic architectures across these groups79,80, we found that there are both shared and distinct genetic effects impacting stuttering risk within sex and/or ancestry groups. Our primary GWAS and secondary meta-analyses identified 57 distinct signals (mapping to 48 unique genes), none of which have been previously reported in stuttering literature. Our estimates of stuttering heritability are in line with SNP-based estimates for other common, complex traits such as insomnia81, type 2 diabetes82 and beat synchronization83. Most prior studies of genetic risk factors for stuttering have explored rare variant effects in pedigrees; however, we find effects that are consistent with high polygenicity, suggesting a genetic architecture similar to other common complex traits81,82,83.
To validate whether the genetic architecture captured by this analysis reflects the genetic etiology of clinically ascertained developmental stuttering, we developed PRS models from the sex-specific EUR GWAS results and applied them to the ISP30 and Add Health32 stuttering cohorts (which are predominantly EUR samples) for validation. Within the ISP, a clinically ascertained cohort enriched with males and persistent cases of stuttering, the EUR male-derived models showed significant differences in liability scores between stuttering cases and controls in both sexes. By contrast, the EUR female model had significant predictive performance only in EUR females in the ISP. In Add Health, a self-report cohort, both the male and female PRS models significantly predicted case or control status. The difference in the predictive performance between females and males in these two external validation cohorts is notable, with several possible non-mutually exclusive explanations: (1) compared to 23andMe EUR males, the trait captured by 23andMe EUR females is less representative of developmental stuttering; (2) the genetic liability for developmental stuttering varies between males and females and is perhaps confounded by differences in genetic susceptibility to persistent versus recovered stuttering; or (3) genetic variation contributing to developmental stuttering risk may be confounded by horizontal pleiotropy modulated by sex.
The first possibility, that the EUR female stuttering phenotype is not capturing clinically diagnosed developmental stuttering as well as the EUR male stuttering phenotype, may have different possible explanations as well. Prior studies have found that sex-differential participation bias in 23andMe data can lead to spurious genetic associations with sex, particularly in complex traits associated with sex-differential participation bias38. For the 57 unique stuttering-associated loci, six were within the SNP windows for loci previously identified to be spuriously associated with sex38. We may also be observing sex-specific differences in adult recall accuracy. Females are more likely to recover from stuttering in early childhood1,13,14; therefore, the self-report stuttering phenotype may be more impacted by the accuracy of recall of stuttering in childhood in females than in males. There may be other explanations for a difference in how well the self-report phenotype serves as a proxy for clinical diagnosis by sex; for example, female participants in our study may be more likely to report subclinical stuttering than males. Future research will be needed to deconvolute genetic risk factors that are specific to sex and persistence.
To our knowledge, all variants reaching genome-wide significance in our study are novel findings for stuttering. The Open Targets Genetics ‘Variant-to-Gene’ (V2G) pipeline84,85 assigned VRK2 as the most likely functional gene (r2 between these variants is 0.31 in the CEU population) for one locus spanning the two top hits in EUR male (rs35609938 and rs1040225)86. Specifically, rs35609938 occurs downstream of VRK2 and upstream of FANCL, and rs1040225 occurs within either an intronic or upstream region of VRK2, depending on isoform (Table 2 and Supplementary Figs. 1 and 2). Interestingly, FANCL and VRK2 were recently implicated in musical beat synchronization83. Rhythm perception impairments have been linked to several speech and language conditions, including stuttering51,52. Complex rhythm discrimination is below average in adults51 and children50 who stutter, consistent with the Atypical Rhythm Risk Hypothesis52, which posits that those with atypical rhythm may be at risk for developmental speech and language disorders. Clinically, synchronizing speech with external pacing cues, such as a metronome, can often temporarily decrease stuttering disfluencies50,87,88,89. Impairments in rhythm processing may be functionally related to stuttering, and our findings offer the first genetic support for this hypothesis.
Our study also provides additional evidence of the effect for three genes that have been previously reported in family-based studies of stuttering. Although the previously reported variants were not directly genotyped and were too rare for accurate imputation or not present in the 23andMe data, we found three unique signals surpassing Bonferroni significance (see Methods) for the following genes: GNPTAB, GNPTG19,27 and AP4E1 (ref. 24); for GNPTG, the sentinel variant in our replication analysis was extremely rare (minor allele frequency of 4.22 × 10−5). AP4E1 interacts with previously reported gene NAGPA24,29, another gene that has been previously implicated in family studies; together, these results provide modest additional support for the role of these genes in stuttering.
Imaging studies have demonstrated that people who stutter exhibit differences in a variety of brain areas42,43, including the frontal cortex44, cingulate cortex41,44, basal ganglia (caudate, substantia nigra)45,46,47,48, inferior temporal lobe90,91,92 and cerebellum41. Our gene module enrichment analysis (see Supplementary Note) revealed enrichments in the frontal cortex, cortex, anterior cingulate cortex, nucleus accumbens of the basal ganglia and cerebellum. These results, as well as our finding that stuttering-associated variants are involved in neurons and regulation of gene expression in the brain (see Extended Data Figs. 7–10 and Supplementary Tables 11–13) and that more than 20 genes identified in our analyses have been previously implicated in neurological and mental disorder traits, provide additional evidence for the neurological underpinning of stuttering risk41,42,43,44,45,46,47,48,93,94. Future follow-up analyses are needed to functionally validate distinct enrichment features in males and females.
Genetic correlation analysis revealed a significant correlation of increased stuttering risk in both EUR males and females with increased risk of depression. In EUR females, we also observed a significant correlation of increased stuttering risk with hearing loss, asthma, daytime sleepiness, ADHD and BMI, and a negative correlation with alcohol consumption and walking pace. The connection between stuttering and sleep is further supported by a recent large-scale multi-ancestry GWAS meta-analysis of short and long sleep duration95, which identified five genes associated with sleep duration that overlap with our stuttering findings: PTPB2, SRPK2, KCTD10, MMAB and SLC39A8. In addition, for traits without available sex-stratified summary statistics, genetic correlation analysis revealed increased risk of autism and impaired beat synchronization. These genetic correlations, and their respective directions, are largely consistent with previous literature identifying traits comorbid with stuttering49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76. However, we recognize that as a volunteer-based study, there are documented biases in UK Biobank that may affect our genetic correlation and Mendelian randomization results, especially when it comes to genetic correlates of behavior, lifestyle and social outcomes96. Therefore, increasing representativeness in biobanks and increasing representation and documentation of communication disorders in particular will be paramount for future studies. We regret that many language and motor-related phenotypes that may be correlated with stuttering are underrepresented in the genetics literature, do not have publicly available GWAS summary statistics or did not meet our minimum binary trait case requirement of n > 1,000 and were not tested in our genetic correlation analyses.
We also performed Mendelian randomization analyses77 to assess causal relationships between stuttering and traits that have been previously reported as co-occurring with stuttering (see Supplementary Table 14). Overall, walking pace and autism were predicted to have a causal effect on stuttering. In addition, stuttering was predicted to have a causal effect on ADHD and autism. Furthermore, we observed significant bi-directional effects between stuttering and BMI. In general, these results bolster previous studies showing preliminary evidence for associations between stuttering and BMI53, as well as gross motor coordination97. We also observed significant bi-directional causal effects between stuttering and depression. This result is consistent with several studies suggesting that both males and females who stutter report elevated symptoms of depression compared to their fluent counterparts65,66,98. Specifically, communication difficulties caused by stuttering can result in feelings of frustration and hopelessness and, along with broader societal stigma toward stuttering, can negatively impact psychological health61,99,100. We also observed significant bi-directional effects between stuttering and beat synchronization, or the ability to clap to a beat. This finding is especially compelling considering the role rhythm perception may have in stuttering: as previously mentioned, rhythm discrimination is below average in adults51 and children50 who stutter, and synchronizing speech with external pacing cues, such as a metronome88,89, can temporarily decrease stuttering disfluencies. Interestingly, we observed different significant causal relationships across sexes, which may be a result of observed differences in genetic risk between sexes. The distinct causal pathways in males and females relating stuttering to genetically correlated traits are notable. However, as previously discussed, females are more likely to recover from stuttering than males13,14; thus, one limitation of this study is an inability to fully differentiate between effects related to sex and those related to stuttering persistence. Improved granularity of self-report with information regarding stuttering persistence will be necessary to further clarify these effects.
Overall, we leveraged 99,776 cases and over one million controls to identify 57 unique genome-wide significant loci associated with ancestry-specific and sex-specific self-reported stuttering and validated male and female PRS for self-reported stuttering in two independent stuttering datasets. This study provides insight into the genetic contributions to stuttering at the population level, demonstrating that genetic risk is complex, polygenic and dominated by low to modest genetic effects. After decades of progress examining the behavioral, neural and physiological contributions of language, articulation, speech–motor coordination, temperament and emotion to stuttering, the addition of genetic studies may help provide a mechanistic framework for integrating findings across these domains. We demonstrate shared molecular underpinnings between stuttering and other associated traits, including depression, autism and beat synchronization. An unresolved question in the field of stuttering, with lengthy historical speculation, is whether persistent stuttering and recovery from stuttering represent distinct subtypes101,102. Thus far, studies have yielded conflicting results with no clear answer103,104,105; the analyses presented here motivate continued research into causal differences between females and males as well as between persistent and recovered stuttering. These findings represent a critical step toward the next era of research for this common, complex, costly and heritable condition.
Methods
Ethics
We have complied with all ethical guidelines. All participants provided informed consent to participate in the research. This study has been approved by Vanderbilt IRB (181575 and 180583).
Studies
23andMe, Inc
GWAS included research participants from 23andMe who self-reported stuttering status through a questionnaire. Cases included participants who answered ‘yes’ (99,776 individuals) to the question ‘Have you ever had a stammer or stutter?’ Controls (1,023,243 individuals) included participants who answered ‘no’ to this same question (Table 1 and Supplementary Table 1). As is common in population-based studies investigating stuttering, our study relies on self-report (see ref. 106, in which all but two of the reviewed papers were based on retrospective questionnaire or interview-style surveys)1,106. All individuals included in the analyses provided informed consent and answered surveys online according to the 23andMe human subject protocol, which was reviewed and approved by Ethical & Independent Review Services, a private institutional review board (http://www.eandireview.com). Although developmental stuttering is by far the most common form of stuttering, the self-report phenotype in this study may also include other rarer forms of stuttering, such as acquired neurogenic stuttering. Compared to developmental stuttering, acquired stuttering is uncommon in clinical practice107. Therefore, although we expect the genetic signatures captured in these analyses to reflect effects for developmental stuttering, we use the general term ‘stuttering’ to describe the self-reported phenotype. Despite these phenotyping considerations, large-scale sample collection through surveys can dramatically increase power—a major limitation in genetic studies of stuttering thus far—and enable the discovery of robust and reproducible effects108,109,110,111.
ISP
We assessed the predictive performance of polygenic scores derived from 23andMe using participants with developmental stuttering from the ISP. Stuttering status in the ISP cohort was confirmed by speech–language pathologists with expertise in stuttering and fluency disorders (see ref. 30 for a detailed description of this study and genotyping information).
Add Health
We also assessed the predictive performance of polygenic scores derived from 23andMe using participants who self-reported stuttering on an Add Health questionnaire. Add Health is an ongoing, longitudinal study investigating social, behavioral and biological factors that influence health and developmental outcomes from early adolescence through adulthood. Add Health collects general demographics, health survey data, in-home physical data and biological data from all participants (see ref. 32 for genotyping information). Stuttering cases were defined as participants who at any point answered ‘yes’ to the following survey question: ‘Do you have a problem with stuttering or stammering?’ All control individuals answered ‘no’ to the above question. Self-reported race/ethnicity was used to group participants.
Statistical analysis
Eight ancestry-specific and sex-specific genome-wide association analyses were performed to determine variant association with stuttering (Table 1 and Supplementary Table 1). Each GWAS used a logistic regression that assumed an additive model for allelic effects, where P is the probability of self-reported stuttering:
SNP significance was evaluated by a likelihood ratio test. Results for the X chromosome were computed similarly, in sex-stratified analyses with male genotypes coded as if they were homozygous diploid for the observed allele. Principal components (pc) for each logistic regression model were derived independently for each ancestry, using ~65,000 high-quality genotyped variants present across all five genotyping platforms. Principal components were computed on a subset of participants randomly sampled across all the genotyping platforms (137k, 102k, 1,000k and 360k participants were used for AFR, EAS, EUR and AMR, respectively). Principal components for participants not included in the analysis were obtained by projection, using the eigenvectors from the analysis and the SNP weights. Summary statistics were reported for imputed autosomal and X chromosome variants that were successfully imputed across all platforms (v2, v3, v4 and v5) and reached the following quality control thresholds: average r2 > 0.5, minimum r2 > 0.3, batch check P > 1 × 10−50 and surpassed a minor allele count of 30. Loci in our primary analyses met a two-tiered multiple test correction: first, we applied a false discovery rate threshold of 5% across all autosomal and X chromosome variants; second, a traditional P value threshold of 5 × 10−8 was applied. Sentinel variants were defined as the most significant variant found within a ±1 Mb window. The top 10,000 SNPs for the ancestry-specific and sex-specific GWAS can be found in Supplementary Data 1–8. For more information on obtaining access to the full dataset, see Data availability.
For our secondary analyses, we performed ancestry-specific, sex-combined meta-analyses, using METAL with the inverse-variance weighted (STDERR command) option36 (https://genome.sph.umich.edu/wiki/METAL_Documentation) to meta-analyze our EUR male and EUR female summary statistics, EAS male and EAS female summary statistics, AFR male and AFR female summary statistics and AMR male and AMR female summary statistics. Our sex-combined EUR summary statistics were used for partitioned heritability, genetic correlations and Mendelian randomization analyses.
We aggregated association summary statistics across ancestry-specific association studies using multi-ancestry meta-regression, as implemented in MR-MEGA37. Secondary analyses were also performed for the female-specific and the male-specific meta-analyses, and an ancestry-combined, sex-combined meta-analysis. We included three axes of genetic variation as covariates in the ancestry-combined, sex-combined meta-analysis and, given the lower number of contributing analyses in the female-specific and male-specific meta-analyses and limits on the number of possible axes of genetic variation, included one axis as a covariate in the sex-specific analyses. Resulting P values were adjusted for genomic control.
Annotation
The sentinel variant for each genome-wide significant locus was reported for each ancestry-specific and sex-specific study. Genome-wide significance112 was defined as P < 5 × 10−8. Annotated gene(s) for each locus included the predicted functional gene(s) for each loci (when available) according to the Open Targets Genetics V2G pipeline, which combines evidence from molecular quantitative trait loci, chromatin interactions, in silico functional predictions from Ensembl and distance between the variant and gene canonical transcription start site84,85. Loci were defined according to independent LD blocks identified in 1000 Genomes reference data, using the matched ancestry reference. Reported sentinel variants represent the variant with the smallest P value within each associated region. All reported positional coordinates (chromosome and base pair locations) refer to human genome reference build 37. We also checked for replication (consistency across sex and/or ancestry) of genome-wide significant signals in the primary and secondary GWAS analyses, with replication defined as a Bonferroni-adjusted correction for the number of variants and look-ups performed (P < 1.92 × 10−4, using a Bonferroni correction; Supplementary Table 4).
Within-ancestry genetic correlation
To better understand the shared genetic relationship within ancestry groups, we performed genetic correlations within the EUR and EAS ancestries using LDSC34,35 and relevant ancestry reference panels. EUR and EAS reference were generated using 1000 Genomes reference data and accessed through LDSC (https://alkesgroup.broadinstitute.org/LDSCORE). Genetic correlations were not performed for the AFR and AMR ancestries, given that LDSC and other methods can produce biased results owing to admixture113. Within-ancestry genetic correlation results can be found in Supplementary Table 2. Our within-ancestry genetic correlation results indicate there may be a strong relationship between EUR male and EUR female findings; however, there are differences in genome-wide significant sentinel variants (Table 2). Given that the LDSC approach and other common genetic correlation approaches rely on LD estimates, they can produce biased estimates of correlation when the data being compared have different LD patterns113. Therefore, genetic correlation analyses were not performed within AFR and AMR analyses because genetic correlation approaches are biased in the presence of admixture, and correlations were also not tested across ancestry groups113.
Variant effect size concordance analysis
We compared summary statistics from each ancestry-specific and sex-specific GWAS to all others to test for concordance between the summary statistics (Supplementary Table 3)31. As we were interested in investigating a broader genetic architecture, rather than including only variants with associations surpassing genome-wide significance, we examined variants with P < 0.005. To establish our observed concordance rate, we determined the number of shared genetic variants with the same direction of effect divided by the total number of shared variants between the two datasets being compared. Given that datasets may have different LD patterns, we sought to estimate the expected concordance effects under the null hypothesis, preserving the LD structure observed in the data. To do so, we defined blocks of variants with the same direction of effect within 10 kb windows within each dataset. Next, we performed 25 permutations to randomly assign the direction of effect for these blocks to compute a simulated concordance rate for each permutation (defined as the number of shared genetic variants having the same direction of effect divided by the total number of shared variants between the two datasets). To establish our concordance rate under the null, we took the mean of the 25 simulated concordance rates. The rate of concordance expected under the null was used to compare the observed and expected concordance rates using a binomial t-test.
Effect size heterogeneity
To test for heterogeneity of effect estimates for our 24 sentinel variants from our eight ancestry-specific and sex-specific primary analyses, we used a Wald hypothesis test114 to examine the differential effects for each sentinel variant compared to the effects of the other seven primary GWAS. We calculated the test statistic, defined as the quotient of the difference in coefficients and the standard error of this difference. Under the null hypothesis positing equality of coefficients across subgroups, the test statistic conforms approximately to a standard normal distribution, as dictated by the principles of the Wald test. Significance was determined by using a Bonferroni-adjusted P value threshold of 2.98 × 10−4, correcting for the 24 sentinel variants in our primary analysis multiplied by seven (the number of Wald test comparisons performed).
SNP heritability and partitioned heritability
Genome-wide SNP-based heritability (h2) was calculated using summary statistics resulting from the EUR male and EUR female GWAS results using the LDSC software. We used LDSC to estimate liability-scaled h2, assuming a 10% lifetime population prevalence of stuttering based on the observed frequency of stuttering cases (Table 1 and Supplementary Table 1). EUR LD maps were generated using 1000 Genomes reference data and accessed through LDSC (https://alkesgroup.broadinstitute.org/LDSCORE). Heritability calculations were not estimated for the AFR, AMR and EAS ancestries. For AFR and AMR, heritability estimates are often biased in the presence of admixture113. For EAS, given the limited sample size, heritability estimates are likely to be unreliable39.
To better understand the types of variation that contribute most to stuttering, we estimated partitioned SNP heritability for our EUR male GWAS, EUR female GWAS and sex-combined EUR meta-analysis, using stratified LDSC34,35. LD scores, regression weights and allele frequencies for EUR populations were obtained from https://alkesgroup.broadinstitute.org/LDSCORE. We performed 80 different tests, resulting in a Bonferroni-corrected global significance threshold of P < 6.25 × 10−4. Partitioning was performed for 52 baseline annotations as previously described40. Enrichment was considered significant for P < 9.6 × 10−4, derived by Bonferroni correction for 52 gene sets.
Next, we tested for enrichment of cell-type-specific and tissue-specific heritability40 in our EUR male GWAS, EUR female GWAS and sex-combined EUR meta-analysis, while controlling for the baseline models. Data for the brain cell types used to estimate enrichment of heritability consisted of neuron, astrocyte and oligodendrocyte data from a previous publication115. Enrichment tests were considered significant at P < 0.017, derived by Bonferroni correction for three gene sets. Gene expression data (computed from GTEx Project data116) used to estimate enrichment of heritability consisted of eight brain regions with empirical evidence of relation to stuttering41,42,43,44,45,46,47,48. Enrichments were considered significant at P < 6.25 × 10−3, derived by Bonferroni correction for eight gene sets. Lastly, 20 tissue-specific annotations of active chromatin sequences and enhancers, derived from the Roadmap Epigenomics consortium117 and EN-TEx40,118 with epigenetic marks of monomethylation (me1), trimethylation (me3) and acetylation (ac) were used to estimate enrichment of heritability. Analyses were performed using epigenetic marks of H3K27ac, H3K9ac, H3K4me1, H3K4me3 and H3K36me3 in four brain regions previously associated with stuttering41,43,44,46,47,48. These active histone marks were considered significant at P < 2.5 × 10−3, a Bonferroni adjustment for 20 gene sets.
Validation of loci from the literature
Locus validation analysis was performed using methods detailed in a previous publication30; however, the previously calculated effective number of tests was multiplied by eight, given that we checked for validation of signals across our eight ancestry-specific and sex-specific GWAS. As such, the effective number of tests used for our Bonferroni correction represented the number of independent tag SNPs in each gene, with pairwise r2 < 0.4 multiplied by eight. Gene validation results were Bonferroni-corrected for the effective number of tests in each gene, and we report the variant with the minimum P value for each gene.
SNP-based replications looked for replication of the top hits reported in previous publications30,31 across all eight ancestry-specific and sex-specific studies. Replication for the 16 sentinel variants identified in ref. 30 and replication results for the 11 sentinel variants identified in ref. 31 can be found in Supplementary Table 6.
Stuttering PRS model development
PRS models were trained using the EUR male and EUR female self-reported stuttering GWAS results in PRScs119 using a continuous shrinkage before adjusting individual SNP weights for LD and variant significance. Default auto-phi parameters were used in both the male-derived and female-derived models and were not optimized to prevent overfitting. EUR LD reference panels were constructed using the 1000 Genomes Project phase 3 EUR reference and used for EUR and AFR analyses. The male PRS model included 1,024,432 variant predictors, and the female PRS model included 1,024,431 variant predictors. Although the PRS we developed are not clinically useful, here we leveraged these scores to evaluate whether the risk of self-reported stuttering captured in the 23andMe analyses is predictive of risk in clinically validated cohorts. Each model was applied to both the ISP30 and Add Health samples32, matched according to ancestry and stratified by sex. The ISP testing set included 651 EUR male stuttering cases and 4,264 sex-matched controls, as well as 242 EUR female cases and 1,788 female controls; 48 AFR male stuttering cases and 308 sex-matched controls were included as well as 16 AFR female stuttering cases and 90 sex-matched controls. The Add Health testing set included 352 EUR male stuttering cases and 3,104 sex-matched controls; 236 EUR female cases and 3,517 female controls; 117 AFR male stuttering cases and 847 sex-matched controls were included as well as 107 AFR female stuttering cases and 1,101 sex-matched controls.
Genetic datasets were scored using PLINK (v.1.9)120. Genetic liability scores were z-score-normalized. Liability score distributions between cases and controls were compared by Student’s two-sample t-test. Predictive performance was interpreted using the AUC metric.
Genetic correlation
The phenotypes chosen for the genetic correlation analyses were either found to be associated with stuttering within electronic health records at Vanderbilt University Medical Center53 or previously associated with stuttering in other extant literature (see Supplementary Table 14 for phenotypes and references). Specifically, we compared our EUR male stuttering GWAS results to summary statistics from GWAS of EUR males and our EUR female stuttering GWAS results to summary statistics from GWAS of EUR females for 16 traits encompassing the following categories, for which sex-specific summary statistics were available: behavioral, circadian rhythm, immune, metabolic, motor, neurological and hearing traits. Sex-specific GWAS results were obtained from http://www.nealelab.is/uk-biobank (see Supplementary Table 15). We also explored the genetic correlation of stuttering with two traits for which sex-stratified summary statistics were not available to us: beat synchronization and autism (Supplementary Table 15). For beat synchronization and autism, we compared our EUR male, EUR female and sex-combined EUR summary statistics for each trait to the available sex-combined EUR GWAS summary statistics for the trait. Overall, we performed genetic correlation analyses for 18 traits previously associated with stuttering. A case sample size of >1,000 was required for binary traits. Owing to these constraints, genetic correlation analyses were only performed in the EUR-specific GWAS. All genetic correlation estimates were calculated using LDSC34,35. Results were Bonferroni-corrected for the number of tests performed (32 sex-specific analyses and six sex-combined analyses, P < 1.32 × 10−3). Full results are shown in Supplementary Table 16.
Mendelian randomization
We performed debiased inverse-variance weighted77 Mendelian randomization analysis using the MendelianRandomization R package121. Debiased inverse-variance weighted Mendelian randomization can robustly handle weak instrument bias, as well as balanced horizontal pleiotropy77,78. We performed Mendelian randomization analysis for all associations that were significantly genetically correlated with stuttering (Fig. 2 and Supplementary Table 16). Specifically, we performed a sex-specific Mendelian randomization analysis for depression, walking pace, BMI, ADHD, beat synchronization and autism, and sex-combined Mendelian randomization analyses for beat synchronization and autism. All of these traits have prior evidence of association with stuttering in the literature49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76 (see Supplementary Table 14 for summary). All summary statistics for depression, walking pace, BMI, autism and beat synchronization were obtained from either http://www.nealelab.is/uk-biobank, PGC+iPSYCH data122 or 23andMe83). Significant variants for each comorbid trait (P < 5 × 10−6) were filtered to include only those tested in both the comorbid trait GWAS and the stuttering GWAS. Furthermore, independent instrumental SNPs were selected based on LD using the 1000 Genomes Project EUR reference data, retaining SNPs with r2 < 0.02 in 1,000 kb windows using PLINK120. Analysis details and results of other common Mendelian randomization methods are annotated in Supplementary Table 17.
GWAS Catalog
After filtering the GWAS Catalog123 (release date, 2022-21-12) to contain only genome-wide significant loci (P < 5.00 × 10−8), 48 unique genes from all ancestry-specific and sex-specific GWAS, sex-combined, ancestry-specific meta-analyses, ancestry-combined, sex-specific meta-analyses and ancestry-combined, sex-combined meta-analysis were included in the GWAS Catalog search. Traits associated with these genes in the GWAS Catalog were binned into 22 trait categories (Supplementary Table 19). Our genome-wide significant hits and the associated GWAS Catalog findings can be found in Supplementary Table 19b. The number of unique genes per category can be seen in Supplementary Fig. 90.
Gene module enrichment
We performed an enrichment test for gene modules using our most likely genes for each identified signal in either the EUR male or EUR female GWAS to identify sets of highly correlated genes (gene modules) that were associated with stuttering risk (see Supplementary Note). The top associated genes were determined for all variants with P < 5.00 × 10−6 using the Open Targets Genetics V2G pipeline84,85. Gene co-expression networks, as previously defined124, consist of groups of functionally related genes or ‘modules’. Module enrichments were reported for any gene-tissue-specific analysis with a false discovery rate-adjusted P value of <0.05 among any of the 49 available GTEx tissues. Using g:Profiler, we conducted a competitive gene pathway analysis for reported module enrichments, followed by annotation for the biological pathways identified (Supplementary Tables 20–24).
Colocalization
Bayesian colocalization analysis was performed between our EUR male and EUR female genome-wide significant associations and tissue-specific expression quantitative trait locus signals from GTEx V8 data116 using fast enrichment aided colocalization analysis (see Supplementary Note)125,126. We examined colocalization solely within regions where a variant was identified as a top hit (Table 2). Evaluated regions included all sentinel variants in either the EUR male, EUR female, AFR male, AMR male or AMR female GWAS, as well as any other variant found in the same LD block. LD blocks were defined based on LD calculated in the 1000 Genomes EUR population reference data127. Colocalization analyses were tissue-specific and included all GTEx V8 tissues available. For significant colocalized signals, we report regional colocalization probability (that is, the probability that the association signal is in the shared region across stuttering and expression128) of >0.05 (Supplementary Table 25).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Ancestry-specific and sex-specific summary statistics of self-reported stuttering will be made available through the 23andMe website to qualified researchers under agreement with 23andMe that protects the privacy of the 23andMe participants. Interested investigators should visit the 23andMe Publication Dataset Access Program at https://research.23andme.com/dataset-access. The top 10,000 SNPs for all primary analyses are provided in Supplementary Data 1–8.
Code availability
The code for the concordance analysis can be found on GitHub (https://github.com/belowlab/Concordance-Analysis) and Zenodo (https://doi.org/10.5281/zenodo.14884575)129.
References
Yairi, E. & Ambrose, N. Epidemiology of stuttering: 21st century advances. J. Fluen. Disord. 38, 66–87 (2013).
Chang, S. E. et al. Stuttering: our current knowledge, research opportunities, and ways to address critical gaps. Neurobiol. Lang. (Camb.) 6, nol_a_00162 (2025).
Andrews, G. & Harris, M. The Syndrome of Stuttering. Clinics in Developmental Medicine No. 17 (William Heinemann Medical Books, 1964).
Månsson, H. Childhood stuttering. J. Fluen. Disord. 25, 47–57 (2000).
Yairi, E. & Ambrose, N. A longitudinal study of stuttering in children: a preliminary report. J. Speech Hear. Res. 35, 755–760 (1992).
Baxter, S. et al. The state of the art in non‐pharmacological interventions for developmental stuttering. Part 1: a systematic review of effectiveness. Int. J. Lang. Commun. Disord. 50, 676–718 (2015).
Daniels, D. E. & Gabel, R. M. The impact of stuttering on identity construction. Top. Lang. Disord. 24, 200–215 (2004).
Daniels, D. E., Gabel, R. M. & Hughes, S. Recounting the K–12 school experiences of adults who stutter: a qualitative analysis. J. Fluen. Disord. 37, 71–82 (2012).
McAllister, J., Collier, J. & Shepstone, L. The impact of adolescent stuttering on educational and employment outcomes: evidence from a birth cohort study. J. Fluen. Disord. 37, 106–121 (2012).
Klein, J. F. & Hood, S. B. The impact of stuttering on employment opportunities and job performance. J. Fluen. Disord. 29, 255–273 (2004).
Craig, A., Blumgart, E. & Tran, Y. The impact of stuttering on the quality of life in adults who stutter. J. Fluen. Disord. 34, 61–71 (2009).
Craig, A., Hancock, K., Tran, Y., Craig, M. & Peters, K. Epidemiology of stuttering in the community across the entire life span. J. Speech Lang. Hear. Res. 45, 1097–1105 (2002).
Singer, C. M., Hessling, A., Kelly, E. M., Singer, L. & Jones, R. M. Clinical characteristics associated with stuttering persistence: a meta-analysis. J. Speech Lang. Hear. Res. 63, 2995–3018 (2020).
Singer, C. M., Otieno, S., Chang, S.-E. & Jones, R. M. Predicting persistent developmental stuttering using a cumulative risk approach. J. Speech Lang. Hear. Res. 65, 70–95 (2022).
Shugart, Y. Y. et al. Results of a genome-wide linkage scan for stuttering. Am. J. Med. Genet. 124A, 133–135 (2004).
Riaz, N. et al. Genomewide significant linkage to stuttering on chromosome 12. Am. J. Hum. Genet. 76, 647–651 (2005).
Suresh, R. et al. New complexities in the genetics of stuttering: significant sex-specific linkage signals. Am. J. Hum. Genet. 78, 554–563 (2006).
Wittke-Thompson, J. K. et al. Genetic studies of stuttering in a founder population. J. Fluen. Disord. 32, 33–50 (2007).
Kang, C. et al. Mutations in the lysosomal enzyme-targeting pathway and persistent stuttering. N. Engl. J. Med. 362, 677–685 (2010).
Lan, J. et al. Association between dopaminergic genes (SLC6A3 and DRD2) and stuttering among Han Chinese. J. Hum. Genet. 54, 457–460 (2009).
Domingues, C. E. F. et al. A genetic linkage study in Brazil identifies a new locus for persistent developmental stuttering on chromosome 10. Genet. Mol. Res. 13, 2094–2101 (2014).
Mohammadi, H. et al. Sex steroid hormones and sex hormone binding globulin levels, CYP17 MSP AI (−34 T:C) and CYP19 codon 39 (Trp:Arg) variants in children with developmental stuttering. Brain Lang. 175, 47–56 (2017).
Raza, M. H., Amjad, R., Riazuddin, S. & Drayna, D. Studies in a consanguineous family reveal a novel locus for stuttering on chromosome 16q. Hum. Genet. 131, 311–313 (2012).
Raza, M. H. et al. Association between rare variants in AP4E1, a component of intracellular trafficking, and persistent stuttering. Am. J. Hum. Genet. 97, 715–725 (2015).
van Beijsterveldt, C. E. M., Felsenfeld, S. & Boomsma, D. I. Bivariate genetic analyses of stuttering and nonfluency in a large sample of 5-year-old twins. J. Speech Lang. Hear. Res. 53, 609–619 (2010).
Fagnani, C., Fibiger, S., Skytthe, A. & Hjelmborg, J. V. B. Heritability and environmental effects for self-reported periods with stuttering: a twin study from Denmark. Logoped. Phoniatr. Vocol. 36, 114–120 (2011).
Kazemi, N., Estiar, M. A., Fazilaty, H. & Sakhinia, E. Variants in GNPTAB, GNPTG and NAGPA genes are associated with stutterers. Gene 647, 93–100 (2018).
Kang, C. et al. Evaluation of the association between polymorphisms at the DRD2 locus and stuttering. J. Hum. Genet. 56, 472–473 (2011).
Frigerio Domingues, C. E. et al. Are variants in sex hormone metabolizing genes associated with stuttering? Brain Lang. 191, 28–30 (2019).
Polikowsky, H. G. et al. Population-based genetic effects for developmental stuttering. HGG Adv. 3, 100073 (2022).
Shaw, D. M. et al. Phenome risk classification enables phenotypic imputation and gene discovery in developmental stuttering. Am. J. Hum. Genet. 108, 2271–2283 (2021).
Harris, K. M. et al. Cohort profile: the National Longitudinal Study of Adolescent to Adult Health (Add Health). Int. J. Epidemiol. 48, 1415–1415k (2019).
Durand, E. Y., Do, C. B., Mountain, J. L. & Macpherson, J. M. Ancestry composition: a novel, efficient pipeline for ancestry deconvolution. Preprint at https://doi.org/10.1101/010512 (2014).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Mägi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
Pirastu, N. et al. Genetic analyses identify widespread sex-differential participation bias. Nat. Genet. 53, 663–671 (2021).
The Neale Lab. Insights from estimates of SNP-heritability for >2,000 traits and disorders in UK Biobank. http://www.nealelab.is/blog/2017/9/20/insights-from-estimates-of-snp-heritability-for-2000-traits-and-disorders-in-uk-biobank (2017).
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).
Lu, C. et al. The neural substrates for atypical planning and execution of word production in stuttering. Exp. Neurol. 221, 146–156 (2010).
Chang, S.-E., Garnett, E. O., Etchell, A. & Chow, H. M. Functional and neuroanatomical bases of developmental stuttering: current insights. Neuroscientist 25, 566–582 (2019).
Etchell, A. C., Civier, O., Ballard, K. J. & Sowman, P. F. A systematic literature review of neuroimaging research on developmental stuttering between 1995 and 2016. J. Fluen. Disord. 55, 6–45 (2018).
Liu, J. et al. A functional imaging study of self-regulatory capacities in persons who stutter. PLoS ONE 9, e89891 (2014).
Neef, N. E. et al. Altered morphology of the nucleus accumbens in persistent developmental stuttering. J. Fluen. Disord. 55, 84–93 (2018).
Toyomura, A., Fujii, T. & Kuriki, S. Effect of an 8-week practice of externally triggered speech on basal ganglia activity of stuttering and fluent speakers. NeuroImage 109, 458–468 (2015).
Chang, S. E. & Zhu, D. C. Neural network connectivity differences in children who stutter. Brain 136, 3709–3726 (2013).
Chang, S.-E., Horwitz, B., Ostuni, J., Reynolds, R. & Ludlow, C. L. Evidence of left inferior frontal–premotor structural and functional connectivity deficits in adults who stutter. Cereb. Cortex 21, 2507–2518 (2011).
Choo, A. L., Smith, S. A. & Li, H. Associations between stuttering, comorbid conditions and executive function in children: a population-based study. BMC Psychol. 8, 113 (2020).
Wieland, E. A., McAuley, J. D., Dilley, L. C. & Chang, S.-E. Evidence for a rhythm perception deficit in children who stutter. Brain Lang. 144, 26–34 (2015).
Garnett, E. O. et al. Auditory rhythm discrimination in adults who stutter: an fMRI study. Brain Lang. 236, 105219 (2023).
Ladányi, E., Persici, V., Fiveash, A., Tillmann, B. & Gordon, R. L. Is atypical rhythm a risk factor for developmental speech and language disorders? Wiley Interdiscip. Rev. Cogn. Sci. 11, e1528 (2020).
Pruett, D. G. et al. Identifying developmental stuttering and associated comorbidities in electronic health records and creating a phenome risk classifier. J. Fluen. Disord. 68, 105847 (2021).
Arenas, R. M., Walker, E. A. & Oleson, J. J. Developmental stuttering in children who are hard of hearing. Lang. Speech Hear. Serv. Sch. 48, 234–248 (2017).
Strom, M. A. & Silverberg, J. I. Eczema is associated with childhood speech disorder: a retrospective analysis from the National Survey of Children’s Health and the National Health Interview Survey. J. Pediatrics 168, 185–192.e4 (2016).
Ajdacic-Gross, V. et al. Subtypes of stuttering determined by latent class analysis in two Swiss epidemiological surveys. PLoS ONE 13, e0198450 (2018).
Briley, P. M. & Merlo, S. Presence of allergies and their impact on sleep in children who stutter. Perspect. ASHA Spec. Interest Groups 5, 1454–1466 (2020).
Jacobs, M. M., Merlo, S. & Briley, P. M. Sleep duration, insomnia, and stuttering: the relationship in adolescents and young adults. J. Commun. Disord. 91, 106106 (2021).
Merlo, S. & Briley, P. M. Sleep problems in children who stutter: evidence from population data. J. Commun. Disord. 82, 105935 (2019).
Mohammadi, H. et al. Sleep problems, social anxiety and stuttering severity in adults who do and adults who do not stutter. J. Clin. Med. 12, 161 (2023).
Briley, P. M., Merlo, S. & Ellis, C. Sex differences in childhood stuttering and coexisting developmental disorders. J. Dev. Phys. Disabil. 34, 505–527 (2022).
Lebrun, Y. Stuttering and epilepsy. J. Neurolinguist. 6, 433–444 (1991).
Tomisato, S., Oishi, N., Asano, K., Watanabe, Y. & Ogawa, K. Developmental disability and psychiatric conditions in 39 patients with stuttering. Jpn. J. Logoped. Phoniatr. 57, 7–11 (2016).
Tichenor, S. E., Palasik, S. & Yaruss, J. S. Understanding the broader impact of stuttering: suicidal ideation. Am. J. Speech Lang. Pathol. 32, 2087–2110 (2023).
Briley, P. M., Gerlach, H. & Jacobs, M. M. Relationships between stuttering, depression, and suicidal ideation in young adults: accounting for gender differences. J. Fluen. Disord. 67, 105820 (2021).
Bernard, R., Hofslundsengen, H. & Frazier Norbury, C. Anxiety and depression symptoms in children and adolescents who stutter: a systematic review and meta-analysis. J. Speech Lang. Hear. Res. 65, 624–644 (2022).
Iverach, L. & Rapee, R. M. Social anxiety disorder and stuttering: current status and future directions. J. Fluen. Disord. 40, 69–82 (2014).
Tichenor, S. E. & Yaruss, J. S. Stuttering as defined by adults who stutter. J. Speech Lang. Hear. Res. 62, 4356–4369 (2019).
Tichenor, S. E., Johnson, C. A. & Yaruss, J. S. A preliminary investigation of attention-deficit/hyperactivity disorder characteristics in adults who stutter. J. Speech Lang. Hear. Res. 64, 839–853 (2021).
Alm, P. A. Stuttering in relation to anxiety, temperament, and personality: review and analysis with focus on causality. J. Fluen. Disord. 40, 5–21 (2014).
Pirinen, V. et al. A comprehensive analysis of speech disfluencies in autistic young adults and control young adults: group differences in typical, stuttering-like, and atypical disfluencies. J. Speech Lang. Hear. Res. 66, 832–848 (2023).
Plexico, L. W., Cleary, J. E., McAlpine, A. & Plumb, A. M. Disfluency characteristics observed in young children with autism spectrum disorders: a preliminary report. Perspect. Fluen. Fluen. Disord. 20, 42–50 (2010).
Scaler Scott, K., Tetnowski, J. A., Flaitz, J. R. & Yaruss, J. S. Preliminary study of disfluency in school‐aged children with autism. Int. J. Lang. Commun. Disord. 49, 75–89 (2014).
Tetnowski, J. A. & Donaher, J. Stuttering and autism spectrum disorders: assessment and treatment. Semin. Speech Lang. 43, 117–129 (2022).
Iverach, L. et al. Mood and substance use disorders among adults seeking speech treatment for stuttering. J. Speech Lang. Hear. Res. 53, 1178–1190 (2010).
Heelan, M., McAllister, J. & Skinner, J. Stuttering, alcohol consumption and smoking. J. Fluen. Disord. 48, 27–34 (2016).
Ye, T., Shao, J. & Kang, H. Debiased inverse-variance weighted estimator in two-sample summary-data Mendelian randomization. Ann. Stat. 49, 2079–2100 (2021).
Sanderson, E. et al. Mendelian randomization. Nat. Rev. Methods Prim. 2, 6 (2022).
Mahajan, A. et al. Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation. Nat. Genet. 54, 560–572 (2022).
Hautakangas, H. et al. Genome-wide analysis of 102,084 migraine cases identifies 123 risk loci and subtype-specific risk alleles. Nat. Genet. 54, 152–160 (2022).
Jansen, P. R. et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat. Genet. 51, 394–403 (2019).
Xue, A. et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat. Commun. 9, 2941 (2018).
Niarchou, M. et al. Genome-wide association study of musical beat synchronization demonstrates high polygenicity. Nat. Hum. Behav. 6, 1292–1309 (2022).
Mountjoy, E. et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat. Genet. 53, 1527–1533 (2021).
Ghoussaini, M. et al. Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res. 49, D1311–D1320 (2021).
Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557 (2015).
Wingate, M. E. & Howell, P. Foundations of stuttering. J. Acoust. Soc. Am. 112, 1229–1231 (2002).
Brady, J. P. Metronome-conditioned speech retraining for stuttering. Behav. Ther. 2, 129–150 (1971).
Brady, J. P. Studies on the metronome effect on stuttering. Behav. Res. Ther. 7, 197–204 (1969).
Hosseini, R., Walsh, B., Tian, F. & Wang, S. An fNIRS-based feature learning and classification framework to distinguish hemodynamic patterns in children who stutter. IEEE Trans. Neural Syst. Rehabil. Eng. 26, 1254–1263 (2018).
Hartwigsen, G., Neef, N. E., Camilleri, J. A., Margulies, D. S. & Eickhoff, S. B. Functional segregation of the right inferior frontal gyrus: evidence from coactivation-based parcellation. Cereb. Cortex 29, 1532–1546 (2019).
Chesters, J., Möttönen, R. & Watkins, K.E. Neural changes after training with transcranial direct current stimulation to increase speech fluency in adults who stutter. Preprint at https://doi.org/10.31219/osf.io/8st3j (2021).
Loucks, T., Kraft, S. J., Choo, A. L., Sharma, H. & Ambrose, N. G. Functional brain activation differences in stuttering identified with a rapid fMRI sequence. J. Fluen. Disord. 36, 302–307 (2011).
Chow, H. M. et al. Linking lysosomal enzyme targeting genes and energy metabolism with altered gray matter volume in children with persistent stuttering. Neurobiol. Lang. 1, 365–380 (2020).
Austin-Zimmerman, I. et al. Genome-wide association studies and cross-population meta-analyses investigating short and long sleep duration. Nat. Commun. 14, 6059 (2023).
Schoeler, T. et al. Participation bias in the UK Biobank distorts genetic associations and downstream analyses. Nat. Hum. Behav. 7, 1216–1227 (2023).
Kim, K. S., Daliri, A., Flanagan, J. R. & Max, L. Dissociated development of speech and limb sensorimotor learning in stuttering: Speech auditory-motor learning is impaired in both children and adults who stutter. Neuroscience 451, 1–21 (2020).
Ardila, A. et al. An epidemiologic study of stuttering. J. Commun. Disord. 27, 37–48 (1994).
Corcoran, J. A. & Stewart, M. Stories of stuttering. J. Fluen. Disord. 23, 247–264 (1998).
Boyle, M. P. Enacted stigma and felt stigma experienced by adults who stutter. J. Commun. Disord. 73, 50–61 (2018).
Cox, N. J. & Kidd, K. K. Can recovery from stuttering be considered a genetically milder subtype of stuttering? Behav. Genet. 13, 129–139 (1983).
Seider, R. A., Kidd, K. K. & Gladstien, K. L. Recovery and persistence of stuttering among relatives of stutterers. J. Speech Hear. Disord. 48, 402–409 (1983).
Ambrose, N. G., Yairi, E., Loucks, T. M., Seery, C. H. & Throneburg, R. Relation of motor, linguistic and temperament factors in epidemiologic subtypes of persistent and recovered stuttering: initial findings. J. Fluen. Disord. 45, 12–26 (2015).
Yairi, E., Ambrose, N. & Cox, N. Genetics of stuttering: a critical review. J. Speech Lang. Hear. Res. 39, 771–784 (1996).
Yairi, E. Subtyping stuttering I: a review. J. Fluen. Disord. 32, 165–196 (2007).
Bloodstein, O. & Ratner, N. B. A Handbook on Stuttering 6th edn (Delmar Cengage Learning, 2008).
Theys, C., Van Wieringen, A., Sunaert, S., Thijs, V. & De Nil, L. F. A one year prospective study of neurogenic stuttering following stroke: incidence and co-occurring disorders. J. Commun. Disord. 44, 678–687 (2011).
Wells, H. R. R. et al. GWAS identifies 44 independent associated genomic loci for self-reported adult hearing difficulty in UK Biobank. Am. J. Hum. Genet. 105, 788–802 (2019).
Skelton, M. et al. Self‐reported medication use as an alternate phenotyping method for anxiety and depression in the UK Biobank. Am. J. Med. Genet. B Neuropsychiatr. Genet. 186, 389–398 (2021).
Hyde, C. L. et al. Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nat. Genet. 48, 1031–1036 (2016).
Purves, K. L. et al. A major role for common genetic variation in anxiety disorders. Mol. Psychiatry 25, 3292–3303 (2020).
Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. Science 273, 1516–1517 (1996).
Luo, Y. et al. Estimating heritability and its enrichment in tissue-specific gene sets in admixed populations. Hum. Mol. Genet. 30, 1521–1534 (2021).
Clogg, C. C., Petkova, E. & Haritou, A. Statistical methods for comparing regression coefficients between models. Am. J. Sociol. 100, 1261–1293 (1995).
Cahoy, J. D. et al. A transcriptome database for astrocytes, neurons, and oligodendrocytes: a new resource for understanding brain development and function. J. Neurosci. 28, 264–278 (2008).
GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Yavorska, O. O. & Burgess, S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int. J. Epidemiol. 46, 1734–1739 (2017).
Martin, J. et al. A genetic investigation of sex bias in the prevalence of attention-deficit/hyperactivity disorder. Biol. Psychiatry 83, 1044–1053 (2018).
Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Gerring, Z. F., Gamazon, E. R. & Derks, E. M. A gene co-expression network-based analysis of multiple brain tissues reveals novel genes and molecular pathways underlying major depression. PLoS Genet. 15, e1008245 (2019).
Pividori, M. et al. PhenomeXcan: mapping the genome to the phenome through the transcriptome. Sci. Adv. 6, eaba2083 (2020).
Wen, X., Pique-Regi, R. & Luca, F. Integrating molecular QTL data into genome-wide genetic association analysis: probabilistic assessment of enrichment and colocalization. PLoS Genet. 13, e1006646 (2017).
Berisa, T. & Pickrell, J. K. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics 32, 283–285 (2015).
Hukku, A., Sampson, M. G., Luca, F., Pique-Regi, R. & Wen, X. Analyzing and reconciling colocalization and transcriptome-wide association studies from the perspective of inferential reproducibility. Am. J. Hum. Genet. 109, 825–837 (2022).
Scartozzi, A. Concordance-analysis, version 1.0. Zenodo https://doi.org/10.5281/zenodo.14884575 (2025).
Acknowledgements
We thank participants from 23andMe, Add Health and the ISP. This research was supported by National Institutes of Health (NIH) grants from the National Institute on Deafness and Other Communication Disorders (NIDCD) to Vanderbilt University Medical Center and Wayne State University (1R03DC015329 and R01DC017175) and to Vanderbilt University Medical Center (5R21DC016723 and R01DC020311) supporting H.G.P., D.M.S., J.E.B., E.J.L. R.M.J. and S.K. D.G.P. received funding from the National Center for Advancing Translational Sciences of the NIH under award number TL1TR002244. A.C.S. received funding from the NIDCD under award number F31DC022482. The content is solely the responsibility of the authors and does not necessarily represent the official views of any funding agencies.
Author information
Authors and Affiliations
Consortia
Contributions
J.E.B. oversaw the entire study. J.E.B. and S.J.K. conceived the study. H.G.P., A.C.S., D.M.S. and J.E.B. drafted the manuscript, with sections contributed by D.G.P. and H.-H.C. H.G.P., A.C.S., D.M.S., H.-C.C., L.E.P., S.-H.C. and H.M.H. performed analyses. Y.Y., C.D.H. and R.M.J. assisted in the interpretation of all analyses. A.S.P. provided computational support to all analyses. H.M.H., C.L.A. and K.M.H. managed the Add Health Study. S.J.K., J.M.B., E.J.L. and K.Z.V. managed the ISP cohort. R.L.G. contributed the beat synchronization GWAS summary statistics. S.M. and the 23andMe Research Team performed all ancestry-specific and sex-specific GWAS analyses. All authors critically reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
S.M. and members of the 23andMe Research Team were or are employed by and hold stock or stock options in 23andMe, Inc. D.M.S. is currently employed by AstraZeneca. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Beate St Pourcain and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Study design.
a, Overview of primary and secondary GWAS analyses. b, Flow chart of study design and downstream analyses. Primary analyses are indicated in yellow. Secondary analyses are indicated in blue. Green boxes represent analyses that consist of using data from both the primary and secondary analyses. ‘*’ indicate analyses performed in European ancestry only.
Extended Data Fig. 2 Manhattan and Q-Q plots for primary genome-wide association analyses.
Manhattan plots for genome-wide association analyses performed via logistic regression (see Methods). The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axes represent expected −log10(P-value), and the y-axes represent observed −log10(P-value) with 95% CI indicated by shaded region. a, The AFR female association study included 24,783 total samples (3,008 self-reported stuttering cases) and 29,555,214 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0165. b, The AFR male association study included 13,977 total samples (2,480 self-reported stuttering cases) and 29,443,778 autosomal variants. Three loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0156. c, The AMR female association study included 74,187 total samples (7,209 self-reported stuttering cases) and 29,814,878 autosomal variants. One locus reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0340. d, The AMR male association study included 45,206 total samples (6,346 self-reported stuttering cases) and 29,759,321 autosomal variants. One locus reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0177. e, The EAS female association study included 13,347 total samples (1,205 self-reported stuttering cases) and 10,595,969 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0081. f, The EAS male association study included 7,169 total samples (1,134 self-reported stuttering cases) and 10,586,108 autosomal variants. No loci reached genome-wide significance (dotted line P < 5 × 10−8). λgc = 1.0182. g, Q-Q plot for the EUR female GWAS; λgc = 1.0855. h, Q-Q plot for the EUR male GWAS; λgc = 1.0834.
Extended Data Fig. 3 Forest plots showing significant differences in effect size heterogeneity.
Wald tests were performed to test for heterogeneity of effect estimates for the 24 sentinel variants that reached genome-wide significance in the eight ancestry- and sex-specific GWAS compared to the other seven primary GWAS. Plot displays the eight sentinel variants that demonstrated significant differences in effect size by ancestry and/or sex. Data points represent effect size +/− standard error. Each genetic ancestry is color coded. Female-specific effect estimates are shown using circles. Male-specific effect estimates are shown using triangles. Effect estimates that are not shown indicate that the variant was not found. The symbol ‘+’ represents the effect of the result in the Discovery GWAS. The symbol ‘*’ denotes significant differences in effect size estimates compared to the Discovery GWAS after Bonferroni correction. Full GWAS samples sizes can be found in Table 1.
Extended Data Fig. 4 Manhattan and Q-Q plots for the sex-combined meta-analyses.
Manhattan plots for sex-combined meta-analyses performed using the inverse variance weighted option in METAL (see Methods). The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axes represent expected −log10(P-value), and y-axes represent observed −log10(P-value) with 95% CI indicated by shaded region. a, The meta-analysis includes 944,350 total samples (78,394 self-reported stuttering cases) and 29,566,899 autosomal variants observed across EUR male and EUR female GWAS; λgc = 1.0586. b, The meta-analysis includes 119,393 total samples (13,555 self-reported stuttering cases) and 29,948,846 autosomal variants observed across AMR male and AMR female GWAS; λgc = 1.0070. c, The meta-analysis includes 20,516 total samples (2,339 self-reported stuttering cases) and 10,656,266 autosomal variants observed across EAS male and EAS female GWAS; λgc = 1.0023. d, The meta-analysis includes 38,760 total samples (5,488 self-reported stuttering cases) and 29,705,842 autosomal variants observed across AFR male and AFR female GWAS; λgc = 1.0019.
Extended Data Fig. 5 Miami plot for female-specific meta-analysis and male-specific meta-analysis.
Ancestry-combined, sex-specific meta-analyses were performed using multi-ancestry meta-regression via MR-MEGA (see Methods). a, The multi-ancestry female meta-analysis (top panel in orange) included 682,388 total samples (51,559 self-reported stuttering cases) and 41,960,804 autosomal variants (see Methods). Five loci reached genome-wide significance (dotted line P < 5 × 10−8). The multi-ancestry male meta-analysis (bottom panel in blue) included 440,631 total samples (48,217 self-reported stuttering cases) and 41,937,718 autosomal variants (see Methods). Three loci reached genome-wide significance (dotted line P < 5 × 10−8). The x-axis represents chromosome base pair coordinates in human genome build 37, and the y-axis represents observed −log10(P-value) for each GWAS. Annotated genes for each GWAS includes the predicted functional gene for each loci (when available) according to Open Targets V2G pipeline. b, Q-Q plot for the female-specific meta-analysis. c, Q-Q plot for the male-specific meta-analysis. Q-Q plot x-axes represent expected −log10(P-value), and y-axes represent observed −log10(P-value), with 95% CI indicated by shaded region. Note: P-values were adjusted for genomic control.
Extended Data Fig. 6 Manhattan and Q-Q plot for the ancestry- and sex-combined meta-analysis.
The multi-ancestry meta-regression meta-analysis performed via MR-MEGA (see Methods) included 1,023,243 total samples (99,776 self-reported stuttering cases) and 42,031,853 autosomal variants observed across all eight independent sex and ancestries-specific GWAS. The dotted line represents genome-wide significance (P < 5 × 10−8). Q-Q plot x-axis represents expected −log10(P-value), and y-axis represents observed −log10(P-value) with 95% CI indicated by shaded region. Note: P-values were adjusted for genomic control.
Extended Data Fig. 7 Partitioned heritability estimates for broad functional categories.
Partitioned heritability was performed using the 52 broad functional annotations defined by Finucane et al. Enrichment is the proportion of h2/proportion of SNPs. The horizontal, dotted line indicates no enrichment. Error bars indicate enrichment estimates +/− standard error. Asterisks indicate enrichment estimates are significant based on a Bonferroni P-value of 9.6 × 10−4 (52 gene-sets). a, Broad functional categories partitioned heritability estimates in EUR female self-reported stuttering. b, Broad functional categories partitioned heritability estimates in EUR male self-reported stuttering. c, Broad functional categories partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 8–10.
Extended Data Fig. 8 Partitioned heritability estimates for brain cell types.
Partitioned heritability was performed using the three cell-type annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 1.7 × 10−2, three tests). a, Brain cell type partitioned heritability estimates in EUR female self-reported stuttering. b, Brain cell type partitioned heritability estimates in EUR male self-reported stuttering. c, Brain cell type partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.
Extended Data Fig. 9 Partitioned heritability estimates for tissue-specific gene expression.
Partitioned heritability was performed using the tissue-specific expression annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 6.25 × 10−3, eight tests). a, Tissue-specific gene expression partitioned heritability estimates in EUR female self-reported stuttering. b, Tissue-specific gene expression partitioned heritability estimates in EUR male self-reported stuttering. c, Tissue-specific gene expression partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.
Extended Data Fig. 10 Partitioned heritability estimates for tissue-specific chromatin marks.
Partitioned heritability was performed using the tissue-specific chromatin annotations defined by Finucane et al. The −log10(P-value) for enrichment estimates is indicated on the y-axis. The horizontal dotted line indicates significant after Bonferroni correction (P < 2.5 × 10−3, 20 tests). a, Tissue-specific chromatin partitioned heritability estimates in EUR female self-reported stuttering. b, Tissue-specific chromatin partitioned heritability estimates in EUR male self-reported stuttering. c, Chromatin partitioned heritability estimates in sex-combined EUR self-reported stuttering. Full results are presented in Supplementary Tables 11–13.
Supplementary information
Supplementary Information
Supplementary Note and Supplementary Figs. 1–90.
Supplementary Table 1
Supplementary Tables 1–25.
Supplementary Data 1
Supplementary Data 1–8.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Polikowsky, H.G., Scartozzi, A.C., Shaw, D.M. et al. Large-scale genome-wide analyses of stuttering. Nat Genet 57, 1835–1847 (2025). https://doi.org/10.1038/s41588-025-02267-2
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/s41588-025-02267-2
This article is cited by
-
De novo protein-coding gene variants in developmental stuttering
Molecular Psychiatry (2025)