Introduction

Speech disorders are a common presenting concern for children referred to paediatricians and primary care physicians [1]. Associated poor language, literacy, educational, employment and social outcomes mean speech disorders come with significant individual, societal and health system burden [2,3,4].

Up to 5% of children develop common speech disorders, including articulation and phonological impairments [1]. These conditions are highly tractable and typically resolve by 7 years of age, with or without intervention [4, 5]. By contrast, 1 in 1000 children follow a severely disrupted developmental path to intractable motor speech disorders of childhood apraxia of speech (CAS) and/or dysarthria [6]. CAS affects the ability to plan and sequence speech movements, decreasing the precision, consistency and intelligibility of speech [7]. Children with CAS need explicit teaching and practice of new sounds and words, often requiring speech therapy into adolescence or adulthood [8]. A diagnosis of CAS implicates disruption of inferior frontal lobe and basal ganglia pathways, whereas dysarthria is a disorder of the control and execution of neuromuscular movements for speech implicating the ‘final common’ motor pathway or corticobulbar tract [9,10,11].

A major challenge of managing motor speech disorders has been a lack of aetiological knowledge. Families search for explanatory causes in a long diagnostic journey [12], attending multiple primary care physician, paediatric, neurology, and other specialists appointments, involving serial investigations (e.g. MRI, metabolic testing) with significant costs and psychological stress and without explanatory results [13]. In the past five years, genomic sequencing, performed in a research setting, has shown a substantial genetic contribution to these conditions [14,15,16,17], even in the absence of intellectual disability [15, 16], with a diagnostic yield of 30% [3]. This research yield is comparable to that seen in cerebral palsy, epilepsy and intellectual disability, conditions for which genomic clinical testing is routine in many centres [3]. However, clinical genomic testing is not yet the standard of care for motor speech disorders, despite such testing being highly valued by families with lived experience [13].

In this study, we offered clinical genomic testing, consisting of chromosomal microarray, fragile X PCR and exome sequencing, to a clinically ascertained population of children referred to a speech genomics clinic at a paediatric tertiary hospital. We examined diagnostic yield and clinical variables predictive of a genetic diagnosis.

Materials and methods

Ethical consent

The study was approved by the Human Research Ethics Committee of The Royal Children’s Hospital, Melbourne, Australia (#37353). Written informed consent was obtained from parents or legal guardians.

Study design

This prospective single-centre cohort study recruited probands referred to the Genetics of Speech Disorders clinic at the Royal Children’s Hospital in Melbourne, Australia, between August 2020 and December 2023.

Recruitment

Children were referred for motor speech testing by their treating primary care physician, paediatrician, paediatric sub-specialist (metabolic, genetics, neurology) or speech pathologist. Probands were triaged for eligibility by the clinical nurse co-ordinator with input from the clinical team where indicated (e.g., psychologist to interpret cognitive assessments; clinical geneticist to interpret past genetic reports). Inclusion criteria were: (1) probands aged 2;7 to 17 years with CAS or dysarthria in the absence of known or previously diagnosed intellectual disability, autism spectrum disorder (ASD) level III, or acquired brain injury; and (2) parent, speech pathologist and treating physician reporting the primary presenting feature and clinical concern as motor speech disorder [15, 16].

Phenotyping

Probands were prospectively assessed by a speech pathologist (AM, ML, CB); paediatrician and clinical geneticist (DA, dual-qualified); and psychologist (EB, CB) using a protocol outlined below; see clinical pipeline Fig. 1a.

Fig. 1: Clinical pipeline.
figure 1

a Clinical workflow. PCR Polymerase chain reaction. Referral received by clinical co-ordinator (nurse, LO) at Motor Speech Disorders Clinic. Triage by nurse (LO) and speech pathologist (AM). Proband seen by speech pathologist (AM, ML, CB, AG) to confirm speech motor disorder. Medical work-up by paediatrician and clinical geneticist (DA, dual-qualified). Psychologists (EB, CB) determined IQ status at the further appointment. A genetic counsellor (ML) or a clinical geneticist (DA) performed pre-test genetic counselling and genetic consent. Saliva samples for chromosomal microarray and exome sequencing were collected using paediatric. Oracollect kits from DNAGenotek. Clinically accredited genomic sequencing was performed at the Victorian Clinical Genetics Service, including: initial computational bioinformatics analysis; meeting to prioritise variants for assessment attended by senior medical genomic scientists, clinical geneticists, and genetic counsellors; variant curation as per American College of Medical Genetics and Genomics guidelines. Next, a review of variant and phenotypic data was undertaken by a multidisciplinary speech clinic team, with laboratory scientists, clinical geneticists and genetic counsellors independent to the speech clinic, to reach consensus and ensure genotype concordant with phenotype prior to reporting. Proband attended further clinic appointment (in-person or telehealth) for the return of results with post-test genetic counselling and segregation encouraged where appropriate. b Flowchart of recruitment and genetic results. CNV Copy number variant; *Monogenic findings include single nucleotide variants (SNVs) and intragenic deletions.

A detailed family history and pedigree were recorded and prospective phenotyping documented medical and developmental history, including the presence of dysmorphism [15, 16] (Table 1, Supplemental Tables 1, 2a, b). All secondary neurodevelopmental outcomes were defined using standardised testing and a professional report for validation, e.g., multi-disciplinary assessment standardised testing for a formal diagnosis of ASD; a physiotherapist confirmed fine or gross motor disorder. Where children had not yet received a formal ASD or attention deficit hyperactivity disorder (ADHD) diagnosis, the paediatrician (DA) recorded the presence of ASD/ADHD features (Table 1; Supplemental Tables 1, 2a, b).

Table 1 Phenotype of participants with genetic diagnosis (n = 44).

For speech, CAS was confirmed if the American Speech-Language-Hearing Association (ASHA) criteria for CAS were met: (1) inconsistent errors on consonants and vowels in repeated productions of syllables or words, (2) lengthened and disrupted coarticulatory transitions between sounds and syllables, and (3) inappropriate prosody; all relative to age and stage of development [7]. ASHA criteria were operationally defined and rated from: phonetic transcriptions of 55 single-word speech tests designed to elicit all sounds of English, a test of speech consistency (children repeat 15/55 stimuli words twice) and a 5 min conversational speech sample as described previously [15, 16, 18]. Dysarthria was diagnosed as an oral tone or coordination disturbance, examined by a paediatrician and speech pathologist and speech dysarthric features were identified during the conversation sample, using the Mayo Clinic Dysarthria rating scale [15, 16, 18, 19]. Receptive and expressive language was assessed where time and resources were available [20,21,22,23] (Table 1, Supplemental Tables 1, 2a, b). Cognition was assessed by a psychologist using standardised testing [24,25,26].

Genetic and genomic analysis

A genetic counsellor (ML) and/or clinical geneticist (DA) provided pre and post-genetic counselling. Chromosomal microarray analysis (CMA, Illumina GSA, San Diego, USA) and Fragile X PCR sizing were performed on saliva or blood DNA for all probands prior to exome sequencing (ES). Saliva samples were collected using paediatric Oracollect kits from DNAGenotek. For ES, genomic DNA was isolated from saliva samples with trio testing (probands together with parents) performed where possible. ES was performed by Victorian Clinical Genetics Services (VCGS), Melbourne, see Supplemental Fig. 1 for an overview of the analysis pipeline. Coding regions were enriched using TWIST exome capture (TWIST Bioscience). Sequencing was performed on Illumina sequencers to an average target depth of 80×. Data were analysed using an in-house validated version of Cpipe [27], followed by variant filtering and analysis using Alissa Interpret analysis software (Agilent). Data were analysed for variants linked to disorders relevant to the proband phenotype following a tiered approach. Initially, variants in genes associated with intellectual disability were evaluated (https://panelapp-aus.org/panels/250/). Analysis was then expanded to a virtual panel of ~6000 genes known to cause Mendelian disorders (https://panelapp-aus.org/panels/137/). Variants were classified according to the current American College of Medical Genetics and Genomics (ACMG) guidelines for variant interpretation [28].

Statistical analysis

We report the proportion and type of pathogenic variants causative for motor speech disorder from CMA, Fragile X and ES testing. For variables predictive of a genetic diagnosis, baseline clinical variables are recorded as frequencies (n, %) and medians (range), as dictated by data type. Odds ratios have been used to identify variables associated with a genetic diagnostic result. The variables of ASD, ADHD, dysmorphism, intelligence quotient (IQ) and receptive and expressive language were all dichotomised for the odds ratio analysis.

Results

Genetic diagnostic yield and genomic findings

172 probands met the inclusion criteria, of whom 153 (88.9%; 111 males, 42 females) completed genetic testing (153 CMA, 143 ES), at an average age of 5 years 8 months (range 2 years 7 months to 16 years 5 months). Two probands withdrew from the study and 4 declined genetic testing. Despite three reminders to do so, the remaining 13 either did not respond to the genetic testing appointment invitation, did not send back their consent form, or did not return their saliva sample (Fig. 1b).

Overall, 44/153 (29%) probands received a genetic diagnosis (15 females; median 5 years 1 month; Tables 1, 2). 30 findings were confirmed de novo and 9 inherited (5 maternal; 4 paternal). Inheritance status could not be confirmed for 5 pathogenic variants due to missing parental DNA (Table 2, Supplemental Fig. 1a). 9 were aneuploidies or copy number variants (CNVs) (all >50 kb) detected on CMA: 15q13.2q13.3 del, 22q11.21 del, 22q11.21q11.22 dup, 1q21.1q21.2 dup, 16p12.2 del, 17q12, 47 dup, 47,XXX, 48,XXYY, 48,XYYY. There were 35 monogenic diagnoses, including single-nucleotide variants (SNVs) and intragenic deletions (< 50 kb). 34/35 were identified from ES: ADGRL1, ANK2, BPTF, CACNA1A (n = 3), CAMK2A, CAMTA1, EBF3, EHMT1 (n = 2), FBXW7, FOXP1, GNAI1, KCND3, KDM5C, NSD1, PPP2R5D, RAF1, SCN8A, SET, SETBP1 (n = 2), SETD1A, SETD2, SETD5, SLC6A1, SLC6A8, SMARCA2 (n = 2), SPTBN1, SRRM2, TAB2, TRIM8. 1/35 was identified from CMA:

Table 2 (a) Monogenic findings (including single nucleotide variations and intragenic deletions), n = 35; (b) Copy number variants (CNVs) and aneuploidies, n = 9 (Genetic diagnoses n = 44).

CUX1 (Tables 1, 2). Variant types for SNVs (n = 32) included missense (n = 13), frameshift (n = 9), nonsense (n = 7), in-frame deletions (n = 2), and splicing (n = 1) (Table 2). All variants were deposited in ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/; accession numbers SCV002557839.2- SCV005399598.1).

13 candidate variants of uncertain significance (VUS) were identified in probands without a diagnosis; comprising SNVs in the following genes: AP1G, BCORL1, BRPF1, CRY, DPF2, FGF13, IQSEC2, USP9X (n = 2); and the following CNVs: 14q12 del (de novo), 7p22.3 del (maternal), 3p14.2 del (de novo), and 2q37.2q37.3 del (maternal) (Supplemental Table 3). Two additional VUSs were identified in solved probands ID8 FOXP2, and ID5 TRRAP [28].

All (likely) pathogenic variants identified were in genes previously associated with syndromic or non-syndromic intellectual disability, epilepsy, ASD and cerebral palsy, Supplemental Table 4. No child received a diagnosis of Fragile X syndrome or Fragile X-associated disorder/pre-mutation carrier status.

Phenotyping: variables predictive of a genetic diagnosis

Phenotypes of 44 probands with a genetic diagnosis, and 109 without a genetic diagnosis, are presented in Fig. 2 and Supplemental Tables 1, 2, 5. Compared to those without a genetic diagnosis, a higher proportion of children with genetic diagnoses had dysarthria (with and without CAS; 18.2% vs 8.3%), gross motor delays (68.2% vs. 37.6%), fine motor delays (65.9% vs. 40.4%), or a delayed age of walking (52.3% vs. 6.4%), dysmorphism (45.5% vs. 5.5%) and/or borderline or mild intellectual disability (55.9% vs. 15.2%); and a lower proportion had confirmed ASD (9.1% vs 18.3%). No participant with a genetic diagnosis had seizures or a diagnosis of epilepsy, although participant 6 had staring spells with a normal EEG. Of those without genetic diagnoses, only two had seizures; participant 147 had absence seizures; participant 71 had a history of febrile seizures.

Fig. 2: Radar chart of phenotypic overlap for participants with and without clinical genetic diagnoses.
figure 2

ADHD ADHD diagnosis or ADHD features, ASD ASD diagnosis or ASD features, Dysmorphism Dysmorphic or dysmorphic features, Intellectual impairment: borderline ID or mild ID; Expressive language impairment: mildly impaired, moderately impaired, severely impaired, minimally verbal or could not be assessed due to severity; Receptive language impairment: mildly impaired, moderately impaired or severely impaired.

No potential environmental factors contributing to CAS were identified. No probands had a history of congenital infection, teratogen exposure, birth injury, severe prematurity or acquired brain injury. Six probands had a history of moderate to late prematurity (32–26 weeks) and none of these had a genetic diagnosis.

Consistent with these descriptive findings, the odds of having a genetic diagnosis were highest for individuals with delayed age of walking (OR:15.96, 95% CI: 6.34, 44.86), receptive language impairment (OR: 7.14, 95% CI: 2.96, 18.62), borderline or mildly impaired intellect (OR: 7.07, 95% CI: 2.89, 18.18), dysmorphism (present/features) (OR: 5.59, 95% CI: 2.66, 12.19), expressive language impairment (OR: 3.62, 95% CI: 1.28, 13.04), gross motor delay/disorder (OR: 3.55, 95% CI: 1.72, 7.65); and fine motor delay/disorder (OR: 2.86, 95% CI: 1.39, 6.06), respectively (Supplemental Table 5). The odds of having a genetic diagnosis were lowest for children with confirmed diagnoses of ADHD or ASD. Wide confidence intervals for the OR reflect the relatively small sample size for children with a genetic diagnosis.

Discussion

We identified a specific genetic diagnosis in 29% of children referred to a motor speech disorders clinic at a paediatric tertiary hospital. This yield replicates research findings in a clinical setting for the first time. Previous prospective research-based cohorts ascertained for a primary motor speech diagnosis have identified genetic diagnoses in 42%, 33% and 26% respectively [14,15,16], but may have been subject to ascertainment bias. Importantly, this diagnostic yield is comparable to that of genomic testing in other neurodevelopmental conditions, such as epilepsy, cerebral palsy and intellectual disability, for which genomic testing is increasingly routine [29,30,31,32].

Mendelian diagnoses were present in 23% (35/153) of the cohort. We implicate 15 Mendelian disorder genes as novel explanations for CAS and dysarthria: ADGRL1, ANK2, BPTF, CAMK2A, CUX1, FBXW7, KCND3, NSD1, RAF1, SETD2, SLC6A8, SPTBN1, SRRM2, TAB2, TRIM8). We also confirm involvement of CACNA1A, CAMTA1, EBF3 EHMT1, GNAI1, KDM5C, PPP2R5D, SCN8A, SET, SETBP1, SETD1A, SETD5, SLC6A1, SMARCA2 and FOXP1, now recurrent in unrelated cases across cohorts ascertained for CAS and dysarthria [14,15,16,17, 33]. Past work led to the development of the first publicly available clinical speech panel in July 2024, which we have now expanded to include newly implicated genes with sufficient evidence for inclusion (https://panelapp-aus.org/panels/4290/). We highlight that two of our implicated genes are associated with multiple speech disorder phenotypes: SPTBN1 in CAS, stuttering [34] and speech delay [35]; and SETD1A in CAS and speech delay [35].

Although we excluded children with a pre-existing diagnosis of intellectual disability, our subsequent assessment indicated that 9.7% had mild and 17.7% had borderline intellectual disability. Probands with a genetic diagnosis were more likely to have cognitive impairment compared to those without a genetic diagnosis. This result was not unexpected, given that all genes associated with CAS to date are also associated with intellectual disability [3]. Of those children with a genetic diagnosis and normal IQ, most had an IQ at the lower end of the normal range; although, there were some exceptions: an IQ above 100 was recorded in female participant 47 with 48,XXX (Trisomy X), male participant 10 with an EBF3 variant, and male participant 38 with 16p12.2 deletion. Children with a genetic diagnosis were also more likely to have impairments in motor domains, with delays in walking and fine and gross motor function commonly reported (52.3%, 65.9%, 68.2%, respectively). Overall, these data support the hypothesis that in monogenic forms of CAS, the speech phenotype is part of a broader neurodevelopmental impairment. We also noted that children with CAS and co-occurring dysarthria were more likely to have a genetic diagnosis than children with CAS alone. This is the first evidence of this observation to our knowledge, however requires independent replication.

It is notable that children with a genetic diagnosis ascertained via CAS and/or dysarthria in our cohort were typically towards the milder end of the reported severity spectrum for their disorder. This is likely explained by ascertainment biases favouring (1) severe cases in the published literature, and (2) milder cases in our CAS cohort, also seen in our past studies [15, 16]. For example, we identified CACNA1A variants in three unrelated probands in the absence of epilepsy or intellectual disability [36]. By extension from these data, milder forms of monogenic neurodevelopmental disorders are likely to be more common than currently recognised, and a case can be made for expanding the application of genomic sequencing to children (and adults) with mild and borderline ID.

In the 71% of our cohort without a specific genetic diagnosis, a different phenotypic profile emerged. A significant majority of this group had average FSIQ and expected age of walking, and nearly half (47.7%) had either features of ASD or a formal ASD diagnosis. The genetic basis of these undiagnosed cases remains unclear, but the clinical overlap with ASD implicates polygenic inheritance [37, 38]. Based on these observations, we hypothesise that the genetic architecture of CAS comprises at least two distinct causal pathways: (1) monogenic CAS, characterised by variable cognitive impairment, motor impairment and syndromic features, and (2) polygenic CAS, in which cognition and motor skills are relatively spared, but ASD features are more common.

The association between CAS abnormalities on CMA has yet to be fully characterised. Only 4% of our cohort had a copy number variant. All six CNVs in our study are recognized recurrent CNVs with incomplete penetrance, and each was detected in only one patient. An additional 2% of patients had sex chromosome aneuploidies, including two patients with four sex chromosomes (XXYY and XYYY). The association between sex chromosome trisomies and quadrisomies has been previously recognized [39]. An overall yield of 6% for CMA in CAS is lower than reported previously in an Italian CAS cohort (13%) [17] and for intellectual disability, but comparable to the yield in some other neurodevelopmental phenotypes, such as autism and epilepsy, and microarray remains an important front-line test for motor speech disorders. By contrast, we found no cases of fragile X (neither premutation nor full mutation), in line with past prospective studies in the field, suggesting that this test may be less useful in children with CAS [14,15,16,17]. Past retrospective chart review studies of large cohorts reporting on genotype-CAS phenotype associations have also failed to identify any association with fragile X [33, 40].

We found no significant evidence for environmental factors contributing to CAS, including pregnancy-related and neonatal complications. Moderate to late prematurity (32-36 weeks) was reported in 5.5% of patients without a genetic diagnosis, comparable to the general population. Prematurity was not seen in any participants with a genetic diagnosis. Our data are in line with past studies showing no evidence of CAS or dysarthria diagnoses in children born pre-term [41].

The range of genetic diagnoses in our cohort underscores the role of chromatin modifiers and transcriptional regulators (BPTF, CAMTA1, CUX1, EBF3, EHMT1, FOXP1, KDM5C, NSD1, SET, SETD1A, SETD2, SETD5, SMARCA2, SETBP1, SRRM2) in speech disorder pathogenicity, consistent with our prior reports [3]. We highlight the commonality of genes encoding SET domain proteins (SETD1A, SETD2, SETD5, NSD1, EHMT1), as these are part of a methyltransferase superfamily integral in post-translational protein regulation with a well-established role in neurodevelopmental disorders [15, 16]. Other protein classes enriched for pathogenic variants were ion channels and solute carriers (ANK2, SCN8A, KCND3, CACNA1A, SLC6A1, SLC6A8) and cell signalling pathway members (FBXW7, PPP2R5D, TAB2, RAF1, SET).

Whilst further replication and verification of results is warranted, given the modest size of our cohort, motor speech disorders are extremely rare (1 in 1000) when compared to other neurodevelopmental conditions (1 in 100 for intellectual disability, autism; 4-10 per 1000 for epilepsy) and greater collaboration will be necessary to build sizeable cohorts for independent replication.

While this research focused on a clinical diagnostic setting, future gene discovery research outside of a clinical pipeline remains important to explore novel variants associated with CAS, with functional characterisation.

Conclusion

Our findings confirm a diagnostic yield for motor speech disorder that is comparable to other neurodevelopmental conditions. We have identified predictors of genetic diagnosis that can be used to guide advances in clinical practice and health services for motor speech disorder. We suggest prioritisation of clinical genomic testing for children with a primary motor speech disorder on a background of delayed walking, persistent fine and gross motor delays/disorder, receptive language impairment, dysmorphism, and/or borderline IQ. The presence of ASD appears to be less commonly associated with a genetic diagnosis. Motor speech conditions are distinctive, socially debilitating disorders, and clinical genomic testing should be warranted to end the diagnostic odyssey and position them for precision therapies.