Abstract
Huntington’s disease (HD) is an autosomal dominant neurodegenerative disease with the age at which characteristic symptoms manifest strongly influenced by inherited HTT CAG length. Somatic CAG expansion occurs throughout life and understanding the impact of somatic expansion on neurodegeneration is key to developing therapeutic targets. In 57 HD gene expanded (HDGE) individuals, ~23 years before their predicted clinical motor diagnosis, no significant decline in clinical, cognitive or neuropsychiatric function was observed over 4.5 years compared with 46 controls (false discovery rate (FDR) > 0.3). However, cerebrospinal fluid (CSF) markers showed very early signs of neurodegeneration in HDGE with elevated neurofilament light (NfL) protein, an indicator of neuroaxonal damage (FDR = 3.2 × 10−12), and reduced proenkephalin (PENK), a surrogate marker for the state of striatal medium spiny neurons (FDR = 2.6 × 10−3), accompanied by brain atrophy, predominantly in the caudate (FDR = 5.5 × 10−10) and putamen (FDR = 1.2 × 10−9). Longitudinal increase in somatic CAG repeat expansion ratio (SER) in blood was a significant predictor of subsequent caudate (FDR = 0.072) and putamen (FDR = 0.148) atrophy. Atypical loss of interruption HTT repeat structures, known to predict earlier age at clinical motor diagnosis, was associated with substantially faster caudate and putamen atrophy. We provide evidence in living humans that the influence of CAG length on HD neuropathology is mediated by somatic CAG repeat expansion. These critical mechanistic insights into the earliest neurodegenerative changes will inform the design of preventative clinical trials aimed at modulating somatic expansion. ClinicalTrials.gov registration: NCT06391619.
Similar content being viewed by others
Main
Huntington’s disease (HD) is a devastating condition characterized by loss of striatal medium spiny neurons (MSNs) and striatal neurodegeneration1 leading to impaired motor, cognitive and neuropsychiatric function which typically manifests in middle age, with clinical diagnosis defined by the appearance of unequivocal HD-related motor signs. There are currently no disease-modifying treatments2.
HD is an autosomal dominant disorder and is caused by an expanded CAG repeat ≥40 in the huntingtin gene (HTT) coding for polyglutamine in the mutant huntingtin protein (mHTT), which is the presumed toxic entity leading to neuronal dysfunction and death. It is well established that inherited CAG repeat length has a strong influence on age at clinical motor diagnosis3. Notably, the HTT repeat is somatically unstable4 and expansion of tens or even hundreds of repeats are observed in the most vulnerable striatal neurons5,6,7,8; greater somatic expansion occurs with longer initial CAG length. Evidence indicating that faster individual-specific rates of somatic expansion in brain are associated with earlier clinical motor diagnosis and faster disease progression9 strongly suggests that somatic expansion is a key mechanism explaining the CAG effect on disease progression. Indeed, it has been suggested that somatic expansion is required to generate pathology, and that HD involves two thresholds as follows: first, the inherited CAG length that leads to further somatic expansion, and second, the intracellular pathogenic threshold above which neuronal dysfunction and death occur10,11,12,13. Consistent with this, a recent postmortem study suggests that neurons may experience decades of ‘biologically quiet’ somatic CAG repeat expansion with neuronal damage triggered by a cascade of repeat-length dependent transcriptional dysregulation events only when the CAG reaches a threshold of ~150 repeats8. Further understanding the dynamics of somatic expansion directly in the brain is hampered by the nonavailability of brain biopsy material from young living participants. Although somatic CAG expansion is clearly cell-type dependent6,7,8, faster individual-specific rates of somatic expansion in blood DNA are also associated with earlier clinical motor diagnosis14, suggesting that individual-specific somatic expansion rates in blood DNA are at least partially predictive of individual-specific somatic expansion rates in the brain. This hypothesis is supported by genetic modifier studies that reveal a panoply of DNA repair gene variants as modifiers of both HTT somatic expansion and HD clinical phenotypes13,14,15,16,17.
The polyglutamine-encoding CAG repeat tract in HTT is followed just downstream with a polymorphic polyproline-encoding CCG repeat. Typically, the intervening sequence between the CAG and CCG repeat tracts is comprised of a glutamine-encoding CAACAG cassette and a proline-encoding CCGCCA cassette. However, a number of atypical HTT repeat structures have been identified with loss of either or both of the intervening CAACAG or CCGCCA cassettes associated with an earlier age at clinical motor diagnosis; conversely, duplication of the CAACAG cassette delays this milestone13,14,17,18,19. These data reveal that both HD age at clinical motor diagnosis and the somatic expansion potential of the repeat are best predicted by pure CAG repeat length, rather than encoded polyglutamine length, providing additional support for a key role for somatic expansion in driving disease onset13,14,18.
The monogenic nature of HD and the existence of diagnostic and predictive testing for at-risk family members makes it a tractable disease and much progress has been made towards developing disease modification treatments2. The first phase 1/2 trial of an antisense oligonucleotide (ASO), tominersen, showed dose-dependent lowering of mutant huntingtin levels20. Although the subsequent phase 3 trial was halted early due to adverse safety concerns21, a phase 2 study to better establish safety and tolerability earlier in disease progression is ongoing (ClinicalTrials.gov registration: NCT05686551). Alternative approaches such as allele-specific huntingtin-lowering, protein splicing modulation, and gene therapy are also currently being trialed (reviewed in ref. 22). Additionally, somatic expansion and proteins, such as MSH3 and FAN1, are now being actively pursued as therapeutic targets in HD. A key question in using such therapies will be determining the optimal timing for treatment. The appearance of HD motor signs is already accompanied by substantial striatal neurodegeneration, and earlier treatment seems likely to produce greater clinical benefit. However, all the studies to date have relied on postmortem brain analyses to model the link between CAG repeat expansion to the earliest pathological progression of the disease. Understanding the triggers of the neurodegenerative process is vital in the search for future therapies and identifying the best time to treat to provide therapeutic intervention.
The greatest opportunity to influence disease progression lies in early treatment, with the goal of delaying or preventing clinical motor diagnosis. Numerous large observational studies show that brain changes occur decades from predicted clinical motor diagnosis23,24,25 and that subtle cognitive and motor signs emerge as HD gene expanded (HDGE) individuals approach clinical motor diagnosis. The recent introduction of the HD Integrated Staging System (HD-ISS) provides a new empirical framework for classifying people with HD throughout life26, with stage 0 being the HDGE group with striatal volumes within the general population range, stage 1 being the presence of a biomarker of pathogenesis (caudate and/or putamen volume change), stage 2 being the presence of motor and/or cognitive signs and stage 3 being marked by the onset of functional impairment26. Cohorts in the earliest stages will likely gain the most benefit from preventative therapies.
A key challenge in delivering preventative treatments is to identify and validate robust measures in HD-ISS stages 0 and 1, where the absence of outward signs of impairment renders established motor and cognitive testing batteries insensitive. HD Young Adult Study (HD-YAS) is a unique cohort, ~23 years from predicted clinical motor diagnosis at baseline with deep phenotyping including biofluid, imaging, clinical, cognitive and motor assessments. Our cross-sectional baseline data demonstrated subtle elevations in biofluid biomarkers, such as cerebrospinal fluid (CSF) neurofilament light (NfL), accompanied by slightly smaller putamen volumes in the HDGE group compared to unaffected controls25. Despite this, there was no difference in functional performance between the groups. This cohort, therefore, spans an optimum window for investigating the potential of interventions to delay or prevent symptoms.
Here we present 4.5-year follow-up data from HD-YAS, a deep-phenotyped longitudinal study of young stages 0 and 1 HDGE adults, ~19 years before clinical motor diagnosis. We hypothesized that the effects of somatic expansion in the brain might be detected long before clinical motor onset and tested this hypothesis through detailed longitudinal analysis of preclinical HD phenotypes, biomarkers of neurodegeneration and somatic expansion in blood DNA. We examined change over time in a range of assessments with the aim of identifying ongoing neuropathology and associations with somatic CAG expansion in blood DNA and HTT repeat structures, decades before predicted clinical motor diagnosis, and biomarkers of disease progression, which may have utility in future prevention trials.
Results
Participant characteristics
A total of 131 (64 HDGE and 67 controls) participants attended at baseline and 103 (57 HDGE and 46 controls) returned for follow-up ~4.5 years later (see Extended Data Fig. 1 for reasons for dropout). To account for those not returning, we recruited 23 new participants (9 HDGE and 14 controls) giving a total of 154 participants (73 HDGE and 81 controls). At baseline, 44 (81%) participants of the cohort were in HD-ISS stage 0, 9 (17%) in stage 1 and 1 (2%) in stage 2 (Fig. 1a). Over 4.5 years, 10 (~23%) participants moved from stage 0 to stage 1; there was no progression to stage 2. The transition in staging within the HD-YAS cohort is depicted by overlaying the probability matrix for each HD-ISS stage across different ages for individuals with a mean CAG repeat length of 42, comparable to the mean CAG repeat length of our cohort (Fig. 1b). Here we describe further longitudinal results from the participants; cross-sectional results, updated from the original baseline study, are provided in Supplementary Results and Discussion.
a, The distribution of HD-ISS stages at baseline and at follow-up 4.5 years later. b, A probability matrix for being in each HD-ISS stage across different ages for individuals with a mean CAG repeat length of 42, which is comparable to the HD-YAS cohort. These probabilities are derived from data in the Enroll-HD, PREDICT-HD and TRACK-HD studies, which were used to develop the HD-ISS26. The black dashed box highlights the HD-YAS cohort at baseline, while the red box indicates their position at follow-up after 4.5 years. c, A radar plot showing group differences in longitudinal changes in cognitive measures. d, A radar plot showing group differences in longitudinal changes for neuropsychiatric and functional measures. The black line represents the standardized mean difference between the HDGE and control groups, with conventional frequentist 95% CI shaded in gray. The red circle denotes no difference between means; values within this circle indicate greater change over time in the HDGE group. After FDR correction for multiple comparisons, there were no significant longitudinal group differences in any cognitive or neuropsychiatric measures. Further details on longitudinal changes in cognitive measures can be found in Supplementary Table 1 and neuropsychiatric measures in Supplementary Table 2. Cross-sectional changes in cognitive measures are presented in Supplementary Table 3 and neuropsychiatric changes in Supplementary Table 4. Cross-sectional findings are visualized in Supplementary Fig. 1. AMI, Apathy Motivation Index; BIS, Barratt Impulsivity Scale; CI, confidence interval; ED, extra dimensional; FSBS, Frontal Systems Behavioral Scale; IED, intra–extra-dimensional set shifting; OCI, Obsessive-Compulsive Inventory; OTS, One Touch Stockings; PAL, paired associates learning; PSQI, Pittsburgh Sleep Quality Index; RVP, rapid visual processing; RVP A’, a signal detection theory measure of target sensitivity and mean response latency; SDMT, Symbol Digit Modalities Test; SF-36, 36-item self-report survey; SSRT, stop-signal reaction time; SSTAI, Speilberger State-Trait Anxiety Inventory; SWM, spatial working memory; ZSDS, Zung Self-rating Depression Score.
There were no significant differences (false discovery rate (FDR) < 0.15) between the HDGE and control groups in age, sex, interval between visits, education score or National Adult Reading Test (a measure of premorbid intelligence; Extended Data Table 1).
Cognitive and neuropsychiatric assessments
There was no significant longitudinal disease-related decline in any of the comprehensive cognitive (FDR > 0.8; Fig. 1c) or neuropsychiatric (FDR > 0.3; Fig. 1d) assessments, demonstrating that change in the HDGE group was no different from matched controls. Cross-sectional results are shown in Supplementary Fig. 1 and summary statistics are provided in Supplementary Tables 1 and 2 for longitudinal and Supplementary Tables 3 and 4 for cross-sectional results.
Neuroimaging
After quality control, longitudinal data were available for 88 (54 HDGE and 34 controls) participants for volumetric imaging, 83 (50 HDGE and 33 controls) for diffusion-weighted imaging (DWI) and 75 (43 HDGE and 32 controls) for multiparametric mapping (MPM). As left-handed participants were excluded, 70 (43 HDGE and 27 controls) participants were available for the structural connectivity analysis. See Supplementary Table 5 and Supplementary Methods for further details.
The HDGE group showed significantly greater rates of atrophy in putamen (P = 4.0 × 10−10, FDR = 1.2 × 10−9) and caudate (P = 1.1 × 10−10, FDR = 5.5 × 10−10). There were also significant group differences for gray matter (P = 7.5 × 10−3, FDR = 9.4 × 10−3), white matter (P = 1.4 × 10−2, FDR = 1.4 × 10−2) and whole brain (P = 7.1 × 10−4, FDR = 1.2 × 10−3) with associated ventricular expansion (P = 3.9 × 10−5, FDR = 9.8 × 10−5; Fig. 2). Caudate, putamen and white matter loss were significantly predicted by age and CAG (P = 2.1 × 10−7, FDR = 1.0 × 10−6; P = 1.5 × I0−8, FDR = 8.9 × 10−8; P = 0.01, FDR = 0.012, respectively).
a–f, Putamen (a), caudate (b), gray matter (c), white matter (d), whole brain (e) and ventricles (f) are shown. For each structure, we present (i) comparison of standardized residuals (age- and sex-adjusted) for the annualized rate of change in HDGE (n = 54; red) and control (n = 34; gray) groups, (ii) comparison of standardized residuals for annualized rate of change within HDGE by follow-up HD-ISS stage 0 (orange) and stage 1 (green) and (iii) scatterplots of volume by CAP100 score, colored by HD-ISS stage within HDGE. Repeated visits per participant are connected by black lines, with baseline shown as squares and follow-up as circles. HD-ISS stages are represented as follows: stage 0 (orange), stage 1 (green) and stage 2 (blue). Negative standardized residuals denote a rate of change below the adjusted mean across groups. Each box plot displays the median (horizontal line), interquartile range (box) and whiskers extending to 1.5× IQR. Sample sizes (n) reflect biological replicates per group, with n = 54 for HDGE and n = 34 for controls; data represent longitudinal measures per participant, with no technical replicates. Volumetric change analyses for brain structures, excluding the putamen, used a single boundary-shift integral measure or voxel-based morphometry measure of scan pairs per participant (baseline to follow-up) converted to annual rates and modeled by ordinary least squares regression. Putamen changes were calculated by subtracting baseline MALP-EM segmentations from follow-up segmentations and dividing the result by the follow-up duration. Analysis results and residual adjustments reflect control for baseline age, sex and their interaction. Statistical two-sided group comparisons were adjusted for multiple comparisons using the FDR, with P values, degrees of freedom and confidence limits provided in Extended Data Table 2. CAP, CAG-Age Product; ICV, intracranial volume; IQR, interquartile range.
DWI demonstrated elevated rates of longitudinal change in all diffusion and neurite orientation and dispersion density imaging metrics across multiple regions of interest in the HDGE group compared to controls (FDR < 0.15). The splenium of the corpus callosum, the anterior capsule and the external capsule showed associations with age and CAG (FDR < 0.15). There were no significant between-group differences in the rate of change for any of the structural connectivity (all FDR > 0.4) or MPM measures (all FDR > 0.3), nor any evidence of an influence of age and CAG (all FDR > 0.15).
Neuroimaging results suggest that across HD-ISS stages 0 and 1, there are already elevated rates of brain atrophy accompanied by subtle microstructural white matter changes. See Extended Data Tables 2 and 3 for summary statistics for longitudinal volumetric and diffusion results, respectively. Summary statistics for remaining longitudinal metrics are provided in Supplementary Tables 6 and 7 and cross-sectional data in Supplementary Tables 8–11.
Biofluids
A total of 216 biofluid samples were collected across baseline and follow-up visits over the 4.5-year interval. Paired fasting CSF and plasma samples were acquired in 86 (53 HDGE and 33 controls) of the 103 (83.5%) longitudinal participants.
From a significantly increased baseline, CSF NfL (Fig. 3a) and CSF YKL-40 (also known as chitinase-3 like-protein-1 (CHI3L1)) (Fig. 3c) rose more rapidly in HDGE compared to controls (P = 3.2 × 10−13, FDR = 3.2 × 10−12 and P = 0.01, FDR = 0.056, respectively). New to this timepoint, proenkephalin (PENK), a surrogate marker for striatal MSN state, measured in CSF, showed a significant longitudinal reduction in HDGE individuals compared to controls (P = 4.4 × 10−4, FDR = 2.6 × 10−3; Fig. 3b). An increase in plasma NfL was nonsignificant (P = 0.336, FDR = 0.669; Extended Data Fig. 2).
a–c, CSF NfL (a), CSF PENK (b) and CSF YKL-40 (c) are shown. For each biofluid biomarker, we present (i) comparison of standardized residuals (age- and sex-adjusted) for the annualized rate of change in HDGE (n = 48; red) and control (n = 30; gray) groups, (ii) comparison of standardized residuals for annualized rate of change within HDGE by HD-ISS stage 0 (orange) and stage 1 (green) and (iii) scatterplots of biofluid marker levels by CAP100 score, colored by HD-ISS stage within HDGE. Repeated visits per participant are connected by black lines, with baseline shown as squares and follow-up as circles. HD-ISS stages are represented as follows: stage 0 (orange), stage 1 (green) and stage 2 (blue). Negative standardized residuals denote a rate of change below the adjusted mean across groups. Each box plot displays the median (horizontal line), interquartile range (box) and whiskers extending to 1.5× IQR. Sample sizes (n) reflect biological replicates per group, with n = 48 for HDGE and n = 30 for controls; data represent longitudinal measures per participant. All statistical analyses were conducted using mixed-effect linear models with a participant-specific random effect, controlling for age, sex and their interaction. Natural log-transformed concentrations served as the outcomes in these models. Statistical two-sided group comparisons were adjusted for multiple comparisons using the FDR, with P values, degrees of freedom and confidence limits provided in Supplementary Table 17. Please note one prominent outlier in the control group with marked NfL elevation, as previously reported at baseline25. This outlier showed no additional cause on further investigation, with normal T1 MRI brain scan and normal CSF white and red cell counts. Additionally, this control participant did not deviate from other biofluid or cognitive parameters and was, therefore, not excluded from the analysis.
Cross-sectionally, log concentrations of both CSF NfL (P = 5.4 × 10−30, FDR = 6.5 × 10−29) and PENK (P = 1.7 × 10−7, FDR = 1.0 × 10−6) were highly associated with age, CAG length and their interaction. There was also evidence for an influence on longitudinal change in CSF NfL (P = 0.027, FDR = 0.322) and PENK (P = 0.0547, FDR = 0.328). Plasma NfL had a similar cross-sectional association (P = 7.2 × 10−7, FDR = 2.9 × 10−6) but no significant longitudinal association with age and CAG. Regression coefficients are reported in Supplementary Tables 12–14.
Slightly higher annualized rates of change in NfL in CSF and plasma were observed in the HDGE group at stage 0 compared to stage 1 on follow-up but did not reach the threshold of significance (FDR > 0.15). Mean CSF NfL levels (across both visits) were higher in HD-ISS progressor (stages 0 to 1—mean = 6.89 pg ml−1, log scale) compared to nonprogressors (stage 0 to 0—mean = 6.11 pg ml−1, log scale; stage 1 to 1—mean = 6.37 pg ml−1, log scale; Supplementary Table 15). After adjusting for age, sex and their interaction, the difference between stage 0 to 1 progressors and stage 0 nonprogressors was statistically significant (P = 0.0004). Similarly, the difference between stage 0 to 1 progressors and stage 1 nonprogressors was significant (P = 0.045) when controlling for age and sex, but nonsignificant without these adjustments. No significant differences were observed for plasma NfL levels (Supplementary Table 16).
CSF mHTT levels were notably very low than later disease stages27, with only 38.3% (n = 41/107) of samples exceeding the lower limit of quantification and demonstrating an acceptable coefficient of variation below 30% (Supplementary Fig. 2).
The rate of change in other biofluid markers, including plasma NfL, CSF and plasma tau, CSF and plasma glial fibrillary acidic protein (GFAP), CSF and plasma ubiquitin carboxyl-terminal hydrolase L1 (UCH-L1), and CSF interleukin-6 (IL-6) and IL-8, showed no significant differences between groups (Extended Data Fig. 2). Additionally, none of the fluid biomarkers, including NfL, had an association with age, CAG or age-by-CAG interaction (FDR > 0.15). See Supplementary Table 17 for longitudinal and Supplementary Table 18 for cross-sectional summary statistics.
Somatic expansion ratios in blood
Significant longitudinal increases in the somatic expansion ratio (SER) were detected in blood DNA in the HDGE group over 4.5 years (P = 2.0 × 10−8), with SER clearly increasing as early as HD-ISS stage 0 (Fig. 4a). SER rates of change were strongly influenced by an accelerating effect of CAG repeat length (P = 3.0 × 10−5).
a, Longitudinal changes in SER—(i) SER trajectories by CAP100 and HD-ISS stage with baseline visits represented by squares and follow-up visits by circles, and lines connecting data from the same individual, where stage 0 is shown in orange, stage 1 in green, and stage 2 in blue; (ii) changes in SER between visits and (iii) changes in SER by age and CAG repeat length. b,c, Associations between longitudinal SER increase and caudate (b) and putamen (c) volume change, (i) before and (ii) after age-by-CAG correction. d,e, Associations of baseline SER with cross-sectional CSF NfL (d) and CSF PENK (e) levels, with (i) before and (ii) after age-by-CAG correction. Associations were modeled via mixed effects regression using the measure on the vertical axis as the outcome and controlled for age, sex and age-by-sex interaction. Longitudinal caudate change based on a single boundary-shift integral measure per participant was an exception where an analogous ordinary least-squares model was employed.
HTT allele structures
The majority of the HDGE group exhibited the typical HTT repeat structure on their expanded allele (n = 66, 91.6%), while a small subset (n = 6) showed atypical allelic variations (Fig. 5a). Specifically, the CAACAG duplication was observed in 1 (1.4%) participant, the CAACAGCCGCCA double loss was found in 4 (5.6%) and 1 (1.4%) had the CCGCCA loss.
a, Illustration of the HTT repeat structure and allelic variations in the HDGE cohort (n = 72), including the typical structure and four atypical variants—CAACAG duplication (green), CAACAG loss (black, not observed in cohort), CAACAG CCGCCA loss (red) and CCGCCA loss (blue). b, Illustration of atypical allele differences for key biomarker measures. The ANOCOVA models controlled for CAG length, sex and age, including age interactions with CAG and sex. Participant-specific random effects were included except for caudate change based on one boundary-shift integral per participant. (i) The effect on caudate volume change, with significant differences between typical alleles and CAACAG CCGCCA loss (P < 0.0001) and a trend with CCGCCA loss (P = 0.050). (ii) The effect on putamen volume change, with significant differences between typical alleles and CAACAG CCGCCA loss (P = 0.007). (iii and iv) Cross-sectional effects on CSF NfL (iii) and CSF PENK (iv) levels, respectively, with significant differences noted for CAACAG CCGCCA loss. Statistically significant comparisons (P < 0.05) are indicated by asterisks. Please note no longitudinal imaging for CAACAG duplication, hence no plot in (i) or (ii).
Predictors of progression
Baseline NfL, both plasma and CSF, and CSF PENK were predictors of atrophy over time in all brain regions (all FDR < 0.04), even after controlling for the effect of age and CAG (all FDR < 0.12; Extended Data Table 4). Rate of change in caudate and putamen was most strongly associated with change in CSF NfL (P = 3.0 × 10−4, FDR = 0.003 and P = 2.2 × 10−4, FDR = 0.003, respectively) and plasma NfL (P = 0.002, FDR = 0.01 and P = 0.03, FDR = 0.06, respectively) and the association remained after controlling for age and CAG effects (all FDR < 0.09). Rates of change in caudate and putamen were also associated with longitudinal change in CSF PENK before (P = 2.0 × 10−4, FDR = 0.003 and P = 1.0 × 10−4, FDR = 0.001, respectively) and after (P = 0.002, FDR = 0.021 and P = 9.0 × 10−4, FDR = 0.011, respectively) age-by-CAG correction.
Longitudinal increase in SER was a significant predictor of the rate of subsequent caudate volume change before (P = 0.01, FDR = 0.04) and after age-by-CAG correction (P = 0.03, FDR = 0.07; Fig. 4b). Longitudinal increase in SER was also a significant predictor of the rate of subsequent putamen volume change before (P = 0.02, FDR = 0.07) and after (P = 0.049, FDR = 0.148) age-by-CAG correction (Fig. 4c). Baseline SER was strongly associated with cross-sectional levels of CSF NfL (P = 2.5 × 10−12, FDR = 2.9 × 10−11; Fig. 4d) and CSF PENK (P = 8.4 × 10−5, FDR = 3.4 × 10−4; Fig. 4e) before age-by-CAG correction. However, these associations did not remain significant after the correction (CSF NfL—P = 0.827, FDR = 0.956; CSF PENK—P = 0.908, FDR = 0.956).
After controlling for CAG, age, age-by-CAG, sex and SER effects, compared to typical allele structure, the loss of CAACAG CCGCCA atypical allele had significant effects on rates of caudate (P = 1.90 × 10−5; Fig. 5b.i) and putamen (P = 0.007; Fig. 5b.ii) atrophy as well as cross-sectional CSF NfL (P = 0.002; Fig. 5b.iii) and CSF PENK (P = 0.001; Fig. 5b.iv) levels, with the loss of the intervening CAACAG CCGCCA associated with an accelerated neurodegenerative course (Extended Data Fig. 3). Notably, after correction for pure CAG length, there was no detectable association between atypical allele structure and SER (FDR > 0.15).
Sample size calculations
Extended Data Table 5 shows hypothetical sample size calculations for those variables with significant longitudinal effects in the HDGE group. For a 50% treatment effect over 2 years in stages 0 and 1, total sample sizes would be 232, 282 and 326 for rates of change in CSF NfL levels, caudate and putamen volume, respectively. For a 3-year trial, these numbers would be reduced to 104, 126 and 146, respectively.
Discussion
We have used state-of-the-art multimodal measures of cognition, neuroimaging, genetics and biofluid markers in a new assessment battery to study a unique cohort of young adult HDGE who were at baseline, on average, approximately 23 years before predicted clinical motor diagnosis, comparing them to matched controls in an unprecedented level of detail. Our baseline cross-sectional data identified early signs of neurodegeneration despite the maintenance of intact brain function25 and here we present 4.5-year follow-up data with important new mechanistic insights into what drives neurodegeneration in humans carrying the HD mutation (Fig. 6).
This graphical abstract illustrates the proposed pathways linking somatic expansion and its effects on biomarkers in HD-YAS. Inherited CAG repeat length is identified as the primary driver of disease progression in HD57. Red arrows represent observed data associations in HD-YAS and blue arrows reflect assumed causal relationships. The black bidirectional arrow under mechanisms indicates somatic expansion in WBCs as a proxy for CNS expansion, based on shared inherited genetic modifiers14,17. The black bidirectional arrow under biomarkers shows associations between elevated CSF NfL (marker of neuroaxonal damage) and reduced CSF PENK (surrogate of striatal MSN state) with the earliest caudate and putamen volume changes. Somatic expansion is influenced by inherited CAG repeat length, age and DNA mismatch repair gene variants14,17. DNA mismatch repair, highlighted in the schematic and shown as a repeat loop-out mismatch icon, is a key mechanism linking inherited CAG repeat length to somatic expansion16 with repair activity increasing with longer CAG repeat lengths58. Within the CNS, somatic expansions substantially contribute to disease progression, as supported by recent postmortem research8. While brain biopsy would be the gold standard for direct assessment of CNS somatic expansion in vivo, WBC-derived somatic expansion is detectable peripherally, showing early and longitudinal changes by HD-ISS stages 0 and 1. The biomarkers section shows associations of somatic expansion with the earliest caudate and putamen volume changes, and CSF NfL and PENK levels. The age continuum from HD-ISS stage 0 to stage 1 illustrates early detection of somatic expansion and biomarker changes, influencing pathology from stage 0 onward. HD-YAS provides in vivo evidence that somatic expansion drives early pathology during these early stages, highlighting its potential as a promising therapeutic target in proof-of-concept clinical trials at HD-ISS stages 0 and 1 to slow or prevent further neurodegeneration before clinical motor diagnosis. WBC, white blood cell. The figure is created with BioRender.com.
Our data highlight the role of inherited CAG repeat length and somatic expansion on neurodegeneration, decades before clinical motor diagnosis. We identify brain atrophy, elevated levels of CSF NfL, a marker of neuronal damage, and reduced levels of CSF PENK, a marker of striatal MSN state, in the earliest adult HD cohort studied to date. Despite evidence for the start of the neurodegenerative process, there is an absence of any decline in cognitive, motor or neuropsychiatric function at HD-ISS stages 0 and 1. Notably, we show that somatic CAG repeat expansion measured longitudinally in blood, a validated measure of somatic expansion in living patients14,17, is a predictor of the effect of CAG repeat length on striatal markers of very early neurodegeneration.
Consistent with the elevated levels of CSF NfL we reported at baseline25, we now show substantially greater rates of increase in CSF NfL in HDGE compared to controls, indicating accelerating neuroaxonal injury from the earliest stages. Most notably, the rate of change in CSF NfL in HD-ISS stage 0 was at least as fast as in stage 1, suggesting rapid neuroaxonal injury increases even before reaching the threshold of caudate or putamen volumetric loss cutoff for stage 1. Interestingly, mean CSF NfL levels were higher in HD-ISS stage 0 to 1 progressors compared to nonprogressors in both stage 0 and stage 1. The annualized rates of increase in CSF NfL across the whole HDGE group (mean = 63.38 pg ml−1 yr−1) are slightly lower than those reported in the previous HD-CSF cohort (mean = 79.16 pg ml−1 yr−1)27, which is consistent with the HD-YAS cohort being towards the beginning of the neurodegenerative process.
Axonal damage and injury lead to leakage of NfL into the CSF28,29,30 and are elevated in active inflammation31. NfL is a nonspecific marker of neuronal injury, and elevated levels have been reported in other neurodegenerative conditions32,33,34,35,36,37,38,39. Increases in CSF NfL are not necessarily attributable to neuronal death and could result from other degenerative processes such as leaky axons. Nevertheless, it is a clear marker of neuroaxonal pathology and therefore understanding CSF NfL temporal dynamics and kinetics can provide valuable insights into mechanisms in neurodegenerative diseases29.
Previously, cross-sectional studies have revealed lower levels of CSF PENK in manifest HD compared to other neurodegenerative conditions40, as well as compared to HDGE before clinical motor diagnosis and controls41,42. Our longitudinal findings in a larger cohort, and our demonstration of a significant association between PENK levels and striatal imaging measures, serve to substantially strengthen the rationale for using PENK as a surrogate marker for striatal MSN state.
Astrocytes are implicated in disease processes through both cell-autonomous and non-cell-autonomous mechanisms43,44, with one key study identifying a core signature of astrocyte genes with expression altered by mHTT in both humans and mouse models44. A recent study provided the first evidence of mHTT-induced alterations in basal pro-inflammatory cytokine production in microglia without immune stimulation, along with a reduction in endocytic and phagocytic activity in mHTT-bearing microglia under basal conditions, suggesting a possible role for microglial cell-autonomous inflammation and activity in the early stages of HD45. Consistent with our previous findings of elevated microglial marker CSF YKL-40 levels at baseline25, we now show greater rates of increase in CSF YKL-40 longitudinally in the HDGE group compared to controls. However, we do not observe significant longitudinal changes in pro-inflammatory cytokine markers IL-6 and IL-8, which are components of the innate immune system, nor in GFAP, an intermediate filament protein of astrocytes associated with astroglial activation46. It is known that mHTT is expressed in microglia47 and that microglial activation correlates with severity later in the disease48, where mHTT-induced dysfunction of central nervous system (CNS) immune cells is closely linked to pathogenesis49. We postulate that the isolated elevation of YKL-40 may be due to both cell-autonomous and non-cell-autonomous mechanisms at play with activation driven by mHTT dysregulation of astrocytes, rather than general gliosis, which would be additionally indicated by a concomitant rise in GFAP. Our findings suggest that astrocytic dysfunction is more prominent than any abnormal innate immune response at this stage of the disease, as IL-6 and IL-8 levels, which are upregulated in HD and correlate with disease progression49,50, remained unchanged longitudinally, reinforcing the importance of treating early at this stage, before widespread neuroinflammation occurs.
The presence of neuronal damage within HD-ISS stage 0 is further supported by the evidence of substantially elevated rates of brain atrophy and a corresponding reduction in CSF PENK levels. Stages 0 to 1 progressors also had substantially higher elevations in CSF NfL than stage 0 nonprogressors. The substantially higher rates of caudate and putamen atrophy and global brain measures and their association with disease burden suggest that neurodegenerative processes are already occurring across our cohort and at the earliest ages observed in this study. This atrophy was measurable in those with basal ganglia volumes distributed throughout the volume range observed in unaffected controls, implying the beginning of detectable neurodegeneration. In addition to these changes seen at the macrostructural level, diffusion imaging provides evidence that there is ongoing very early microstructural white matter damage. The strong predictive power of baseline NfL (in both plasma and CSF) for subsequent atrophy in all brain regions further supports the suggestion that there is early neuroaxonal damage which leads to macroscopic effects such as brain atrophy.
Despite the evidence of ongoing pathological changes in our stages 0 and 1 cohort, neurodegeneration is not yet impacting measurable function as we saw no significant disease-related decline in any of the cognitive, neuropsychiatric or functional measures. Previous work has shown that such changes only become evident from HD-ISS stage 2 (ref. 51).
We demonstrate the accumulation of somatic expansion of the HTT CAG repeat in blood DNA over time in HD-ISS stages 0 and 1 and, critically, show that it is associated with both brain atrophy and CSF NfL, a marker of neuronal–axonal injury, and CSF PENK, a surrogate marker of striatal MSN state. A higher inherited CAG length was associated with a faster increase in SER over time. SER was associated with caudate and putamen atrophy, both cross-sectionally and longitudinally, even after controlling for age-by-CAG interactions. Baseline SER was strongly associated with cross-sectional levels of CSF NfL and CSF PENK before age-by-CAG correction; however, these associations did not remain significant after the correction. We postulate that bioassay measurements demonstrate higher variability and noise compared to striatal volume measurements. Therefore, the lack of significance in associations with CSF NfL and PENK does not undermine the significant association between the longitudinal increase in SER and volumetric changes in the caudate and putamen. Additionally, the statistical strength of the influence of CAG length on atrophy was weakened in models also controlling for blood SER. Assuming that, via the common baseline CAG length effects and shared genetic modifiers, SER measured in blood is an indirect quantifiable indicator of the greater somatic expansion occurring in neurons, these results may be seen as providing in vivo evidence for the key role of somatic CAG repeat expansion in very early HD pathology in humans (Extended Data Fig. 4), reinforcing the putative pathological role of somatic expansion as a critical factor in disease progression5,6,7,8,9.
If the recent suggestion from HD postmortem brains that asynchronous somatic expansion leads to asynchronous stochastic crossing of the transcriptional dysregulation threshold and asynchronous neuronal death8 is correct, then our data would support the hypothesis that somatic expansion is already an active process in the brain and that some neurons have already crossed a critical repeat length threshold ~20 years before clinical motor diagnosis. Indeed, this phenomenon is both predicted by the stochastic models and consistent with autopsy observations of early neuronal loss8. This would suggest that suppressing CAG repeat somatic expansion from this point in the disease process could prevent additional neurons from passing the neuronal toxicity threshold and reduce neurodegeneration before functional deficits are manifest. Therapeutic agents targeting DNA repair proteins that modify somatic expansion show great potential, with MSH3 as a particularly attractive target for HD and other repeat expansion disorders52, and various MSH3-targeting therapeutics are currently under development53,54. To this end, somatic expansion of CAG repeats in blood DNA could be a useful biomarker to demonstrate target engagement of somatic expansion-suppressing therapies with peripheral exposure.
Within our cohort, a small number of individuals carried atypical CCGCCA or CAACAG CCGCCA loss of intervening sequence HTT alleles. These atypical structures have a high potential to cause mid-estimation of the CAG repeat length13,14,17,18,19, and using the MiSeq-derived CAG lengths changed the mean baseline years to predicted clinical motor diagnosis in HD-YAS from 24 to 23 years. After correcting for pure CAG length, these structures have previously been associated with earlier clinical motor diagnosis13,14,17,18,19. Consistent with this, we find those participants with the loss of intervening sequence structures exhibit higher rates of caudate and putamen atrophy, and have some of the greatest elevations in CSF NfL and reductions in CSF PENK, which together suggest an acceleration of the degenerative process (Fig. 5b). Detecting these effects in such small numbers so early in the course of disease suggests these synonymous DNA structural differences are exerting a substantial influence on the rate of neuropathological change. Interestingly, after correcting for pure inherited CAG there was no residual association between these allele structure variants and SER. This is consistent with previous work in other cohorts in blood, postmortem brains and cell lines14,17,19 showing that the loss of the intervening CAACAG CCGCCA does not increase the rate of CAG expansion over and above the effects of pure CAG length. Relevant available brain data is limited so it is still possible that the CAACAG CCGCCA loss increases CAG expansions in brain but not blood. An alternative hypothesis is that, after correcting for pure CAG, the residual disease-modifying mechanism of the CAACAG CCGCCA loss is independent of somatic expansion of the HTT repeat via effects on RNA transcription, RNA stability, or canonical or repeat-associated non-ATG translation (Extended Data Fig. 3). Regardless, these variants clearly have a profound impact on the disease course.
This work not only provides evidence to support the potential of therapies targeting somatic expansion but also identifies robust markers of disease progression, which may have utility as likely surrogates for future preventative clinical trials. CSF NfL, PENK and brain atrophy measures have the potential to monitor disease progression in HD-ISS stages 0 and 1, where clinical endpoints are not applicable. Change in CSF NfL level has previously been used as an outcome measure for a trial of the ASO nusinersen55 in children with spinal muscular atrophy. Earlier treatment initiation was also associated with a larger decrease in CSF NfL levels, underscoring the importance of early intervention to preserve neuronal health.
At this stage of the disease, CSF mHTT levels are very low, with only 38.3% of samples in the HDGE group exceeding the detection level. These findings underscore the limitations of available CSF mHTT assays and confirm there is an urgent need for a reliable assay capable of detecting very low concentrations of mHTT in HDGE, ideally at attomolar levels, if HTT-lowering therapies are to be pursued in stage 0 and 1 HDGE cohorts.
Our extensive phenotypic characterization of HD-ISS stages 0 and 1 may allow us to enrich recruitment for future preventative trials. For example, we demonstrate that baseline NfL and PENK levels predict subsequent brain atrophy, and the potential to establish cutoffs for enriching HD-ISS stage 0 based on these biofluids holds significant promise. Harmonization of HD-YAS with existing cohorts across the disease spectrum such as HD-CSF and HDClarity (ClinicalTrials.gov: NCT02855476) will help to establish reliable cutoffs for inclusion. Another important consideration in clinical trial design is that atypical repeat structures, although infrequent, substantially affect disease progression and may additionally impact therapeutic efficacy. Identification of these rare cases through MiSeq will be important to control for these effects and more accurately assess treatment efficacy.
If these biomarkers can serve as likely surrogate outcomes, sample size calculations suggest feasible numbers for clinical trials in an HD-ISS stage 0/1 cohort given sufficiently large treatment effects. For example, in a clinical trial over 3 years with a 50% treatment effect, 104 participants would be required with CSF NfL as an outcome measure, with 126 for caudate and 146 for putamen atrophy. Notably, the caudate boundary-shift integral measure of change we use here is already well-validated and has previously been used in the laquinimod trial in HDGE with a clinical motor diagnosis56.
In summary, the results presented strongly support the hypotheses that individual-specific somatic expansion in blood DNA predicts individual-specific somatic expansion in the brain. We show in living participants, decades before clinical motor diagnosis, that somatic expansion of the CAG repeat appears to be an important driver of the earliest pathological disease processes, as evidenced by its association with striatal atrophy rates and CSF NfL and PENK levels. Somatic expansion of repeats underlying disease pathogenesis is likely relevant to many repeat expansion diseases, where similar DNA repair mechanisms may play a role. With new therapies in development to target the DNA repair proteins that are known to influence somatic expansion, our results are timely in demonstrating its association with measurable disease markers. By intervening with therapies targeting somatic CAG repeat expansion at the start of the neurodegenerative process, that is, HD-ISS stages 0 and 1 decades before clinical motor diagnosis, while function remains intact, there is the very real possibility that treatments can delay or even prevent the appearance of clinical signs. To this end, we have identified robust measures of early pathology with potential to act as possible biomarker surrogates of disease progression, and identified the ideal cohort for intervention to delay or prevent clinical motor diagnosis.
Methods
Participant characteristics
Participants were recruited across the UK and enrolled at one study site (University College London (UCL)). The inclusion criteria are detailed in the Supplementary Methods. Participants (131 in total, 64 HDGE and 67 controls) attended at baseline and 103 (57 HDGE and 46 controls) returned for follow-up approximately 4.5 years later.
The study was registered on ClinicalTrials.gov (NCT06391619) where the study protocol and the predefined statistical analysis plan are provided. All participants underwent comprehensive assessment of clinical, cognitive and neuropsychiatric function, neuroimaging, blood sampling, and optional CSF collection consistent with the baseline procedure (Supplementary Methods)25. See Extended Data Table 6 for a list of assessments and Supplementary Table 5 for missing data. Supplementary Methods provides further details for all assessments.
Ethics
The study received approval by the London—Bloomsbury Research Ethics Committee (22/LO/0058). All study procedures adhered to principles outlined in the Declaration of Helsinki, and before enrollment, written consent was obtained from all participants.
Clinical, cognitive and neuropsychiatric assessments
A clinical examination was performed including assessment for lumbar puncture suitability.
All the HDGE participants with longitudinal neuroimaging (n = 54) were staged according to the HD-ISS26 at each visit. The longitudinal pipeline of FreeSurfer version 6 (https://surfer.nmr.mgh.harvard.edu/pub/dist/freesurfer/) was used to derive caudate and putamen segmentations to classify stages 0 and 1, and stage 2 was defined by participants reaching the age- and education-adjusted cutoffs for the Symbol Digit Modalities Test and/or Total Motor Score. Additionally, predicted years to clinical motor diagnosis were linked to the standardized CAG-Age-Product (CAP) score. A CAP score of 100 occurs at the CAG-specific expected age of motor diagnosis59.
All cognitive and neuropsychiatric tasks from baseline were repeated at follow-up with two exceptions. Due to participant feedback, we replaced the Progressive Ratio Task from baseline with the Goals Prior Assay task at follow-up to assess the motivational domain. Additionally, Stroop interference was included within the core cognitive tasks in follow-up. See Supplementary Methods and Supplementary Figs. 3–5 for details of the cognitive and neuropsychiatric testing battery.
Neuroimaging
Scanning was performed on a 3T Siemens Prisma (Siemens Healthineers, Erlangen, Germany) and parameters were consistent between baseline and 4.5-year follow-up. Neuroimaging assessments included volumetric T1-weighted imaging (T1W), DWI and MPM. We applied the same predefined regions of interest (ROI) that were examined at baseline. Volumes were extracted from T1W images and mean values across the relevant ROI were derived for (1) standard DWI and neurite orientation and dispersion density imaging (NODDI) metrics60, (2) structural connectivity (right-hand dominant participants only) and (3) MPMs.
Longitudinal imaging changes were derived by subtraction of values, except for direct measures of change for some volumetric ROIs. The boundary-shift integral is a direct measure of change between positionally matched (registered) serial images, which is more sensitive to longitudinal change than subtraction61. This technique was used for whole brain, ventricles and caudate62. Within-participant voxel-compression maps were derived and convolved with gray and white matter maps generated by voxel-based morphometry24 to estimate volume change within these tissues.
Details of diffusion, structural connectivity and MPM processing pipelines are provided in Supplementary Methods and Supplementary Fig. 6.
Biofluids
Biofluids were collected at baseline and follow-up under the same standardized, well-validated conditions, methods and equipment63. To remove potential batch effects, biofluid samples collected at baseline were reanalyzed in parallel with the follow-up samples, employing the assays detailed in Supplementary Table 19. Quantification of analytes was performed blinded to group status.
Measurements in CSF included mHTT, NfL protein, total tau (tau), GFAP, UCH-L1, YKL-40 (also known as chitinase-3 like-protein-1 (CHI3L1)), IL-6 and IL-8. In plasma, NfL, tau, GFAP and UCH-L1 were quantified. The Neurology 4-Plex A (GFAP, NfL, Tau and UCH-L1) was measured in singlicate, yielding the following inter-plate coefficients of variation (%CV): CSF GFAP (3.7%), CSF NfL (8.2%), CSF Tau (11.1%), CSF UCH-L1 (62.8%), plasma GFAP (5.6%), plasma NfL (5.0%), plasma Tau (11.1%) and plasma UCH-L1 (63.8%). The %CVs were calculated from duplicate measurements of internal plate controls made of pooled human plasma. CSF IL-6, IL-8 and YKL-40 were measured in duplicate, with the following %CVs: CSF YKL-40 (10.4%), CSF IL-6 (29.2%) and CSF IL-8 (12.2%). CSF mHTT was measured in triplicate and the relative abundance of CSF PENK was quantified in single measurements.
Unbiased liquid chromatography-mass spectrometry (LC–MS)-based proteomics analysis was performed using the tandem mass tag (TMT) technique64 to measure relative abundance of CSF PENK. CSF samples were prepared including an initial multi-affinity depletion step to reduce interference by high-abundant blood-derived proteins (Supplementary Methods). Following this step, samples were subjected to reduction and alkylation of cysteine residues, digestion with trypsin and endoproteinase Lys-C and isobaric labeling using TMTpro 18-plex reagents65 (Thermo Fisher Scientific). TMT multiplex peptide samples were fractionated by high-pH reversed-phase high-performance LC (HPLC)66,67, and analyzed by nano-HPLC (EasyLC, Thermo Fisher Scientific) coupled to a high-resolution Orbitrap hybrid mass spectrometer (Orbitrap Lumos Tribrid, Thermo Fisher Scientific). Protein identification and data processing for quantification was performed using Proteome Discoverer 2.5 (Thermo Fisher Scientific), and R Statistics. More details are provided in Supplementary Methods.
HTT CAG repeat structure and somatic expansion
DNA was extracted from whole blood using the chemagic 360-D instrument (Perkin Elmer) for automated DNA extraction. The modal length of the pure HTT CAG repeat, the HTT repeat structure and quantity of blood HTT somatic expansions in the HDGE group were determined by ultra-deep amplicon MiSeq sequencing14,68. The HTT MiSeq reads were processed and genotyped using ScaleHD (v1 (ref. 14)) and RGT (https://github.com/hossam26644/RGT). The SER14 of the HTT CAG repeat was quantified from MiSeq data at baseline and follow-up, allowing for longitudinal assessment of somatic expansion changes during the interval between the two visits. CAG repeat length estimated using MiSeq was used for all statistical analyses of associations.
Somatic expansion mediation models
Within the HDGE group, we estimated statistical associations between white blood cell (WBC) somatic expansion ratios with NfL, PENK and with volumetric brain measures. The analyses were performed with and without correction for age, CAG and age-by-CAG interaction. The models also controlled age, sex and age-by-sex interaction as covariates. We modeled cross-sectional SER versus biomarker relationships via random effects regression as described earlier. We estimated relationships between baseline SER and longitudinal change in SER versus longitudinal change in biomarkers using ordinary least square regression as described above for other models of volumetric direct-change rates. For NfL and putamen volumes, we calculated change rates by subtraction of baseline from follow-up values.
To assess the potential causal implications of SER-biomarker associations, we compared the statistical strength of SER as a predictor of the biomarker before and after controlling for age and CAG length (Extended Data Fig. 4 illustrates a schematic of the underlying causal reasoning). A substantially weakened SER relationship after age and CAG control would suggest that the associations may be due to mutual influence of CAG length over time without more direct causality. Conversely, we assessed the strength of the age–CAG versus biomarker relationships with and without SER control. Weakened age–CAG associations with a biomarker in the presence of both a significant predictive SER effect on the biomarker and a significant age–CAG association with SER are consistent with an intermediate causal role for CNS somatic expansion (as indirectly assessed by SER in WBC DNA). These statistical comparisons are the essence of statistical assessments of plausible causality in formal causal models. In the simple, cross-sectional case, the degree of mediation is often expressed by the relative reduction in the adjusted correlation or regression coefficient of a single distal variable. However, the analyses here involve the joint mediation of two terms—CAG length and its interaction with age. Mediation is no longer simply quantified by this approach. Hence, we instead emphasize the hypothesis testing aspect of the underlying statistical tests.
We used the same regression methods as for the somatic expansion models to assess associations between log NfL levels and volumetric outcomes. However, we attempted no causal interpretation of NfL versus volumetric relationships. Although these are both HD-related biomarkers, they are considered concurrent indicators of the same pathology and there is no well-justified conception of one of these changes ‘causing’ the other.
Sample size estimates
Approximate sample size estimates for clinical trials involving equal allocation to one treatment group and a placebo group were calculated based on observed HDGE group versus control longitudinal models. The group differences in longitudinal rates were converted to Cohen’s d effect sizes using within-group longitudinal standard deviations derived from estimated random effect variances. Assumed treatment effects were then defined as a percent slowing of the group difference in longitudinal rates adjusted for the assumed trial length. These should be considered somewhat optimistic order-of-magnitude estimates. We could not estimate potential increased within-group variance over time (and resultant sample size increase) based on only two observed time points. We did not factor in trial dropout rates. On the other hand, we did not consider potential sample size reduction due to efficient (but perhaps controversial) estimates of treatment-induced slope deviation derived from repeated measures over time, but instead based calculations on net group differences at the end of a trial. See Supplementary Methods for further details.
Statistical analysis
Unless otherwise noted, all statistical analyses were performed in accordance with a prespecified statistical analysis plan. Analyses testing the relationship between caudate and putamen volumes or NfL levels to observed or predicted disease progression were defined as primary hypotheses, with a statistical significance level of P = 0.05. Analyses of potential disease associations for the rest of the wide-ranging test battery were considered exploratory and primarily assessed by estimated FDR calculated using the Benjamini–Hochberg method69. FDR was calculated separately per measurement domain (cognitive, volumetric imaging, bioassays, etc.) to examine conceptually and technically sensible sets of underlying P values. We suggest an FDR threshold of 15% for identifying exploratory results of interest.
Demographic group comparisons were conducted by t test for continuous variable, by chi-square test for sex differences and by Mann-Whitney U test for education level. Both the longitudinal and repeated-measure cross-sectional analyses were performed by maximum likelihood random effect regression modeling with (necessarily) only random intercepts per participant. The primary regression predictors of interest were group status (control versus HDGE) and in models testing age and CAG effects, terms for age, MiSeq-derived CAG length and age-by-CAG interactions nested within the HDGE. The joint significance of age and CAG effects were estimated by the maximum likelihood test for differences in model fit. The models also contained covariates for sex, a non-nested age term and sex-by-age interaction. The non-nested age term was included to make HDGE group-specific aging effects estimable. Cognitive models also included International Standard Classification of Education education level and estimated intelligence quotient via the National Adult Reading Test score. Due to the consistent skewness of measured distributions, logarithmic transformations were used on all bioassay concentration measures.
Longitudinal volumetric analyses of brain structures other than putamen relied on a single measure of volume change per participant derived from pairs of baseline and follow-up scans. These changes were converted to annual rates and modeled by ordinary least squares regression with predictor variables analogous to those listed above. All other models of longitudinal change used total values at the two visits as outcomes and longitudinal effects of primary predictors and covariates were estimated by interactions with follow-up time between the visits. Those models also retained all baseline effects for predictors and covariates. To preserve the unbiased estimation of model parameters when data is missing at random, baseline data from participants with no follow-up were included in the model.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
We are committed to data sharing while maintaining confidentiality due to the sensitive and potentially identifiable nature of these data. Biofluid samples will not be shared due to the limited amount of material available. The remaining samples will be required for replication for the next HD-YAS visit. Upon reasonable request, data will be made available 24 months after the end of data collection, through application via UCL to the Principal Investigator, Professor Sarah Tabrizi. Researchers will be required to submit a proposal meeting the research criteria and must demonstrate full GDPR compliance. A data access agreement with UCL will be required.
Code availability
All software is freely available with the exception of the in-house MIDAS software used to generate the boundary-shift integral for caudate, whole brain and ventricles. This can be requested from Professor Nick Fox at the Dementia Research Centre, UCL, UK.
References
Vonsattel, J. P. et al. Neuropathological classification of Huntington’s disease. J. Neuropathol. Exp. Neurol. 44, 559–577 (1985).
Tabrizi, S. J. et al. Potential disease-modifying therapies for Huntington’s disease: lessons learned and future opportunities. Lancet Neurol. 21, 645–658 (2022).
Lee, J.-M. et al. CAG repeat expansion in Huntington disease determines age at onset in a fully dominant fashion. Neurology 78, 690–695 (2012).
Monckton, D. G. The contribution of somatic expansion of the CAG repeat to symptomatic development in Huntington’s disease: a historical perspective. J. Huntingtons Dis. 10, 7–33 (2021).
Kennedy, L. et al. Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis. Hum. Mol. Genet. 12, 3359–3367 (2003).
Shelbourne, P. F. et al. Triplet repeat mutation length gains correlate with cell-type specific vulnerability in Huntington disease brain. Hum. Mol. Genet. 16, 1133–1142 (2007).
Mätlik, K. et al. Cell-type-specific CAG repeat expansions and toxicity of mutant Huntingtin in human striatum and cerebellum. Nat. Genet. 56, 383–394 (2024).
Handsaker, R. E. et al. Long somatic DNA-repeat expansion drives neurodegeneration in Huntington disease. Preprint at bioRxiv https://doi.org/10.1101/2024.05.17.592722 (2024).
Swami, M. et al. Somatic expansion of the Huntington’s disease CAG repeat in the brain is associated with an earlier age of disease onset. Hum. Mol. Genet. 18, 3039–3047 (2009).
Kaplan, S., Itzkovitz, S. & Shapiro, E. A universal mechanism ties genotype to phenotype in trinucleotide diseases. PLoS Comput. Biol. 3, e235 (2007).
Donaldson, J. et al. What is the pathogenic CAG expansion length in Huntington’s disease? J. Huntingtons Dis. 10, 175–202 (2021).
Hong, E. P. et al. Huntington’s disease pathogenesis: two sequential components. J. Huntingtons Dis. 10, 35–51 (2021).
Genetic Modifiers of Huntington’s Disease (GeM-HD) Consortium. CAG repeat not polyglutamine length determines timing of Huntington’s disease onset. Cell 178, 887–900.e14 (2019).
Ciosi, M. et al. A genetic association study of glutamine-encoding DNA sequence structures, somatic CAG expansion, and DNA repair gene variants, with Huntington disease clinical outcomes. EBioMedicine 48, 568–580 (2019).
Moss, D. J. H. et al. Identification of genetic variants associated with Huntington’s disease progression: a genome-wide association study. Lancet Neurol. 16, 701–711 (2017).
Rajagopal, S. et al. Genetic modifiers of repeat expansion disorders. Emerg. Top. Life Sci. 7, 325–337 (2023).
Genetic Modifiers of Huntington’s Disease (GeM-HD) Consortium & Lee, J.-M. et al. Genetic modifiers of somatic expansion and clinical phenotypes in Huntington’s disease reveal shared and tissue-specific effects. Preprint at bioRxiv https://doi.org/10.1101/2024.06.10.597797 (2024).
Wright, G. E. B. et al. Length of uninterrupted CAG, independent of polyglutamine size, results in increased somatic instability, hastening onset of Huntington disease. Am. J. Hum. Genet. 104, 1116–1126 (2019).
Dawson, J. et al. A probable cis-acting genetic modifier of Huntington disease frequent in individuals with African ancestry. HGG Adv. 3, 100130 (2022).
Tabrizi, S. J. et al. Targeting huntingtin expression in patients with Huntington’s disease. N. Engl. J. Med. 380, 2307–2316 (2019).
McColgan, P. et al. Tominersen in adults with manifest Huntington’s disease. N. Engl. J. Med. 389, 2203–2205 (2023).
Estevez-Fraga, C., Tabrizi, S. J. & Wild, E. J. Huntington’s disease clinical trials corner: March 2024. J. Huntingtons Dis. 13, 1–14 (2024).
Paulsen, J. S. et al. Detection of Huntington’s disease decades before diagnosis: the predict-HD study. J. Neurol. Neurosurg. Psychiatry 79, 874–880 (2008).
Tabrizi, S. J. et al. Potential endpoints for clinical trials in premanifest and early Huntington’s disease in the TRACK-HD study: analysis of 24 month observational data. Lancet Neurol. 11, 42–53 (2012).
Scahill, R. I. et al. Biological and clinical characteristics of gene carriers far from predicted onset in the Huntington’s disease Young Adult Study (HD-YAS): a cross-sectional analysis. Lancet Neurol. 19, 502–512 (2020).
Tabrizi, S. J. et al. A biological classification of Huntington’s disease: the Integrated Staging System. Lancet Neurol. 21, 632–644 (2022).
Rodrigues, F. B. Mutant huntingtin and neurofilament light have distinct longitudinal dynamics in Huntington’s disease. Sci. Transl. Med. 12, eabc2888 (2020).
Shaw, G. et al. Hyperphosphorylated neurofilament NF-H is a serum biomarker of axonal injury. Biochem. Biophys. Res. Commun. 336, 1268–1277 (2005).
Paterson, R. W. et al. SILK studies—capturing the turnover of proteins linked to neurodegenerative diseases. Nat. Rev. Neurol. 15, 419–427 (2019).
Khalil, M. et al. Neurofilaments as biomarkers in neurological disorders. Nat. Rev. Neurol. 14, 577–589 (2018).
Zetterberg, H. et al. Neurochemical aftermath of amateur boxing. Arch. Neurol. 63, 1277–1280 (2006).
Sjögren, M. et al. Cytoskeleton proteins in CSF distinguish frontotemporal dementia from AD. Neurology 54, 1960–1964 (2000).
Hall, S. et al. Accuracy of a panel of 5 cerebrospinal fluid biomarkers in the differential diagnosis of patients with dementia and/or parkinsonian disorders. Arch. Neurol. 69, 1445–1452 (2012).
Delaby, C. et al. Differential levels of neurofilament light protein in cerebrospinal fluid in patients with a wide range of neurodegenerative disorders. Sci. Rep. 10, 9161 (2020).
Bech, S. et al. Amyloid-related biomarkers and axonal damage proteins in parkinsonian syndromes. Parkinsonism Relat. Disord. 18, 69–72 (2012).
Feneberg, E. et al. Multicenter evaluation of neurofilaments in early symptom onset amyotrophic lateral sclerosis. Neurology 90, e22–e30 (2018).
Abdo, W. F. et al. CSF neurofilament light chain and tau differentiate multiple system atrophy from Parkinson’s disease. Neurobiol. Aging 28, 742–747 (2007).
Dhiman, K. et al. Cerebrospinal fluid neurofilament light concentration predicts brain atrophy and cognition in Alzheimer’s disease. Alzheimers Dement. 12, e12005 (2020).
Vrillon, A. et al. Comparison of CSF and plasma NfL and pNfH for Alzheimer’s disease diagnosis: a memory clinic study. J. Neurol. 271, 1297–1310 (2024).
Barschke, P. et al. Cerebrospinal fluid levels of proenkephalin and prodynorphin are differentially altered in Huntington’s and Parkinson’s disease. J. Neurol. 269, 5136–5143 (2022).
Niemela, V. et al. Proenkephalin decreases in cerebrospinal fluid with symptom progression of Huntington’s disease. Mov. Disord. 36, 481–491 (2021).
Caron, N. S. et al. Cerebrospinal fluid biomarkers for assessing Huntington disease onset and severity. Brain Commun. 4, fcac309 (2022).
Ilieva, H., Polymenidou, M. & Cleveland, D. W. Non-cell autonomous toxicity in neurodegenerative disorders: ALS and beyond. J. Cell Biol. 187, 761–772 (2009).
Diaz-Castro, B. et al. Astrocyte molecular signatures in Huntington’s disease. Sci. Transl. Med. 11, eaaw8546 (2019).
Stöberl, N. et al. Mutant huntingtin confers cell-autonomous phenotypes on Huntington’s disease iPSC-derived microglia. Sci. Rep. 13, 20477 (2023).
Von Bartheld, C. S., Bahney, J. & Herculano-Houzel, S. The search for true numbers of neurons and glial cells in the human brain: a review of 150 years of cell counting. J. Comp. Neurol. 524, 3865–3895 (2016).
Shin, J.-Y. et al. Expression of mutant huntingtin in glial cells contributes to neuronal excitotoxicity. J. Cell Biol. 171, 1001–1012 (2005).
Pavese, N. et al. Microglial activation correlates with severity in Huntington disease: a clinical and PET study. Neurology 66, 1638–1643 (2006).
Björkqvist, M. et al. A novel pathogenic pathway of immune activation detectable before clinical onset in Huntington’s disease. J. Exp. Med. 205, 1869–1877 (2008).
Wild, E., Björkqvist, M. & Tabrizi, S. J. Immune markers for Huntington’s disease? Expert Rev. Neurother. 8, 1779–1781 (2008).
Tabrizi, S. J. et al. Biological and clinical changes in premanifest and early stage Huntington’s disease in the TRACK-HD study: the 12-month longitudinal analysis. Lancet Neurol. 10, 31–42 (2011).
Flower, M. et al. MSH3 modifies somatic instability and disease severity in Huntington’s and myotonic dystrophy type 1. Brain 142, 1876–1886 (2019).
Benn, C. L., Gibson, K. R. & Reynolds, D. S. Drugging DNA damage repair pathways for trinucleotide repeat expansion diseases. J. Huntingtons Dis. 10, 203–220 (2021).
O’Reilly, D. et al. Di-valent siRNA-mediated silencing of MSH3 blocks somatic repeat expansion in mouse models of Huntington’s disease. Mol. Ther. 31, 1661–1674 (2023).
Olsson, B. et al. NFL is a marker of treatment response in children with SMA treated with nusinersen. J. Neurol. 266, 2129–2136 (2019).
Reilmann, R. et al. Safety and efficacy of laquinimod for Huntington’s disease (LEGATO-HD): a multicentre, randomised, double-blind, placebo-controlled, phase 2 study. Lancet Neurol. 23, 243–255 (2024).
Bates, G. P. et al. Huntington disease. Nat. Rev. Dis. Primers 1, 15005 (2015).
Ferguson, R. et al. Therapeutic validation of MMR-associated genetic modifiers in a human ex vivo model of Huntington disease. Am. J. Hum. Genet. 111, 1165–1183 (2024).
Warner, J. H. et al. Standardizing the CAP score in Huntington’s disease by predicting age-at-onset. J. Huntingtons Dis. 11, 153–171 (2022).
Zhang, H. et al. NODDI: practical in vivo neurite orientation dispersion and density imaging of the human brain. Neuroimage 61, 1000–1016 (2012).
Fox, N. C. & Freeborough, P. A. Brain atrophy progression measured from registered serial MRI: validation and application to Alzheimer’s disease. J. Magn. Reson. Imaging 7, 1069–1075 (1997).
Hobbs, N. Z. et al. Automated quantification of caudate atrophy by local registration of serial MRI: evaluation and application in Huntington’s disease. Neuroimage 47, 1659–1665 (2009).
Wild, E. J. et al. Quantification of mutant huntingtin protein in cerebrospinal fluid from Huntington’s disease patients. J. Clin. Invest. 125, 1979–1986 (2015).
Thompson, A. et al. Tandem mass tags: a novel quantification strategy for comparative analysis of complex protein mixtures by MS/MS. Anal. Chem. 75, 1895–1904 (2003).
Li, J. et al. Proteome-wide mapping of short-lived proteins in human cells. Mol. Cell 81, 4722–4735.e5 (2021).
Batth, T. S., Francavilla, C. & Olsen, J. V. Off-line high-pH reversed-phase fractionation for in-depth phosphoproteomics. J. Proteome Res. 13, 6176–6186 (2014).
UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
Ciosi, M. et al. Library preparation and MiSeq sequencing for the genotyping-by-sequencing of the Huntington disease HTT exon one trinucleotide repeat and the quantification of somatic mosaicism. Protoc. Exch. https://doi.org/10.1038/protex.2018.089 (2018).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
Acknowledgements
We are grateful to all the study participants and their families who have so generously given their time to the HD Young Adult Study. We would also like to thank the staff at the Wellcome Centre for Human Neuroimaging (London, UK) and the Leonard Wolfson Experimental Neurology Centre (London, UK). LC–MS analysis was performed at the Proteomics Core Facility, Sahlgrenska Academy, Gothenburg University, with financial support from SciLifeLab and BioMS. We thank Annabelle Coleman and Alexiane Touzé for their help with blood collection and processing, Siti Binte Mohd Ikhsan for help with collecting cognitive and neuropsychiatric data, Peter McColgan for assistance with the structural connectivity pipeline and Davina Hensman-Moss and Emily Gantman for helpful discussions on the paper. Funding for this study was provided by the following: Wellcome Collaborative Award (223082/Z/21/Z to S.J.T., R.I.S., M.F., M.J.M., N.Z.H., M.L., C.L., K.F., T.W.R., J.B.R., B.J.S., D.G.M. and D.R.L.); the CHDI Foundation (a not-for-profit organization dedicated to finding treatments for Huntington’s disease) funded CSF collection; the UK Dementia Research Institute (DRI; London, UK) through UK DRI, principally funded by the UK Medical Research Council; University College London Hospital/University College London (London, UK), supported by the National Institute for Health and Care Research (NIHR) University College London Hospitals Biomedical Research Centre; Wellcome Trust (220258 to J.B.R.); Medical Research Council (MC_UU_00030/14 and MR/T033371/1 to J.B.R.) and NIHR Cambridge Biomedical Research Centre (NIHR203312 to J.B.R.). H.Z. is a Wallenberg Scholar and a Distinguished Professor at the Swedish Research Council.
Author information
Authors and Affiliations
Contributions
R.I.S., C.S., H.Z., E.J.W., G.R., B.J.S., D.G.M. and D.R.L. gained funding for this study. R.I.S., K.F., E.J.W. and G.R. designed the study. R.I.S., M.F., M.J.M., N.Z.H., K.F. and O.T. contributed to recruitment. R.I.S., N.Z.H., M.L., H.K., C.S.P. and I.B.M. contributed to image acquisition and analysis. M.F., M.J.M., M.N., C.E.F. and Y.H. contributed to clinical assessments. M.F. and M.J.M. contributed to biofluid collection. M.F., J.G., J.R., S.W. and A.H. contributed to biofluid processing and analysis. R.I.S., M.F., M.J.M., M.L., C.L. and K.F. contributed to cognitive and neuropsychiatric assessments. C.L., T.W.R. and B.J.S. contributed to cognitive and neuropsychiatric battery design. M.C. and N.K.P.P. contributed to somatic expansion analysis. K.F. contributed to project management. K.F. and O.T. contributed to biofluid processing. H.H. contributed to the radiological review process. J.D.L. and D.R.L. performed statistical analysis. C.S. contributed to data interpretation. H.Z. designed biofluid processing methods and analysis. J.B.R. contributed to cognitive battery design. D.G.M. contributed to the somatic expansion processing and analysis. D.R.L. wrote the Statistical Analysis Plan. R.I.S., M.F., C.L., M.C., B.J.S., D.G.M., D.R.L. and S.J.T. drafted the paper. S.J.T. was PI of the Wellcome Trust Collaborative Award and had overall responsibility for study design, project management, ethics approval, analysis and drafting the paper. All authors contributed to the interpretation of the data and reviewed the paper.
Corresponding author
Ethics declarations
Competing interests
J.B.R. is supported by the Wellcome Trust (220258), the Medical Research Council (MC_UU_00030/14; MR/T033371/1) and the NIHR Cambridge Biomedical Research Centre (NIHR203312). The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care. J.B.R. has undertaken paid consultancy for Asceneuron, Astronautx, Astex, ClinicalInk, CumulusNeuro, Curasen, Invivro, Prevail and SVHealth; received research funding unrelated to the current work from AstraZeneca, Lily, GSK and Janssen; and is Chief Scientific Advisor to Alzheimer’s Research UK. D.G.M. has been a scientific consultant and/or received honoraria/grants from AMO Pharma, Dyne, F. Hoffman-La Roche, Function Rx, LoQus23, MOMA Therapeutics, Novartis, Ono Pharmaceuticals, Pfizer Pharmaceuticals, Rgenta Therapeutics, Sanofi and Sarepta Therapeutics. J.G. is supported by Alzheimerfonden (AF-980746) and Stiftelsen för Gamla tjänarinnor (2022-01324). J.B.R. has appeared as an expert witness to the Medicines and Healthcare products Regulatory Agency, unrelated to the current work. D.G.M. is on the Scientific Advisory Board of the Myotonic Dystrophy Foundation and EuroDyMA (European Dystrophia Myotonica Association), is a scientific advisor to the Myotonic Dystrophy Support Group and is a vice president for research of Muscular Dystrophy UK. G.R. is a nonexecutive director of UCL Business. D.R.L. is an unpaid academic member of the Critical Path Institute HD-RSC Consortium Coordinating Committee. B.J.S. is a co-inventor of the Cambridge Neuropsychological Test Automated Battery. I.B.M. is an associate editor for Frontiers in Neurology. The other authors declare no competing interests.
Peer review
Peer review information
Nature Medicine thanks Harry Orr and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Jerome Staal, in collaboration with the Nature Medicine team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Flowchart of recruitment, follow-up and total cumulative assessments.
Flowchart detailing participant enrollment at baseline (2017–2019) and the retention and recruitment of new participants at follow-up (2022–2024) in the Huntington’s Disease Young Adult Study (HD-YAS). CSF, cerebrospinal fluid; DBS, disease burden score; FHx, family history; HDGE, HD gene expanded; IVF, in vitro fertilization; LP, lumbar puncture; TMS, Total Motor Score; UHDRS, Unified Huntington’s Disease Rating Scale.
Extended Data Fig. 2 Annualized rate of change in non-significant biofluid markers.
Panel shows comparison of standardized residuals for annualized rate of change in the HDGE vs. control groups for: (a) CSF Tau, (b) CSF GFAP, (c) CSF UCH-L1, (d) plasma Tau, (e) plasma GFAP, (f) plasma UCH-L1, (g) CSF IL-6, (h) CSF IL-8 and (i) plasma NfL. Negative standardized residuals indicate that the rate of change was less than the adjusted mean rate of change across both groups. The horizontal lines represent medians, boxes indicate the upper and lower quartiles, and whiskers are 1.5× IQR. All statistical analyses were conducted using mixed-effect linear models with a participant-specific random effect, controlling for age, sex and their interaction. Natural log-transformed concentrations served as the outcomes in these models. Statistical two-sided group comparisons and correlations controlled for the effects of age and sex and were corrected for multiple comparisons using the FDR. CSF, cerebrospinal fluid; FDR, false discovery rate; GFAP, glial fibrillary acidic protein; HDGE, HD gene expanded; IL, interleukin; IQR, interquartile range; NfL, neurofilament light; UCH-L1, ubiquitin carboxyl-terminal hydrolase L1; YKL-40, also known as chitinase-3 like-protein-1 (CHI3L1).
Extended Data Fig. 3 The effect of atypical HTT allele structure on blood somatic expansions does not explain their effect on HD phenotypes and biomarkers.
(a) Schematic representation of the absence of effect of the atypical HTT allele structure on blood somatic expansion in the HD-YAS cohort and of their relative effect on caudate and putamen volumes and on CSF NfL levels after age-by-CAG correction. (b) Graphical representation of the HTT allele structures observed in the HD-YAS cohort. On the left-hand side is the pure CAG tract, which is likely the primary substrate of the somatic HTT repeat instability machinery and is the primary determinant of the rate of somatic HTT repeat expansion and HD pathology. Center, the sequence variants intervening between the CAG and CCG tracts, which define the atypical HTT allele structures observed (that is, CAACAG duplication, CCGCCA loss and CAACAG CCGCCA loss).
Extended Data Fig. 4 Conceptual causal model.
Conceptual model illustrating the causal relationships among age-related CAG repeat length (age, CAG), CNS somatic expansion, WBC somatic expansion and biomarkers. The squares in gray depict measured data, while the circles in purple represent unobservable quantities. The black, bidirectional arrow signifies the assumption that somatic expansion in WBCs serves as a proxy for somatic expansion in the CNS. Red arrows indicate correlations between observable data and the underlying biological processes of interest. Blue arrows are presumed to reflect true causal pathways indirectly. We note that DNA mismatch repair is a key mechanism linking inherited CAG repeat length to somatic expansion, with mismatch repair activity increasing with longer CAG repeat lengths. *Note additional pathways could be involved due to the imprecise and indirect measurement of actual CNS expansion. Abbreviations: CNS=Central Nervous System. WBC=White Blood Cell.
Supplementary information
Supplementary Information
Supplementary Methods, results and discussions, Tables 1–19 and Figs. 1–6.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Scahill, R.I., Farag, M., Murphy, M.J. et al. Somatic CAG repeat expansion in blood associates with biomarkers of neurodegeneration in Huntington’s disease decades before clinical motor diagnosis. Nat Med 31, 807–818 (2025). https://doi.org/10.1038/s41591-024-03424-6
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/s41591-024-03424-6
This article is cited by
-
Degeneration decades before disease onset
Nature Reviews Neurology (2025)
-
Revealed: Why the fatal Huntington’s gene takes so long to cause harm
Nature (2025)
-
Evaluating finger-prick blood collection for remote quantification of neurofilament light in neurological diseases
Journal of Neurology (2025)