Introduction

Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease characterized by progressive muscle weakness resulting from motor neuron loss. This condition typically leads to death from respiratory failure within two to four years of diagnosis1. The etiology of ALS remains elusive, involving a complex interplay of genetic factors, chronic environmental exposures, and lifestyle influences. Furthermore, current therapeutic options are limited and unable to reverse disease progression. The poor prognosis of ALS imposes a significant burden on families and society. Therefore, early identification and management of ALS risk factors are critical for establishing effective prevention strategies.

Numerous observational studies have investigated the relationship between muscle mass and ALS2,3,4. Traditionally, ALS involves muscle denervation during disease progression, and reduced muscle mass at diagnosis is associated with accelerated disease progression and decreased survival5,6,7,8. However, recent studies have highlighted that skeletal muscle also plays a key role in the incidence of ALS9,10,11. The gold standard for muscle mass measurement includes dual-energy X-ray absorptiometry, computed tomography (CT), bioelectrical impedance analysis (BIA), and magnetic resonance imaging (MRI)12,13,14. However, these methods are not widely used due to their high cost, radioactivity and infeasibility15. Therefore, the use of accessible and cost-effective serum markers is particularly crucial. Recently, a novel serum creatinine-to-cystatin C ratio (CCR) developed by Kashani et al., has shown a positive relationship with muscle mass16. This ratio has demonstrated satisfactory diagnostic accuracy for muscle mass in various diseases17,18,19,20 and has been associated with the severity of muscle loss in a cohort of patients with ALS21. Despite this, it remains unclear whether lower CCR can increase the incidence of ALS. Given that CCR is influenced by factors such as age, sex, body weight, and renal function, we conducted stratified analyses for these covariates individually22,23.

Therefore, the main purpose of this study was to investigate the relationship between CCR and ALS incidence. And this was done by using survey data of the UK Biobank.

Methods

Study population

The study population was derived from the UK Biobank, a large-scale longitudinal cohort study. Over 500,000 participants aged 40–69 years were recruited between 2006–2010. The participants were from 22 baseline assessment centers in England (89%), Scotland (7%), and Wales (4%)24. During the baseline assessment, participants provided sociodemographic, lifestyle, and health-related information through touch-screen questionnaires and verbal interviews, underwent a series of physical examinations, and provided biological samples. Longitudinal samples were obtained through cohort-wide electronic medical records. All participants provided informed consent.

Of the 502,357 participants at baseline in the UK Biobank, 78 participants reported a personal history of ALS at baseline, 33,475 participants lacked information on serum creatinine and cystatin C, and 21,859 participants had an estimated glomerular filtration rate (eGFR) of < 60 mL/min/m2 using CKD-EPI Cystatin C equation25. After excluding these participants, data from 446,945 individuals were included in the primary analysis (Fig. 1).

Fig. 1
figure 1

Flowchart of the study. Participants with blood tests and traceable outcome data were eligible for Cox proportional hazards analysis after excluding those with prevalent ALS or eGFR < 60 mL/min/m2. ALS = amyotrophic lateral sclerosis; eGFR = estimated glomerular filtration rate; CCR = creatinine-to-cystatin C ratio; BMI = body mass index.

The UK Biobank study received ethical approval from the North West Multi-center Research Ethics Committee (https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics). Further information about the biobank data can be found at https://www.ukbiobank.ac.uk/. This study was conducted under the UK Biobank resource request number 108832.

Incident ALS

The primary outcome of this study, ALS events, was determined from hospitalized patient records in England, Scotland, and Wales. Hospital records were obtained from the Hospital Episode Statistics (HES) in England, the Scottish Morbidity Record in Scotland, and the Patient Episode Database in Wales. The diagnosis of ALS was classified according to the International Classification of Diseases, 10th Revision codes (ICD-10), G12.2 (motor neuron disease). ALS is used as an umbrella term for this group of motor neuron diseases. The reliability of identifying an ALS diagnosis in the UK biobank has been well established and with high positive predictive value estimates ranging from 70 to 91%26.

Serum creatinine-to-cystatin C ratio collection and calculation

The sample collection and processing procedure in the UK Biobank have been described in detail elsewhere. Briefly, blood samples were collected at baseline assessment centers, stored at -80 in two separate locations. Serum creatinine was measured by an enzymatic method on a Beckman Coulter AU5400 clinical chemistry analyzer using the manufacturer’s reagents and calibrators. Serum cystatin C was measured using a latex-enhanced immunoturbidimetric method on a Siemens Advia 18,005. Based on the serum creatinine and cystatin C measurements, CCR was calculated as CCR = creatinine / cystatin C.

Statistical analyses

Based on the tertile boundaries of CCR, participants were divided into 3 groups, with tertile 3 as the reference group27.

The baseline characteristics of participants were summarized for those with and without incident ALS, reported as medians and quartiles for continuous variables and as numbers and percentages for categorical variables.

A multivariate Cox proportional hazards regression model was used to estimate the relationship between CCR and ALS incidence. CCR was entered into the model as both a continuous and categorical variable. The study period spanned from the date of baseline assessment to the date of first diagnosis of ALS, death, loss to follow-up, or end of follow-up, whichever occurred first. The end of the follow-up was based on the availability of medical record data in the UK biobank. The end of follow-up was based on the availability of medical record data in the UK Biobank, which was reviewed on September 30, 2021, in England, July 31, 2021, in Scotland, and February 28, 2018, in Wales. Model 1 was adjusted for demographic variables including age at recruitment and sex. Model 2 was further adjusted for socioeconomics factors, including education and Townsend deprivation index (TDI). Model 3 was further adjusted for lifestyle factors (smoking status, drinking, body mass index (BMI)) and risk factors associated with ALS (diabetes28, serum glucose29,30, C-reactive protein (CRP)31,32, serum albumin33, and sleep duration34). Since ALS often takes more than a year to be diagnosed after the onset of the disease, Model 4 further excluding the ALS patients with less than 1 year of follow-up. We assessed multicollinearity using variance inflation factors (VIF).

Given the differences in CCR by sex, age, and BMI, subgroup analyses were performed by gender (male vs. female), age (< 65 years and ≥ 65 years), and BMI (< 25 kg/m2 and ≥ 25 kg/m2). Since CCR is calculated through two serum markers, both of which were used to assess the kidney function. We also examined the effect of eGFR (< 90 mL/min/m2 and ≥ 90 mL/min/m2) on CCR-ALS association. The above analyses were adjusted for age, sex, education, TDI, smoking status, drinking, BMI, diabetes, serum glucose, CRP, serum albumin, and sleep duration.

A series of sensitivity analyses were conducted to test the robustness of our results. 1) To evaluate the potential reverse causation biases, we performed the analyses by excluding the ALS patients with less than 3 years and 10 years of follow-up. 2) We repeated analyses after imputing missing covariates through multiple imputation. We used predictive mean matching (PMM) to handle missing data, setting the number of imputations to 5 (m = 5). This means that 5 different complete datasets were generated, each providing reasonable estimates for the missing values. This approach helps reduce bias due to missing data and enhances the reliability of the results. 3) We excluded extreme values of CCR (mean ± 3SD) for further analysis.

All statistical analyses were carried out using R version 4.2.1.

Results

Baseline characteristics of study participants

The baseline characteristics of the participants are presented based on incident ALS status (Table 1). Overall, the mean age of the participants was 56.21 years (± 8.07 years), and 204,331 (45.72%) were men. During the median follow-up period of 13.76 years, 589 participants developed ALS. Participants with ALS had higher cystatin C levels (P < 0.001) compared to those without ALS. Additionally, participants with ALS had lower CCR and eGFR (P < 0.05) than those without ALS. There was no significant difference in creatinine levels between participants with and without ALS.

Table 1 Baseline characteristics of participants categorized by with incident ALS and without incident ALS.

Relation between CCR and ALS incidence

The correlation between CCR and ALS incidence was demonstrated by cox regression analyses. Lower CCR was significantly associated with a higher incidence of ALS when adjusting for age and sex in model 1 (hazard ratio (HR): 0.989, 95% confidence interval (CI): 0.982–0.996, P = 0.002). This association remained consistent in model 2 after further adjustment for socioeconomic indicators (education level and TDI) (HR: 0.990, 95%CI: 0.982–0.998, P = 0.011), and in model 3 after additional adjustment for smoking status, drinking, BMI, diabetes, serum glucose, C-reactive protein (CRP), serum albumin, and sleep duration (HR: 0.990, 95%CI: 0.982–0.999, P = 0.026). Participants were divided into three groups based on the tertile boundaries of CCR (73.776 and 86.191). In all three models, the risk of ALS incidence for CCR tertile 1 and CCR tertile 2 were 1.388 (95% CI: 1.032–1.866, P = 0.030) and 1.348 (95% CI: 1.045–1.739, P = 0.021) compared to those in CCR tertile 3. Model 4 remains consistent with previous results (Table 2). No multicollinearity among predictor variables.

Table 2 Multivariate Cox analyses of the CCR associated with ALS incidence.

Subgroup analyses and sensitivity analysis

We conducted subgroup analyses based on age, sex, BMI, and eGFR to explore the effects of variables on CCR and the incidence of ALS. We categorized individuals into two groups: young and middle-aged participants aged < 65 years, and older individuals aged ≥ 65 years. Among participants younger than 65 years, the risk of ALS incidence for those in CCR tertile 1 and CCR tertile 2 were 1.916 (95% CI: 1.366–2.688, P < 0.001) and 1.699 (95% CI: 1.267–2.278, P < 0.001) compared to those in CCR tertile 3. Moreover, smoking status, drinking, BMI, and diabetes do not affect the results (Table 3).

Table 3 The CCR and ALS risk across different sex, age, BMI and eGFR subgroups.

Sensitivity analyses were conducted to evaluate the robustness of the results. Participants with a short latency from initial sampling to diagnosis (3-year lag or 10-year lag) were excluded with the aim of excluding reverse causality, which did not substantially affect the results. Furthermore, the association between CCR and ALS risk was tested by repeating the analyses after imputing missing covariates or removing extreme values of CCR. The conclusion remained stable (Table 4).

Table 4 Sensitivity analysis of the CCR on ALS incidence.

Discussion

The main finding of this study was that lower CCR was significantly correlated with a higher incidence of ALS. Our study provided convincing results after adjusting for numerous potential confounders in Cox regression models. These findings were robust across a range of sensitivity analyses, indicating that they are unlikely to be influenced by confounders or reverse causality. Stratified analysis further revealed that the impact of CCR on ALS incidence is age-dependent.

Serum creatinine and cystatin C are markers used to assess glomerular filtration rate and kidney function35,36. Serum creatinine, primarily generated in muscle through the metabolism of phosphocreatine and creatine, is significantly influenced by the amount of skeletal muscle mass. Its concentration typically remains stable if muscle mass is unchanged. However, serum creatinine levels are also affected by kidney function, which limits its reliability as a sole indicator of muscle mass37,38. Cystatin C is also freely filtered in the glomeruli. Unlike creatinine, this protein is a small nonionic protein excreted by all nucleated cells and usually remains stable. Its production is less affected by muscle mass39. Given the characteristics of serum creatinine and cystatin C, the CCR was developed to assess muscle mass16. It has been validated as a more reliable measure of muscle mass changes in ALS compared to serum creatinine16. Both creatinine and cystatin C can be quickly measured by blood sampling, making CCR a significant advantage for risk monitoring in large populations.

Several observational studies have shown that ALS patients experience muscle mass loss compared to normal controls. Reduced muscle mass serves as an indicator of poor prognosis in ALS and is associated with rapid disease progression6,7,8. Adequate nutritional support may slow ALS progression by preventing muscle mass loss40,41,42. However, these studies cannot determine whether muscle mass loss is an effect of ALS or a result of motor neuron or peripheral nerve degeneration. To our knowledge, this is the first study to demonstrate a prospective relationship between low CCR and the risk of developing ALS decades before a clinical diagnosis. We hypothesized that the previously reported association between low BMI and the incidence of ALS43 may reflect early changes in muscle mass. Our study suggests that muscle mass plays a role in ALS pathogenesis. Targeting skeletal muscle is particularly important, and there have been some advances in this area. For example, mild to moderate physical activity has been shown to increase muscle mass, improve quality of life, and raise ALSFRS-R scores in ALS patients44,45,46. Additionally, preclinical studies have produced encouraging data for drugs that increase muscle mass, including creatine, anabolic androgenic steroids (AAS), and myostatin inhibitors40,41,42. These studies suggest that therapies targeting skeletal muscle in ALS are very promising. Therefore, further clinical trials are needed for additional support and validation.

The exact mechanism linking CCR and ALS remains unclear, but several possibilities exist. Firstly, serum creatinine is produced by the metabolism of creatine and phosphocreatine, which are essential for mitochondrial function as demonstrated in both in vivo and in vitro studies47. Mitochondrial dysfunction has been proved to contribute to ALS pathogenesis48. In addition, several studies show creatine has antioxidant capacity and neuroprotective function49,50. Therefore, CCR may influence ALS through mechanisms involving mitochondrial dysfunction and oxidative stress. Secondly, the neuromuscular junction (NMJ) connects muscle fibers and motor neurons, facilitating communication between them. ALS is the classic example of severely compromised communication between muscles and nerves51. Motor neuron activity regulates muscle physiology and function, while muscles, in turn, affect neuronal activity by sending retrograde signals that preserve NMJ functionality and structure52. The so-called “dying back” hypothesis suggests that retrograde signals from skeletal muscle contribute to the centripetal motor neuron degeneration in ALS53. Thirdly, a recent study indicated that muscle vesicles in ALS patients are toxic to motor neurons, suggesting that skeletal muscle might be a source of vesicle-mediated toxicity in ALS54.

In subgroup analyses, we found that reduced CCR was associated with an increased risk of ALS incidence in participants aged ≤ 65 years. There are few studies on the association between muscle mass and ALS across different age groups. There is a potential mechanism. In the elderly, reduced muscle mass is mainly due to impaired muscle anabolism, whereas in young adults, it is mainly due to increased muscle catabolism driven by various factors55. Hypermetabolism significantly impacts ALS development. Because muscle activity is a major contributor to whole-body energy metabolism, increased muscle catabolism may play a crucial role in ALS hypermetabolism56. However, due to the selection bias of age with 39 to 72 years in the UK Biobank, the age-dependent pattern in the correlation of CCR on ALS may need further exploration.

It is important to note that in the primary analyses, we corrected for conventional demographic variables, socioeconomic factors, and lifestyle factors, as well as ALS-related risk factors, including diabetes, serum glucose, CRP, serum albumin, and sleep duration. Firstly, disorders of glucose metabolism are now gaining great attention in ALS. These disorders usually occur early in the course of ALS, even before disease onset. Our team published a mendelian randomization study demonstrating that genetically predicted type 2 diabetes is associated with a significantly lower incidence of ALS28. Other studies have shown an association between higher blood glucose levels and a reduced risk of ALS29,30. Secondly, CRP, the most commonly studied biomarker of inflammation, has also been linked to ALS. Studies have found that elevated levels of CRP are associated with a higher risk of mortality and faster disease progression in ALS32. Furthermore, a recent prospective study has shown that higher CRP levels are associated with an increased risk of developing ALS31. Thirdly, albumin is a non-glycosylated plasma protein whose metabolism is affected by nutritional intake and inflammatory diseases. Serum albumin is reduced in patients with ALS compared with healthy controls57,58, and lower serum albumin levels at diagnosis are associated with a poorer prognosis in ALS33,59. Finally, several studies have reported a high prevalence of sleep disorders in ALS patients60. Our team has also previously demonstrated an association between sleep and the risk of developing ALS34.

Our study has several methodological strengths. First, because of the rarity of ALS, identifying potential risk factors requires a large cohort population. The sample size of the UK Biobank is sufficiently large to identify potential risk factors for ALS onset. In addition, most epidemiological studies on ALS risk factors are case–control studies, which carry inherent risks of referral, selection, and recall bias. The prospective cohort design of the UK Biobank mitigates these biases to some extent. However, some limitations must be acknowledged. Most participants in this study were of European descent, so the conclusions may not be generalizable to other ethnic groups. We hope to validate our results in other populations in the future. Moreover, most socio-demographic, lifestyle, and health-related factors were self-reported, which may obscure risk estimates due to recall bias. Although we adjusted for major confounders, residual confounding by unknown or unmeasured factors remains a possibility. Furthermore, despite excluding subjects with ALS at baseline and patients with disease onset within 3 and 10 years, residual confounding is likely in any observational study. Reverse causality remains possible, and causality cannot be deduced from the association. Finally, since the diagnosis of ALS was recorded in medical records, the positive predictive value may not reach 70–91%. Therefore, the estimated effect of the CCR was small and could be influenced by diagnostic accuracy.

In conclusion, our study indicates a significant association between decreased CCR and an increased risk of ALS. This evidence suggests that premorbid muscle mass may play a crucial role in the pathogenesis of ALS. Further investigation into the molecular basis of these changes could lead to the development of presymptomatic biomarkers and novel therapeutic targets.