INTRODUCTION

Amyotrophic Lateral Sclerosis (ALS) is a terminal neurodegenerative disease associated with progressive impairment of motor functions1. Expected survival of most patients diagnosed with ALS is 2–5 years after onset2,3. The disease shows significant heterogeneity across individuals, and patients can be classified into subtypes based on multiple features, including the familial or sporadic occurrence of the disease, age of disease onset, symmetry of motor neuron involvement, and initial symptom location4. Most ALS patients experience a progression of symptom severity, which reflects ALS pathology spread in the central nervous system.

Most modern accounts of ALS revolve around two non-exclusive notions. The first is network spreading: that the spread of pathology, likely in the form of pathogenic misfolded proteins (e.g., TDP-43)5,6, occurs via synaptic contacts7,8,9,10,11,12,13,14,15. Here, initial infiltration via the corticospinal tract introduces pathogenic proteins in primary motor cortex, leading to progressive cell death and atrophy. This notion is supported by the existence of stereotypical patterns of pathology progression and the presence of pathology in regions that are topographically distant yet interconnected through the corticospinal tract14. The second notion is that of local vulnerability: that molecular and morphological features of specific cells predispose them to the disease16,17. Namely, the involvement of support cells has been noted in ALS, including astrocytes and pericytes, implicating energy homeostatic and vascular mechanisms. Here local atrophy leads to patterned deafferentation18,19,20,21,22. Importantly, the two perspectives may both be true; namely, pathology may spread via synaptic contacts, but the spread may be amplified in vulnerable neuronal populations, effectively guiding the network spread of atrophy.

What are the principles that shape the spatial patterning of atrophy in ALS? Here we address this question using the Canadian ALS Neuroimaging Consortium (CALSNIC) dataset (http://calsnic.org)23. We first establish that cortical atrophy reflects white matter architecture. We also assess the extent to which the spread of atrophy between brain regions depends on their molecular and cellular similarity, including transcriptomic similarity, neurotransmitter receptor similarity, metabolic similarity, and hemodynamic similarity. We then use methods from epidemiology to identify network epicenters of the disease process in the cortex. We show that cortical epicenters of atrophy co-localize with markers of metabolic and mitochondrial function. Finally, we show that individual differences in epicenter location can distinguish subtypes of patients (bulbar- versus spinal-onset) and correlate with clinical and cognitive function.

RESULTS

Data was derived from the CALSNIC repository23, and comprised N = 192 patients and N = 175 healthy control participants. Atrophy was estimated using deformation based morphometry (DBM), a morphometric technique that has previously been shown to be sensitive to tissue loss in both deep and superficial structures24, and across multiple neurodegenerative syndromes25,26,27,28,29. For details on demographic information, data acquisition and preprocessing, see “Methods”, Tables S1, and S2.

Spatial distribution of atrophy

We initially identify differences between the ALS patients and healthy controls. Figure 1a shows the group-average atrophy map. Throughout the manuscript, we use sign-inverted w-scores such that greater values correspond to greater atrophy: a w-score is a morphometric measure of atrophy that is corrected for differences in age, sex and imaging site. Consistent with previous reports, the map highlights pronounced atrophy throughout the brain, including both grey matter (cortex and subcortex) and white matter27,30,31,32,33,34. In Fig. 1a, negative inverted w-scores (originally positive w-scores) are not shown, these areas represent regions of enlargement and are mainly located in the ventricular and sulcal regions. For more detailed statistical comparisons between ALS patients and controls, refer to ref. 27.

Fig. 1: ALS-related white matter and grey matter atrophy.
Fig. 1: ALS-related white matter and grey matter atrophy.
Full size image

a Sagittal, coronal, and axial views of mean atrophy across individuals with ALS. Atrophy is estimated as the group-average w-score across individuals with ALS. The map is thresholded to exclude voxels with atrophy values smaller than 0. Data is displayed on a T2-weighted MNI template (MNI152-NonLinear2009cSym, 1 × 1 × 1 mm; P = 118, C = 117, A = 87). b Sagittal, coronal, and axial views of white matter tract parcels colored by mean atrophy. White matter tracts are defined based on the JHU white matter tractography atlas (thresholded at 25% probability)35,36. Significant atrophy is observed in bilateral corticospinal tracts (FDR corrected; left: p = 8.83 × 10−16, t-statistic  = 9.62; right: p = 4.13 × 10−16, t-statistic  = 9.84), anterior thalamocortical tracts (FDR corrected; left: p = 6.53 × 10−8, t-statistic  = 6.44; right: p = 2.04 × 10−5, t-statistic  = 5.26) and superior longitudinal fasciculus-posterior limb tracts (FDR corrected; left: p = 7.39 × 10−4, t-statistic  = 4.34; right: p = 1.68 × 10−4, t-statistic  = 4.75). c Cortical atrophy map parcellated based on volume-based Schaefer-400 parcellation44; maps are displayed on fs-LR inflated cortical surfaces. d Mean atrophy is calculated within each of the cytoarchitectonic classes defined by Von Economo45,46,47. The Von Economo classes are depicted in Fig. S1. Statistical significance is estimated using a spatial autocorrelation-preserving spin test (1000 repetitions). Black borders shown on the fs-LR flat cortical surface correspond to the primary motor cortex borders defined by Von Economo cytoarchitectonic parcellation45,46,47.

Given that most cortical atrophy is concentrated in primary motor cortex, we implement an initial sanity check: whether atrophy can correspondingly be observed in the corticospinal tract. We segment the voxel-wise atrophy map using the Johns Hopkins University (JHU) white matter tractography atlas35,36 (Fig. 1b). Analysis of white-matter tracts reveals significant involvement of bilateral corticospinal tracts, bilateral anterior thalamic radiation, and bilateral superior longitudinal fasciculus bundles (p < 0.05, False Discovery Rate (FDR) corrected; Fig. 1b). These projections have previously been associated with ALS and suggest that cortical atrophy is related to white matter atrophy37,37,38,39,40,41, consistent with the notion that neurodegenerative disease pathology affects not only the cell bodies (grey matter) but also axonal projections (white matter)42,43. In the subsequent analyses, we directly assess the relationship between local grey matter atrophy and network connectivity.

To cross-reference the atrophy map with network connectivity data, we apply a high-resolution functional parcellation, subdividing the atrophy map into 400 parcels according to the Schaefer atlas44 (Fig. 1c). The 400-parcel resolution is chosen because it divides the somatomotor cortex into fine parcels that align with body representation. In the cortex, the most pronounced atrophy is observed near the pre-central gyrus, a hub for motor function. Atrophy also extends into the temporal and frontal cortices (Fig. 1c). To test whether the disease selectively targets primary motor cortical neurons, we compute mean atrophy in each of the canonical cytoarchitectural classes according to the histological Von Economo atlas45,46,47 (Fig. S1). We observe significant enrichment of atrophy in the primary motor cytoarchitectonic class (FDR corrected; pspin = 6.99 × 10−3; Fig. 1d). Collectively, these results suggest that the present morphometric approach is sensitive to the pathophysiology of ALS.

Structural connectivity shapes cortical atrophy

We next assess the extent to which the spatial patterning of atrophy is related to structural connectivity. We compute the correlation between a node’s atrophy value and the mean atrophy of its structurally connected neighbours, weighted by streamline density estimated using diffusion MRI. In this context, neighbours of a node are those that share a direct physical connection with the node of interest (Fig. 2a). To ensure that connectivity estimates reflect the healthy connectome prior to disease onset and deafferentation, we estimated structural connectivity in a sample of N = 327 unrelated healthy young adults from the Human Connectome Project (HCP-S90048). Figure 2a shows a positive correlation between the two (Pearson correlation coefficient; r = 0.52), suggesting that pathology in a brain region is correlated with greater exposure to pathology in anatomically connected regions, an effect that has been demonstrated in other neurodegenerative syndromes29,49,50,51,52.

Fig. 2: Structural connectivity shapes ALS-related atrophy.
Fig. 2: Structural connectivity shapes ALS-related atrophy.
Full size image

a We test the hypothesis that local regional atrophy is related to atrophy in its anatomically connected neighbours. The scatter plot shows the atrophy of a node (y-axis) and the mean atrophy of that node’s structurally connected neighbours (x-axis) (Pearson correlation coefficient; r = 0.52). Grey circles indicate brain regions. Atrophy values are derived from the group-average atrophy map shown in Fig. 1c, d. The structural connectome is sourced from healthy young adults in the HCP-S900 dataset. b The observed correlation coefficients between node and neighbour atrophy (red circles) are shown with respect to three null models: (1) spatial autocorrelation preserving spin tests (pspin = 9.99 × 10−4, blue box plot), (2) degree-preserving rewired networks (p = 9.99 × 10−4, green box plot), and (3) degree- and edge length-preserving rewired networks (p = 0.026, red box plot). Asterisks indicate statistical significance with respect to each null model. c In the atrophy ranking method, epicenters are defined as nodes that are both highly atrophied and structurally connected to other highly atrophied nodes. To estimate the epicenter likelihood of a node, nodes are first ranked according to their atrophy and then ranked according to their neighbours' atrophy. The epicenter likelihood ranking of each node is defined as its mean ranking in the two lists. The epicenter maps are shown on both inflated and flat cortical fs-LR surfaces, with the top-ranked nodes highlighted by black borders. In the figure legend, LH stands for left hemisphere and RH stands for right hemisphere. For completeness, we additionally show the interindividual variability in node-neighbour correlation values and plot the mean, standard deviation, and mean divided by standard deviation of epicenter maps across individual ALS patients in Fig. S3.

To demonstrate that atrophy patterns are mainly driven by network topology and not by spatial proximity among regions or the spatial autocorrelation of the atrophy patterns, we apply three null models (Fig. 2b)53. The first null model is a spatial autocorrelation-preserving randomization that tests whether the correlation between node and neighbour atrophy is passively due to spatial autocorrelation in the atrophy map ("spin test”)54,55. This model generates a null distribution for node-neighbour correlation values by projecting the atrophy map to a sphere, applying random angular rotations, bringing the rotated map back to the cortical surface, and re-calculating the node-neighbour atrophy correlations using the values from the rotated atrophy maps. We observe significantly greater correlations for the empirical map compared to the rotated maps (pspin = 9.99 × 10−4), suggesting that the effect cannot be attributed to spatial autocorrelation. Moreover, the correlation between node and neighbour atrophy is significantly greater when using the empirical structural brain network compared to rewired null structural networks that randomize network topology, including both degree-preserving and degree- and edge length-preserving nulls (p = 9.99 × 10−4, 0.026; respectively)56. Collectively, these null models show that the correlation between node and neighbour atrophy is specifically due to network topology.

Lastly, to ensure that the results are not unduly influenced by the choice of parcellation, we repeated the analysis using the Schaefer-800 functional parcellation and replicated the observed dependence of atrophy on structural connectivity. Fig. S2 shows the correlation between node and neighbour atrophy in this case (Pearson correlation coefficient; r = 0.61). The observed correlation is again higher than values obtained using the spatially autocorrelated atrophy maps (pspin = 9.99 × 10−4) and surpasses the correlation values obtained using rewired null structural networks (p = 9.99 × 10−4).

Epicenters of cortical atrophy

Given that the structural connectome shapes the spread of ALS-related pathology, we next sought to identify the putative cortical epicenter of the atrophy. We apply two methods—one empirical and one computational—to back-reconstruct the spreading trajectory and infer the most likely cortical location of the epicenter: (1) a network-based node ranking method, and (2) a susceptible-infected-removed (SIR) dynamical model, detailed in the Fig. S4, Table S3, and Supplementary Information: The S.I.R model. Both methods have previously been applied to understand the course of multiple neurological diseases29,52,57,58,59,60,61.

The ranking method identifies an epicenter as a region that is severely impacted by disease-related atrophy and whose structurally connected regions also exhibit extensive atrophy. In this approach, brain regions are ranked based on both their own atrophy values and their neighbours’ atrophy values in two separate lists. Each cortical node is then assigned a value reflecting the node’s average rank across these two lists. Figure 2c showcases the final mean ranking values across brain regions; in this map, higher values indicate higher probability of a node being an epicenter. The two most probable epicenter locations are within the right and left pre-central gyrus (primary motor cortex).

In the second approach, we build an SIR model using the structural connectome as the only underlying foundation for the spread of pathogenic agents, and hence atrophy. The model works by simulating the misfolding of normal proteins in the cortex and their trans-neuronal spread through the structural connections between brain regions. The similarity between real atrophy map and simulated atrophy maps is computed at each time point (Fig. S4). Each trajectory in the plot corresponds to the similarity of real and simulated atrophy when a specific parcel is chosen as the epicenter for the spread of misfolded proteins. Brain parcels are ranked based on parcels’ maximal correlation value across all simulation time-points. The top two parcels which can best reproduce the atrophy pattern, are indicated in blue and yellow in Fig. S4, and again are located within the motor cortex. Altogether, both methods yield similar epicenter probability maps (Pearson correlation coefficient; r = 0.77, pspin = 9.99 × 10−4, nspin = 1000). Both maps suggest high probability of being an epicenter for parcels across the motor and premotor cortices, as well as dorsolateral prefrontal cortex, posterior parietal cortex, and superior temporal gyrus. The location of these epicenters is consistent with the ALS staging mechanisms developed by Brettschneider et al.14, where the motor cortex is identified as the initial site of pathology, with subsequent spread of the disease to more distal areas in the prefrontal and parietal brain regions.

Local biological features guide pathogenic spread

We next investigate the potential contribution of multiple biological features in guiding disease spread. We consider the hypothesis that spread may be more likely between regions that display or share specific biological features. Specifically, we reconstruct five inter-regional similarity networks that describe the biological similarity of pairs of brain regions. The networks include: (1) gene expression similarity, (2) neurotransmitter receptor similarity, (3) laminar differentiation similarity, (4) metabolic similarity, and (5) hemodynamic similarity (i.e., functional connectivity) (Fig. 3)51.

Fig. 3: Inter-regional biological similarity and atrophy.
Fig. 3: Inter-regional biological similarity and atrophy.
Full size image

Inter-regional similarity networks reflect the similarity of brain regions according to multiple biological features. We analyze transcriptomic (gene expression) similarity, receptor similarity, laminar similarity, metabolic similarity, and hemodynamic similarity51. The heatmaps visualize these networks. Negative-valued elements are excluded from all analyses. The node-neighbour atrophy correlations are estimated for each inter-regional similarity network. The group-averaged ALS atrophy pattern in Fig. 1c, d is used to derive the correlations. The significance of node-neighbour correlations is assessed with respect to spatial autocorrelation preserving spin tests (nspin = 10,000; FDR corrected; transcriptomic similarity: pspin = 1.55 × 10−2, receptor similarity: pspin = 1.55 × 10−2, laminar similarity: pspin = 0.14, metabolic similarity: pspin = 9.99 × 10−4, hemodynamic similarity: pspin = 0.03). Red dots depict node-neighbour correlations, and boxplots depict the corresponding spin test-estimated null distributions. Additionally, we construct linear regression models to predict ALS cortical atrophy using two variables: (1) weighted mean neighbour atrophy, where the weights are derived from the structural connectome, and (2) weighted mean neighbour atrophy, where the weights come from the interregional similarity matrices. We assess whether adding the second regressor (weighted mean neighbour atrophy based on a specific interregional similarity matrix) improves model fit (adjusted R2), compared to adding a regressor with the same spatial organization. Adjusted R2 is improved only when incorporating data from metabolic similarity matrix. See Fig. S5 for further details.

To assess whether regions with similar biological features are also more likely to experience atrophy, we compute node-neighbour similarity as in Fig. 2a. Namely, we define the “exposure” that region i has to region j’s atrophy as the product between the edge weight (cij if cij > 0) and the extent of atrophy in the neighbouring nodes. Here, we use the connection weights derived from each of the five inter-regional similarity networks. If the weights from a network yield a high value for node-neighbour atrophy, this suggests that the biological feature encoded by that network contributes to the spread of pathology through the synaptic connections51. We assess significance with respect to spatial autocorrelation-preserving spin tests and apply FDR correction to the resulting p-values.

We find that all networks of inter-regional biological similarity except laminar similarity yield significant correlations between node and neighbour atrophy values (Fig. 3). Interestingly, metabolic similarity—estimated using dynamic FDG PET—yields the greatest node-neighbour similarity compared to other tested interregional similarity matrices (Pearson correlation coefficient; r = 0.45, pspin = 9.99 × 10−4). For other types of biological similarity matrices, following node-neighbour correlation values are obtained: gene expression similarity, r = 0.35 (pspin = 1.55 × 10−2); neurotransmitter receptor similarity, r = 0.34 (pspin = 1.55 × 10−2); laminar similarity, r = 0.22 (pspin = 0.14, not significant), and hemodynamic similarity, r = 0.24 (pspin = 0.03).

Finally, we test whether adding information from any biological similarity matrix improves disease propagation prediction beyond what structural connectivity alone provides. To address this question, we construct linear regression models to predict ALS cortical atrophy using two variables: (1) weighted mean neighbour atrophy, where the weights are derived from the structural connectome, and (2) weighted mean neighbour atrophy, where the weights come from the interregional similarity matrices (Fig. S5a). We find that only addition of metabolic similarity information as a second regressor results in improved model fits (adjusted R2), compared to adding a randomized regressor with the same spatial autocorrelation pattern (Fig. S5b; increase in adjusted R2 from 0.266 to 0.280, pspin = 1.40 × 10−2). This is consistent with numerous reports that ALS is associated with metabolic dysfunction, including abnormal mitochondrial physiology leading to a decreased level of adenosine triphosphate (ATP) and oxidative stress, as well as dysfunction of astrocyte mitochondrial and glutamate transporters leading to increased capture of free glutamate and excitotoxicity41,62,63. The results suggest that network structure and the metabolic connectivity jointly contribute to the spatial patterning of atrophy. In the next subsection, we investigate the molecular and cellular features associated with atrophy in greater detail.

Molecular and cellular signatures of epicenters

Up to now, we find that atrophy patterns reflect network organization and are centered on a compact set of epicenters. We next ask whether the network epicenters of ALS atrophy are enriched for specific biological processes, cellular components, and cell types. We cross-reference the ALS epicenter probability map (Fig. 2c) with microarray gene expression from the Allen Human Brain Atlas64. We submitted the gene list to Gene Category Enrichment Analysis (GCEA) to isolate Gene Ontology (GO) categories in which the constituent genes are significantly more correlated with epicenter likelihood map than a population of randomized maps with preserved spatial autocorrelation65 (see “Methods”).

Consistent with the intuition developed in the previous subsection—suggesting involvement of metabolic features in disease spread—the top GO categories are mainly associated with metabolic processes and energetic homeostasis (Fig. 4a). These terms include “ATP metabolic process”66, “Tricarboxylic acid cycle”, “gluconeogenesis”, “mitochondrial calcium ion transport”67,68, “glycolytic process”69, “glycosphingolipid metabolic process”70,71, “mitophagy”72,73,74, “positive regulation of mitochondrial fission”75,76, “mitochondrial matrix”77, “mitochondrial membrane”77,78, “integral component of mitochondrial inner membrane”78, “integral component of mitochondrial outer membrane”78, “protein import into mitochondrial matrix”79,80, “amino acid transport”, “branched-chain amino acid catabolic process”81, “peroxisome”82,83, and “peroxisomal matrix”82,83, all of which point toward altered energy metabolism and mitochondrial dysfunction. Additionally, categories related to cells’ cytoskeletal structure, such as “actin filament-based movement”, “apical dendrite”84, “microtubule associated complex”84,85, “vesicle transport along microtubule” and “anchored component of membrane” are implicated in the disease, which point to the structural dysregulations happening in the cells causing axonal transport disturbance86, impairment of information integration in cells84, and impairing cell adhesion. A brief explanation of biological processes and cellular components shown in Fig. 4a, b can be found in the Gene enrichment details section of the Supplementary Information. A comprehensive list of significant terms can be found in the Supplementary Data 1: Gene-enrichment.

Fig. 4: Enrichment analyses of the genes associated with ALS epicenter map.
Fig. 4: Enrichment analyses of the genes associated with ALS epicenter map.
Full size image

a Top 30 biological process terms from the GO Consortium knowledge base associated with gene sets correlated with the ALS epicenter likelihood map, shown in Fig. 2c214. b All cellular components terms from the GO Consortium knowledge base that are significantly enriched in gene sets correlated with the ALS epicenter likelihood map214. Terms in both categories are ordered by their category scores (c-scores). c Cell type enrichment analysis of ALS epicenter map reveals significant hits for pericytes and endothelial cells87. Used gene markers for endothelial cells and pericytes are shown in Fig. S6. The graphical representation of pericytes and endothelial cells is created in BioRender (Farahani, A. (2025) https://BioRender.com/xswclks). For more details, including gene markers for each biological process, cellular component, and cell type, exact c-score values, and corresponding p-values, refer to the Supplementary Data 1: Gene-enrichment.

In addition, we estimated cell type enrichment associated with ALS epicenters using genetic markers of different cell types, as determined through single-nucleus droplet-based sequencing and single-cell transposome hypersensitive site sequencing of human brain cells87 (Fig. 4c). We find significant enrichment for pericytes88,89,90,91 and endothelial cells92,93 (see Fig. S6). This observation points to possible vascular dysfunction in ALS, potentially explaining the consistent enrichment of metabolic categories. The finding is consistent with previous reports that show a reduction in pericytes in the spinal cord in ALS93,94. These cellular changes occur even prior to the initial clinical manifestations of the disease95,96. Collectively, these findings show that the spatial patterning of atrophy in ALS is related to both network structure as well as local molecular and cellular features that confer greater vulnerability to the disease. In other words, axonal projections are the physical conduit for disease spread, but the trajectory of spreading is guided by local biological features.

The correlation coefficients between all stable genes (differential stability greater than 0.1) from the Allen human brain atlas, the group-average atrophy map (Fig. 1c, d) and the epicenter map (ranking approach, Fig. 2c), along with their corresponding p-values from spin tests and FDR-corrected p-values are detailed in the Supplementary Data 2: Gene-correlation.

Epicenter location is correlated with clinical presentation and symptom severity

If cortical epicenters reflect the spatial focus of ALS pathology, do they also correlate with the clinical manifestation? To address this question, we analyzed the covariance between individual patient epicenter maps and individual differences across a variety of clinical, cognitive, and demographic variables. We used a multivariate pattern learning algorithm—partial least squares (PLS)—to identify epicenter locations and clinical subtypes that maximally covary with each other97,98,99 (see “Methods”).

The analysis revealed two latent variables that accounted for 24.89% and 12.21% of covariance between epicenter maps and clinical scores (Fig. 5, Fig. S7). Both latent variables were statistically significant using permutation tests (p = 9.99 × 10−4 and 4.89 × 10−2, respectively), but only the first latent variable could be cross-validated (p = 0.039). The first latent variable captures a mainly primary motor cortical epicenter pattern (Fig. 5a). Individuals who display this epicenter pattern tend to have worse motor function, including abnormal index finger and foot tapping scores, daily physical functions (Revised Amyotrophic Lateral Sclerosis Functional Rating Scale; ALSFRS scores), and muscle tone. Interestingly, pathology in these epicenters is uncorrelated with most cognitive scores, except for worse cube counting and delayed recognition scores. In other words, atrophy load in this cortical location is linked with worse motor symptoms and a higher level of disability.

Fig. 5: Relating individual epicenter maps with clinical and behavioral measures.
Fig. 5: Relating individual epicenter maps with clinical and behavioral measures.
Full size image

a The first latent variable from a PLS analysis relating individual epicenter maps and clinical-behavioral measurements. Brain loadings are shown on the fs-LR inflated and flat cortical surfaces. Regions demarcated by the black border are those with the greatest effect sizes (parcels in which over 50% of the vertices have a Cohen’s d effect size exceeding 1) in the group-average activation map from S1200 Human Connectome Project package for the movement task contrasts219. Regions demarcated by green borders showcase areas 44 and 45 from the Glasser parcellation222. These regions, specifically in the left hemisphere104, correspond to the Broca’s area223. b The scatter plot visualizes the individual ALS participants' brain scores versus behavioral PLS scores (Pearson correlation coefficient, r = 0.54; Spearman correlation coefficient, r = 0.54); each participant’s score is colored based on the revised-ALSFRS total score. The total ALSFRS score quantifies the degree of functional disability resulting from the disease (greater values correspond to lower disease severity)149. c The bar plot visualizes the behavioral/clinical measures' loadings. The contribution (effect size) of individual measures is assessed by bootstrap resampling (1000 repetitions). For further details on the included measures refer to Table S4. Fig. S8 shows the PLS results when atrophy maps are used instead of epicenter maps.

Despite statistical significance and a large effect size, the second latent variable could not be cross-validated. We therefore relegate the figure to the supplement (Fig. S7) and only describe this latent variable for completeness. The second latent variable captures mainly a dorsolateral prefrontal cortical epicenter pattern. Individuals who display this epicenter pattern tend to have worse ALSFRS speech scores, as well as respiratory insufficiency (i.e., worse ALSFRS-respiratory insufficiency score). The clinical pattern also captures covariance between atrophy in medial prefrontal cortex and lower scores in multiple cognitive measures. Collectively, these patterns suggest that individual differences in epicenter location are closely linked with clinical presentation. Importantly, the two latent variables are reminiscent of spinal-onset and bulbar-onset ALS clinical subtypes, which we examine in detail next.

Atrophy epicenters in spinal- and bulbar-onset ALS

So far, we focused the analysis on a common atrophy pattern across the patient sample. However, ALS is heterogeneous, and in clinical practice individuals are often stratified according to the initial body region of onset100. Most individuals experience spinal (limb) onset of the disease where the first symptoms appear in the legs and/or hands, while a third of patients report the first symptoms being in bulbar areas3, citing difficulties in salivation, swallowing, and speaking101,102. The present dataset (CALSNIC) provides stratification for both bulbar- (N = 38) and spinal-onset (N = 140) ALS subtypes (see “Methods” and Table S5). There are also rare cases of patients who report respiratory difficulties in the initial stages of the disease103, but these were not included in the analysis. Neither were patients with mixed spinal- and bulbar-onset, nor those with frontotemporal dementia-onset.

Here we investigate whether cortical network epicenters are related to spinal- and bulbar-onset of the disease. We estimate epicenter probability maps for spinal and bulbar ALS with the two methods presented in the Epicenters of cortical atrophy subsection (the ranking approach, Fig. 6a and the SIR modeling approach, Fig. S9). The two methods yield similar epicenter likelihood maps for both disease onset types (Pearson correlation coefficient; rspinal = 0.76, rbulbar = 0.78, pspin = 9.99 × 10−4) (Fig. S9). Importantly, epicenter probability maps in spinal and bulbar ALS are different: in spinal-onset ALS atrophy is mainly confined to primary motor cortex and paracentral lobule; conversely, in the bulbar-onset ALS atrophy infiltrates areas in lower paracentral gyrus and inferior frontal gyrus. To highlight epicenter differences between these two subtypes, we contrast cortical epicenter maps for individuals with spinal and bulbar ALS using t-tests. In this analysis, we used individualized epicenter maps derived from the node-neighbour approach. The obtained t-statistic map identifies regions that are more likely to be epicenters in one type compared to the other (Fig. 6b). For reference, we use the Human Connectome Project motor task group average effect size maps to delineate and overlay borders of cortical regions associated with movement of the tongue, hands, and feet. Additionally, we include the borders of areas 44 and 45 of Glasser parcellation (Broca’s area). These areas are functionally involved in speech production104, a phenomenon that is affected in ALS. Consistent with their clinical subtypes, bulbar-onset individuals show cortical disease epicenters in regions linked to tongue movement and in Broca’s area, accounting for increased speech deterioration in the bulbar group compared to the spinal group105. Conversely, the spinal-onset individuals predominantly have more likely epicenters in regions associated with movement of feet.

Fig. 6: Atrophy epicenters in spinal- and bulbar-onset ALS.
Fig. 6: Atrophy epicenters in spinal- and bulbar-onset ALS.
Full size image

a Epicenter likelihood maps for spinal-onset ALS and bulbar-onset ALS. Maps are obtained for each subtype using two methodologies: ranking approach and the agent-based SIR modeling approach (see Fig. S9). The maps derived by the ranking method are presented here. b The brain maps depict the t-statistics derived from contrasting the bulbar- versus spinal-onset ALS epicenter maps. Regions demarcated by the black border are those with the greatest effect sizes (parcels in which over 50% of the vertices have a Cohen’s d effect size exceeding 1) in the group-average activation map from S1200 Human Connectome Project package for the movement task contrasts219. Regions demarcated by green borders showcase areas 44 and 45 from the Glasser parcellation222. These regions, specifically in the left hemisphere104, correspond to the Broca’s area223. Cyan borders highlight three parcels that retain significance following FDR correction (FDR-corrected; p = 0.011, 0.011, and 0.021). Two of these parcels are located within the tongue movement area, while the other is adjacent to the tongue movement area toward the premotor region, this area might be involved in tongue movement control, given the homologous relationship between motor and premotor areas106,107. We further contrasted the behavioral/clinical measures for spinal- and bulbar-onset ALS patients. Bar plots show measures that are significantly different between the two subtypes (FDR corrected). Atrophy maps for each subtype are shown in Fig. S10.

The analysis indicates a statistically higher epicenter likelihood in bulbar ALS compared to spinal ALS in three parcels (all located in the right hemisphere). Two of the three significant parcels are located within the thresholded area of tongue movement and the third parcel is situated toward the premotor area, this area may be involved in tongue movement control, given the homologous relationship between motor and premotor regions106,107.

Additionally, assessing behavioral measures in CALSNIC dataset show that individuals with the bulbar-onset ALS have severe inability in their speech, salivation, and swallowing, and they have increased jaw and upper limb reflexes compared to the individuals with spinal-onset ALS; meanwhile, those with spinal-onset ALS displayed more significant weakness in dressing and hygiene, walking, and stair climbing abilities. These results show how network epicenters of cortical atrophy align with the clinical manifestation of the disease.

Discussion

The present report explores how network structure and local biological features shape the spatial patterning of atrophy in ALS. We find that structural connectivity, together with inter-regional similarity of metabolic features, closely aligns with the spatial patterning of atrophy and clinical expression of ALS. We identify consistent and prominent disease epicenters in motor cortex. Epicenter probability maps show transcriptomic enrichment for biological processes involved in mitochondrial function as well as support cells, such as endothelial cells and pericytes. Finally, individual differences in epicenter location correspond to individual differences in clinical and cognitive symptoms and differentiate patient subtypes.

We consistently find that atrophy in a brain region is correlated with the atrophy of anatomically connected regions, suggesting that atrophy spreads via white matter projections. This is consistent with the notion that ALS pathology is related to misfolding and network spreading of TAR-DNA binding protein (TDP)-43108,109,110,111. According to this account, pathological changes in conformation (misfolding) of endogenous TDP-43 induce aggregation and further misfolding. Trans-synaptic spread of pathogenic misfolded TDP-43 results in patterns of cell death and atrophy that ultimately resemble brain network architecture. Our results support this account in two ways. First, the corticospinal tract exhibits the most pronounced atrophy (consistent with findings in refs. 27,112,113,114), as does primary motor cortex, suggesting that cortical regions with strong physical connections to the corticospinal tract experience the greatest pathology. Second, a region’s atrophy and the atrophy of its neighbours are correlated, suggesting that network-based exposure to the pathology confers increased risk of pathology. This correlation is significantly greater than in rewired null models that preserve degree sequences and edge lengths, suggesting that the effect is not trivially due to spatial proximity or the total number of connections in a given region, but rather due to the arrangement of connections and the overall topology of the network. Network-based spreading of pathogenic proteins is implicated not only in ALS but across multiple neurodegenerative diseases, and can be observed across multiple spatial scales (e.g., cell-to-cell or region-to-region)115,116,117,118,119, measurement techniques (e.g., histopathological staining or MRI-based atrophy)120,121,122 and model organisms (e.g., mouse models and humans)123,124,125,126. Altogether, these results emphasize the role of structural network architecture as the mediator of the pathological protein spread in ALS7,8,9,10 and further challenge the notion that ALS is a non-prion-like spreading disease127.

In addition to structural connectivity, we find that the local metabolic features of brain regions contribute to disease spread. Numerous theories posit that the propagation of ALS is affected by atypical metabolic function63,128,129. Consistent with this notion, we find that metabolic inter-regional similarity also contributes to the patterning of atrophy. In other words, spreading of pathology is not only affected by structural connectome architecture but also more likely to take place between neuronal populations that share similar metabolic characteristics.

Given that disease spread appears to follow network structure, we next sought to identify the most likely cortical epicenters. We assessed the likelihood of various brain regions acting as cortical disease epicenters using two methods: (1) data-driven ranking, and (2) agent-based SIR modeling. Both methods identified regions located in the primary motor area to be the likeliest epicenters, as well as regions in frontal and temporal cortex, such as the temporoparietal junction. These results are in line with histopathological findings, showing TDP-43 pathology in motor cortex (Brodmann areas 4 and 6) in the initial stages of the disease14. The involvement of frontal and temporal areas in later stages of the disease is also well-documented14,27,130.

What biological features predispose regions to act as disease epicenters? The epicenter probability map is correlated with the spatial expression of genes involved in metabolic processes, pointing toward mitochondrial functions such as ATP production, as well as the structural organization of mitochondrial membrane and matrix, and disruption of mitochondrial calcium ion transport. This finding is in line with the idea of mitochondrial stress in ALS. Namely, dysregulation of mitochondrial calcium transport, potentially driven by increased excitatory cytosolic calcium uptake or changes in the function of mitochondrial calcium transporters, results in mitochondrial injury and triggers mitophagy. In addition, mitochondrial dysfunction is thought to cause further structural abnormalities in multiple cell types; for example, deletion of mitochondrial calcium uniporter changes dendritic spine morphometry131. The observation that cortical atrophy in ALS co-localizes with cortical regions enriched for mitochondrial function and structure further emphasize the need to consider the metabolic dimension of the disease in future neuroimaging studies (e.g., through use of FDG-PET imaging129).

In addition, the epicenter likelihood map overlaps with transcriptomic signatures of pericytes and endothelial cells. Endothelial cells are single-layer cells lining the blood vessels132. Pericytes surround the endothelial cells by extending their long cytoplasmic processes on the surface of endothelial tubes. The interaction between endothelial cells and pericytes is needed in formation and maintenance of blood-brain barrier (BBB), and blood-spinal cord barrier (BSCB); which contribute to maintaining the controlled chemical composition of the neuronal milieu133,134. Multiple studies point to a breakdown in both BBB and BSCB in ALS89,135. In mouse models of ALS, damage to BSCB, BBB, and endothelial cells is observed early in the disease136,137, and often precedes the onset of motor symptoms95. Similar vascular deficits have also been reported in human patients138. Indeed, cerebrospinal fluid abnormalities in ALS are thought to at least partly originate from the increase in BBB permeability139,140. Postmortem histology confirms the loss of integrity of BBB and SCSB due to endothelial cell damage and pericyte degeneration88,93. Our findings corroborate that the scope of inquiry for studying ALS should be broadened beyond motor neurons, and that support cells—such as vascular cells—should also be considered as therapeutic targets. Interestingly, recent findings suggest that treatment of ALS mice models with unmodified human bone marrow CD34+ (hBM34+) cells accelerates BSCB repair, leading to differentiation into endothelial cells, reduced astrogliosis and microgliosis, and improved perivascular integration, ultimately promoting survival of motor neurons141; likewise, treatment with pericytes improves overall survival in SOD1 mutant ALS mice142.

Finally, the locations of individual network epicenters map onto individual differences in clinical manifestation. Overall, individuals with greater epicenter probability in motor cortex had worse motor symptoms and signs, including poorer finger and foot tapping, increased muscle tone and reflexes, and lesser capability to perform daily functions involving the limbs (Fig. 5). The multivariate clinical subtype also included lower scores in naming, sentence completion, and verbal fluency tasks, presumably corresponding to greater epicenter probability in left inferior frontal cortex (Broca’s area; Fig. 5). Interestingly, disease epicenters in patients with bulbar- and spinal-onset of the disease evolve in different trajectories. Specifically, patients with bulbar-onset ALS displayed greater epicenter likelihood in three parcels involved in tongue movement. Collectively, these results demonstrate that studying ALS pathology from a network perspective can help trace the spatial origin of the disease, identify the molecular and cellular contributions to pathology, and, ultimately, map individual differences in brain-behavior relationships and clinical subtypes.

These findings build on multiple empirical reports8,40,111,130,143 and modeling studies on the relationship between connectivity and atrophy in ALS. Namely, several studies have modeled ALS-related degeneration and histopathological measurements, using various approaches, such as random walk models8,9 and network diffusion models10,144, showing that across a variability of models the whole-brain atrophy is related to the structural connectome. Importantly, Pandaya et al.10 showed that incorporating ALS gene risks coming from immunocytochemistry, exome sequencing and genome wide association studies, into the models can improve model accuracy, highlighting the importance of considering biological data to understand the underlying mechanisms of ALS neurodegeneration. The present report goes a step further and uses a data-driven approach to gene ontology analysis without relying on predefined gene sets, thereby identifying biological processes and cellular components that confer vulnerability to atrophy in ALS.

The present results should be considered with respect to several methodological limitations. First, we estimated atrophy using in vivo MRI deformation based morphometry, a technique that is well-validated but that does not directly measure pathology (e.g., neuronal loss, TDP-43 deposition). Despite this limitation, the results recapitulate numerous histopathological hallmarks of the disease. Meanwhile, when quantifying the atrophy maps of ALS participants, the potential influence of education145 and handedness146,147 was not excluded from the data. As some studies point to the potential for these variables to influence neuroanatomical metrics, future studies should consider their inclusion to fully account for confounding factors. Second, all network spreading effects were estimated using an independent high-resolution diffusion MRI dataset, rather than individual patient connectomes. Likewise, inter-regional transcriptomic similarity, receptor similarity, laminar similarity, metabolic similarity, and hemodynamic similarity matrices are estimated from non-ALS participants. This methodological decision highlights the need for more multimodal imaging and more extensive biological assays in patient samples. Third, all results are based on a single dataset. Although we used multiple methods whenever possible (e.g., when identifying epicenters), as well as cross-validation (e.g., for the PLS model), ideally, the results should be independently confirmed using a different ALS dataset. Fourth, the present analyses were performed without stratifying patients according to genetic mutation. Comprehensive comparison of sporadic and genetic cases is important to understand the heterogeneity of ALS, and highlights the need for larger sample sizes and targeted recruiting in future data collection efforts. Fifth, we did not use longitudinal MRI to assess the spreading mechanism in ALS due to several limitations; namely, not all ALS participants completed all three visits (71/192 had all three follow-up data available). Moreover, patients with more severe symptoms and faster disease progression are often unable to complete follow-up visits, either due to passing away before the next scan or becoming too dependent on intensive care. As a result, focusing solely on the subset of patients who completed follow-ups would likely bias the findings toward individuals with slower progression, limiting the generalizability of the results to the broader ALS population. We encourage future studies to reassess the findings presented in this paper using alternative methodological approaches—such as different structural connectome and inter-regional similarity matrix construction pipelines, or different parcellation schemes—to further validate the robustness of the results.

In conclusion, we reveal that network structure and local metabolic features leave an indelible mark on the expression of neurodegeneration in ALS. These two factors are closely related and should be studied simultaneously, rather than in isolation. Conceptualizing neurodegeneration as a multiscale spreading process may help to identify molecular, cellular and regional targets for therapies that slow or divert the course of pathology.

Methods

All codes used to perform the analyses are available on GitHub at https://github.com/netneurolab/Farahani_ALS and on Zenodo at https://zenodo.org/records/15865751 (https://doi.org/10.5281/zenodo.15865751).

CALSNIC dataset

Data was retrieved from the Canadian ALS Neuroimaging Consortium (CALSNIC) dataset (http://calsnic.org). The dataset comprises data from individuals diagnosed with possible, probable, or definite ALS, according to the revised El Escorial Criteria148, alongside data from healthy controls. All participants underwent 3T magnetic resonance imaging (MRI), yielding 1 mm isotropic T1-weighted (T1w) data for each individual. We use data acquired from eight different imaging sites, including: University of Calgary (CAL), University of Alberta (EDM), McGill University (MON), University of Toronto (TOR), University of British Columbia (VAN), University of Miami (MIA), Université Laval (QUE), and University of Utah (UTA). We exclude data from participants if they had other central nervous system abnormalities or reported psychiatric illness. The CALSNIC dataset provides longitudinal neuroimaging data with approximately four-month intervals between sessions for some but not all participants. We only keep individuals with a scan acquired at baseline for further analysis, resulting in 192 individuals with ALS (70 female; mean age: 59.62 ± 10.20 years) and 175 healthy control participants (96 female; mean age: 55.41 ± 10.00 years). Note that data used in this study comes from two phases of the CALSNIC dataset launched to date, which have slight changes in the structural MR imaging paradigm. Consequently, when building the ordinary least square model to estimate disease-related atrophy, if data from the same imaging site is acquired during different project phases, we treat the data from each phase as a separate entity. This assumption leads to considering twelve imaging sites in total when building the model (CAL: phase 1, 2; EDM: phase 1, 2; MON: phase 1, 2; TOR: phase 1, 2; VAN: phase 1; MIA: phase 2; QUE: phase 2; UTA: phase 2). All participants gave written informed consent, and the CALSNIC data collection was approved by the health research ethics boards at each of the participating sites. All ethical regulations relevant to human research participants were followed. Details regarding the demographic information of subjects per imaging site are provided in Table S1. Additionally, information on magnetic resonance vendors and imaging sequence details can be found in Table S2. For further data description refer to the original publication23.

Behavioral/clinical measures

The CALSNIC dataset, in addition to the magnetic resonance imaging data, provides behavioral/clinical assessments for individuals. These encompass ALS-related motor and cognitive evaluations, including Revised Amyotrophic Lateral Sclerosis Functional Rating Scale (ALSFRS), Edinburgh Cognitive and Behavioral ALS Screen (ECAS), finger/foot tapping tests, and evaluations for abnormal muscle tone and reflexes. Data on individuals’ sex, age, symptom duration, years of education, and handedness are also provided. Below is a brief overview of each measure.

ALSFRS is a rating measure that quantifies the level of disability in ALS patients. Higher values of this scale (with a maximum of 48) indicate lower disease severity. Specifically, ALSFRS assesses the participants’ functional capability in speech, salivation, swallowing, handwriting, gastrostomy, cutting food and handling utensils, dressing and hygiene, turning in bed and adjusting bed clothes, walking, climbing stairs, dyspnea, orthopnea, and respiratory insufficiency149. ECAS is a cognitive screening test to assess executive function, letter and semantic fluency, attention, memory, language, and visuospatial function of individuals150. A higher total score for ECAS (up to 136) signifies better cognitive performance. The tapping score, which is a clinical motor symptom severity indicator, quantifies the number of taps a participant can perform in 10 s by their fingers or feet. A higher count in this test reflects better motor function. In CALSNIC dataset, the tapping scores for both index-finger, and foot are measured two times per participant. Muscle tone and reflex measures are also included, with lower scores signifying normal tone and reflex conditions. For further elaboration on what each measure quantifies, refer to Table S4.

In this study, we use behavioral/clinical measures to relate the cortical epicenter locations to the behavioral manifestations of individuals with ALS (see Epicenter location is correlated with clinical presentation and symptom severity and Atrophy epicenters in spinal- and bulbar-onset ALS sections). Of the 192 patients included in the study, 8 individuals have incorrect ECAS administration; these participants are therefore excluded from analyses relating brain and behavior data (PLS analysis).

Deformation based morphometry

T1w data is preprocessed and Deformation Based Morphometry (DBM) maps are derived per participant (ALS and control participants in CALSNIC) using the Montreal Neurological Institute Medical Imaging NetCD (MNI-MINC) tools, publicly available at https://github.com/BIC-MNI/minc-toolkit-v2. The preprocessing steps include image denoising151, intensity inhomogeneity correction152, and image intensity normalization into range (0–100) using histogram matching; next, each T1w image is first linearly and then nonlinearly registered into the MNI152-NonLinear2009cSym standard brain. DBM maps are derived by estimating the local deformation needed in each voxel in an individual’s T1w image to nonlinearly match it to the standard template (MNI152-NonLinear2009cSym standard brain—T1w contrast). The required deformation is estimated by the Jacobian determinant of the inverse nonlinear deformation field and can be used as an indirect estimate of brain atrophy153,154. DBM values lower than 1 indicate that the corresponding region is smaller in the participant than in the template (atrophy in the participant compared to the template). Conversely, DBM values greater than 1 indicate that the corresponding region in the participant space is larger than the same region in the template space (expansion in the participant compared to the template). Collectively, the DBM maps encode the morphological differences between the T1w data of a given participant and the standard brain defined by the MNI template.

Atrophy maps

The Jacobian determinant from the DBM analysis serves as a dependent variable, influenced not only by the diagnosis but also by factors such as age, sex, and imaging site. To isolate factors unrelated to diagnosis from the DBM maps of ALS patients and to obtain a more accurate measure of disease-specific atrophy, we compute w-score map per individual155. The w-score value at each voxel quantifies the normalized deviation of the observed DBM value from its expected DBM value, adjusted for age, sex, and imaging site. The expected value for DBM at each voxel is estimated using an ordinary least squares model (DBMexpected = β1 × age + β2 × sex + β3) that is constructed based on control participants’ data. Greater absolute disparity between the observed and expected value of DBM indicates more severe atrophy or expansion in a given voxel. The formula to calculate the w-score for a voxel of interest is provided in the following:

$$\,{\mbox{w-score}}=\frac{{{\mbox{DBM}}}_{{{\rm{observed}}}{\mbox{-}}{{\rm{ALS}}}}-{{\mbox{DBM}}}_{{{\rm{expected}}}{\mbox{-}}{{\rm{ALS}}}}}{{\mbox{std}}({{\mbox{DBM}}}_{{{\rm{observed}}}{\mbox{-}}{{\rm{HC}}}}-{{\mbox{DBM}}}_{{{\rm{expected}}}{\mbox{-}}{{\rm{HC}}}})}$$
(1)

where std stands for standard deviation, DBMobserved is the raw DBM value without covariate adjustments, and DBMexpected is the predicted DBM value for a participant defined based on the already trained model (β1 × age + β2 × sex + β3). A negative w-score in this context signifies greater atrophy than the mean value expected for a healthy participant, with normal aging and sex effects accounted for. After calculating the w-score maps of all ALS participants under study, w-score maps are averaged across ALS patients to create a single collective atrophy map. To simplify interpretation, the w-score maps are multiplied by  −1 so that the larger numerical values correspond to more atrophy. Throughout the manuscript, the w-score maps (post-inversion) are referred to as the “atrophy map”. The group-average ALS atrophy map is visualized in Fig. 1a.

Brain tissue parcellation

ALS atrophy maps are parcellated using two different atlases. For cortical regions, we apply the Schaefer-400 (and Schaefer-800) functional parcellation44, and to examine the sub-cortex, specifically the white matter tracts, we use the Johns Hopkins University (JHU) white matter tractography atlas (thresholded at 25% probability)35,36. We initially downloaded the MNI152-FSL versions of these atlases from https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/brain_parcellation/Schaefer2018_LocalGlobal/Parcellations and https://web.mit.edu/fsl_v5.0.10/fsl/doc/wiki/Atlases.html, respectively. We then registered them into the MNI template of interest (MNI152-NonLinear2009cSym) using “antsRegistrationSyN” and “antsApplyTransforms” from Advanced Normalization Tools156,157.

Von Economo classes

To investigate the specificity of ALS cortical atrophy, we explored seven cytoarchitectonic classes coming from the Von Economo atlas45,46.

To assign von Economo classes to Schaefer-400 parcels44, we applied the classification provided by Scholtens et al.47 to the cortical brain surface, yielding vertex-level assignments. These vertex-level assignments were then used with the winner-take-all approach (majority voting) to generate parcel-level network assignments. With these network assignments established for each of the 400 parcels, we calculated the average atrophy values across the parcels belonging to each network class, resulting in a single atrophy value per class.

Network reconstruction

Brain networks (structural connectivity and inter-regional similarity networks) are retrieved from netneurolab51,158. Below we briefly describe how each network is reconstructed.

Structural connectome

Structural connectivity is a matrix that provides information regarding the white matter connections across pairs of brain regions. In this study, the dataset used to build the connectome comes from 327 unrelated healthy participants (182 female; age: 22–35 years) from the Human Connectome Project (HCP-S900 release), scanned using a 3T Connectome Skyra scanner. The diffusion MRI data is acquired with the spin-echo echo-planar imaging sequence (TR  = 5520 ms; TE  = 89.5 ms; FOV  = 210  × 180 mm2; voxel size = 1.25 mm3; b-value = three different shells, i.e., 1000, 2000, and 3000 s/mm2; number of diffusion directions = 270; and number of b0 images = 18). Diffusion MRI data is preprocessed using the HCP minimal preprocessing pipeline. For detailed acquisition information and preprocessing steps refer to these refs. 48,159.

Structural connectome is reconstructed from the diffusion MRI using the MRtrix3 package (https://www.mrtrix.org)160. Different tissue types of cortical and subcortical grey matter, white matter, and cerebrospinal fluid are segmented using T1-weighted image for anatomical constrained tractography161. Fiber orientation distributions are generated using a multi-shell multi-tissue constrained spherical deconvolution algorithm (algorithm introduced by Jeurissen et al.162 is used to estimate the response function). Initially, a tractogram is generated with 40 million streamlines, with a maximum tract length of 250 and a fractional anisotropy cut-off of 0.06. Spherical-deconvolution informed filtering of tractograms (SIFT2) is used to reconstruct whole brain streamlines weighted by cross-section multipliers163. To build a structural connectome, the reconstructed cross-section streamlines are mapped onto the Schaefer-400 parcellation (as well as onto the Schaefer-800 parcellation)44. For further insights into individual network reconstructions, consult the reference provided164. A group consensus structural network is then created such that the mean density and edge length distribution observed across individual participants is preserved165. The weights of the edges in the consensus networks represent the averaged number of streamlines between parcels, computed across participants with nonzero connections158.

Hemodynamic similarity

Hemodynamic similarity, often referred to as functional connectivity, summarizes the similarity across brain regions in terms of the synchronization and similarity of their co-fluctuation in the BOLD signal. The dataset used to build the connectome comes from 326 unrelated healthy participants (181 female; age: 22–35 years) from the Human Connectome Project (HCP-S900 release), scanned using a 3T Connectome Skyra scanner48. In this dataset, each participant has undergone four 15-minute resting-state functional MRI scans, each with a TR of 720 ms. Data is preprocessed using the HCP minimal preprocessing pipeline. For detailed preprocessing steps, refer to the cited reference159. The voxel-wise functional MRI data is parcellated using the Schaefer-400 atlas44. The parcellated time-series are then used to construct functional connectivity matrices, computed as Pearson correlation coefficient between pairs of regional time-series for each of the four scans per participant. To obtain a group-level functional connectivity matrix, mean functional connectivity across all participants and scans is computed. This matrix is normalized using Fisher’s r-to-z transformation51.

Metabolic similarity

Metabolic similarity estimates the similarity between brain regions in terms of glucose metabolism or, in other words, in terms of energy consumption. This network is reconstructed using positron emission tomography (PET) images of the [F18]-fluordoxyglucose tracer. The dataset includes 26 healthy participants (77% female; age: 18–23 years) who participated in a 95-minute simultaneous MR-PET scan acquired using a 3T molecular MR scanner166. PET images are preprocessed according to ref. 167. Each volume of the PET time-series is registered to the MNI152 template space and is parcellated according to the Schaefer-400 atlas44. Parcellated time-series from pairs of brain regions are then correlated (Pearson’s correlation coefficient) to construct a metabolic connectivity matrix for each participant. Subsequently, by averaging connectivity matrices across all participants, a group-average metabolic connectome is obtained. This matrix is normalized using Fisher’s r-to-z transformation51.

Gene expression similarity

Correlated gene expression quantifies the transcriptomic similarity between pairs of brain regions. The underlying data to construct this connectome comes from the bulk tissue microarray expression data collected from six post-mortem brains (1 female; age: 24–57 years, mean age: 42.50 ± 13.38 years). This data is provided by the Allen Human Brain Atlas (AHBA) (https://human.brain-map.org)64 and is processed using the abagen toolbox, publicly available at https://github.com/rmarkello/abagen168, yielding a map for each gene in the parcellated MNI template (Schaefer-40044). Genes with high differential stability across donors (threshold of 0.1) are considered for the analysis, resulting in 8687 stable genes. A region × region gene expression similarity matrix is constructed by correlating normalized gene expression profiles between pairs of brain regions (Pearson correlation coefficient). This matrix is normalized using Fisher’s r-to-z transformation51.

Note that only two donors from the AHBA had tissue samples taken from the right hemisphere. This irregular sampling results in limited spatial coverage of expression in the right hemisphere; to resolve this, tissue samples were mirrored bilaterally across the left and right hemispheres. Consequently, the final gene expression profile for each region was estimated as the mean of both ipsilateral samples (from all six donors with left hemisphere samples) and contralateral samples (from the two donors with right hemisphere samples)169.

Receptor similarity

Receptor similarity measures how correlated the receptor density profiles are between brain regions. To construct this network, PET tracer images for 18 neurotransmitter receptors and transporters are used61,170. These receptors/transporters cover nine neurotransmitter systems, including dopamine (D1171, D2172,173,174,175,176, DAT177), norepinephrine (NET)178,179,180, serotonin (5-HT1A181, 5-HT1B181,182,183,184,185,186,187,188, 5-HT2189, 5-HT4189, 5-HT6190,191, 5-HTT189), acetylcholine (α4β2192,193, M1194, VAChT195,196), glutamate (mGluR5)197,198, GABA (GABAA)199, histamine (H3)200, cannabinoid (CB1)201,202,203,204, and mu-opioid (MOR)205. Each of these PET tracer images is parcellated based on the Schaefer-400 atlas44 and normalized using z-scores. A region × region receptor similarity matrix is constructed by correlating receptor profiles across all pairs of brain regions (Pearson correlation coefficient). This matrix is then normalized using Fisher’s r-to-z transform51. More information on the PET data acquisition is provided in Table S6.

Laminar similarity

Laminar similarity, estimated from histological data, assesses the similarity in cellular distributions across cortical layers within pairs of brain regions51,206. Data is recovered from the high-resolution (20 μm) BigBrain atlas, a postmortem Merker-stained histological atlas of a 65-year-old male207. Staining intensity profiles are sampled across 50 equi-volumetric surfaces within the cortical grey matter, enabling the assessment of neuronal density and soma size variations across cortical layers. These intensity profiles also help delineate boundaries among cortical layers, such as supragranular (layers I–III), granular (layer IV), and infragranular (layers V–VI). The BigBrainWarp toolbox208 is used to transform the data to the surface-based fs-LR template, which is then parcellated based on the Schaefer-400 atlas44. A laminar similarity matrix is estimated by computing the partial correlation between regional intensity profiles. The matrix is then normalized using Fisher’s r-to-z transform51.

Disease exposure

In this section, our objective is to assess how a specific biological brain connectome affects the spread of pathology across the cortex in ALS. We initially apply a threshold to the connectome under study (e.g., metabolic similarity) to retain only positive values; the thresholded connectome is considered as a “network”, whose “nodes” correspond to the Schaefer-400 parcels44 and the “edges” are defined based on the values in the off-diagonal elements of the connectivity (similarity) matrix. We assume that these edges provide potential pathways for pathology propagation. At each node, we define the global disease “exposure” as a measure quantifying the extent of atrophy its neighbouring nodes are experiencing, weighted according to the strength of the edges that connect the node with its neighbours. Here, “neighbours” are defined as nodes directly connected to the node in focus. The following is a mathematical formulation of disease exposure at a given node, denoted as node i:

$${D}_{i}=\frac{1}{{C}_{i}}{\sum }_{j\ne i,j=1}^{d}{d}_{j}\cdot {c}_{ij}$$
(2)

here Di represents the disease exposure at node i; which is calculated as the weighted average of atrophy across the neighbours of node i. The atrophy values of neighbours are denoted by dj, where j is the neighbour’s identifier. Each neighbour atrophy value, dj, is multiplied by the edge strength between region i and node j, represented by cij. Ci in the formula represents the weighted degree of connections made by node i within the network. To assess whether connectome architecture is related to the spatial patterning of atrophy, we correlate the degree of atrophy in individual nodes and their respective disease exposures. This analysis is repeated while considering various brain connectomes (including structural connectivity, gene-expression similarity, receptor similarity, laminar similarity, metabolic similarity, and hemodynamic similarity).

Epicenter mapping: data-driven method

With an atrophy map and an underlying connectome, it is possible to identify putative disease epicenters using either epidemiological data-driven methods, or computational modeling (see Epicenters of cortical atrophy)29,52. The data-driven approach is based on the notion that one of the main factors of disease spread is the existence of structural connectivity across brain regions (nodes). Here an epicenter is defined as a node that is both highly atrophied, and physically connected to nodes that are also highly atrophied (Fig. 2c). Epicenters are identified using two separate ranking procedures. In the first ranking procedure, nodes are ranked in ascending order according to their mean atrophy. The second ranking procedure is performed on the array containing the weighted neighbour atrophy values (nodes’ exposure values). The disease exposure value at each node is calculated as previously described in Disease exposure. This new set of values per node is also ranked in ascending order, quantifying the involvement of node’s neighbours in pathology. Finally, we calculate the average ranking of a node in the two lists, reflecting its likelihood to act as a disease epicenter29,52,58,209,210,211.

Epicenter mapping: agent-based spreading model

Disease epicenters can also be identified using a modeling approach known as the agent-based Susceptible-Infected-Removed (SIR) model59. In brief, this model simulates the brain spread of pathology considering the structural connectome as a network through which misfolded proteins (agents) can propagate15 (see Supplementary Information: The S.I.R model for model equations). In the case of ALS, these agents may represent TDP-43108,109,110.

We apply this methodology to find epicenters (model parameters outlined in Table S3). The epicenters are identified as the nodes that—if chosen as the disease’s initial point of pathology (seed region)—will lead to the highest Pearson correlation coefficient value between the simulated and observed cortical atrophy pattern during the spread of the agent. Here, the disease spread simulation is performed over a total of 10,000 time steps. Nodes are then ranked based on the maximum correlation value between the observed patterns of cortical atrophy and the simulated atrophy patterns when each node is used as the initial seed.

Gene category enrichment analysis

Biological processes and cellular components that are correlated with the ALS epicenter likelihood map are identified using a gene category enrichment analysis (GCEA). Cortical maps for biological processes and cellular components are defined according to the gene expression data coming from the AHBA64. The data is preprocessed and mapped to parcellated brain regions using the abagen toolbox, publicly available at https://github.com/rmarkello/abagen168. To perform the enrichment analysis, we use the ABAnnotate Matlab-based toolbox, publicly available at https://github.com/LeonDLotter/ABAnnotate212. The package is adapted from the toolbox initially developed by Fulcher et al. (https://github.com/benfulcher/GeneCategoryEnrichmentAnalysis)65. The GCEA procedure assesses whether genes in a particular category are more correlated with a given brain phenotype than a random phenotype with comparable spatial autocorrelation (ensemble-based null model)65.

To address spatial auto-correlation effects, 30,000 spatially auto-correlated null maps are generated from the epicenter likelihood map using the neuromaps toolbox (method = “vasa”170,213) and are inputted to ABAnnotate package for the testing procedure. After matching category and Allen Human Brain genes based on gene symbols, and removing the genes with differential stability lower than 0.1, the Pearson correlations between the epicenter map, the null maps, and all gene expression maps are calculated. For each null map and each category, null category scores are obtained as the mean z-transformed correlation coefficients. Positive-sided p-values, indicative of the relationship between the epicenter map and each category, are determined by comparing the actual category scores to the null distribution, with subsequent False Discovery Rate (FDR) correction applied. For gene-category annotations, we use the GO biological processes and cellular components214 as well as the cell-type categories introduced by Lake et al.87.

Statistics and reproducibility

The sample sizes, statistical methods used, and validation protocols are described for each result in the relevant subsections. Throughout the manuscript, statistical significance is assumed at p-value smaller than 0.05. In short, we use spin test (or network null models) when assessing the associations among brain maps, see Methods: Null models for more details. Significance of PLS latent variables is assessed by permutation tests. The generalization of PLS results is assessed using cross-validation approach; see Methods: Partial least squares for more details. Comparison of epicenter maps across patient subtypes is done using t-tests per parcel. Whenever multiple comparison correction is needed, the FDR correction is applied.

Null models

Reported associations among brain maps and/or networks are assessed with respect to three null models53. Here we briefly outline the logic and implementation of each. First, to assess the effect of spatial autocorrelation on spatial associations between brain maps, we use the so-called spatial auto-correlation preserving permutation tests, commonly referred to as “spin tests”54. Briefly brain phenotypes (e.g., cortical atrophy maps) are projected to spherical projection of the fsaverage surface. This involves selecting the coordinates of the vertex closest to the center of mass for each parcel. These parcel coordinates are then randomly rotated, and original parcels are reassigned to the value of the closest rotated parcel (n repetitions). For parcels where the medial wall is the closest, we assign the value of the next closest parcel instead. Following these steps, we obtain a series of randomized brain maps that have the same values and spatial autocorrelation as the original map but where the relationship between values and their spatial location has been permuted. These maps are then used to generate null distributions of desired statistics, such as null node-neighbour correlation values215.

When evaluating the role of the structural connectome in disease spread, we use two additional network randomization methods53. One approach constructs degree-preserving randomized networks53,216. In this case, the atrophy map is unchanged, but the structural connectome itself is randomized 1000 times. In each randomization realization, each edge is rewired 10 times to generate a randomized network with the same size, density, and degree sequence as the actual network216. Using these null networks, the Pearson correlation coefficient between node and its neighbours’ is recalculated, and a two-sided p-value is estimated in a non-parametric manner.

Finally, the most conservative approach to test the role of structural connectome in disease spread involves using degree- and edge length-preserving randomized networks. In this approach, the atrophy map is kept unchanged, but the structural connectome is randomized 1000 times to create randomized networks with preserved size, density, degree sequence, and edge length (sometimes referred to as “cost”)53,56. To achieve this, edges within the structural connectome are categorized based on Euclidean distance into 10 bins. Within each bin, pairs of edges are selected randomly and swapped, with the total number of swaps equaling the number of regions in the network multiplied by 20. This process is repeated 1000 times, yielding 1000 randomized structural networks. These randomized networks are then used to construct a null distribution for the node-neighbour Pearson correlation coefficient, and to calculate a two-sided p-value.

Partial least squares

The goal of partial least squares (PLS) analysis is to relate two data matrices to each other97,217. In the present case, the two matrices represent epicenter likelihood maps (participants × regions) and clinical-behavioral data (participants × measures). In the PLS analysis, we included data from 184 ALS participants and excluded data of 8 participants as they had incorrect administration of ECAS scores. We included 62 clinical-behavioral measures per participant; the details of these measures are provided in Table S4. In cases where participant’s data for a measure is missing, it is imputed using the median value from participants for whom the measure is available. For additional information on the number of ALS participants with missing values per measure, refer to Table S7. The analysis is initialized by computing the covariance between brain (X) and behavior features (Y). The resulting covariance matrix is subjected to singular value decomposition:

$${{{\bf{X}}}}^{{\prime} }{{\bf{Y}}}={{{\bf{USV}}}}^{{\prime} }$$
(3)

where U and V are orthonormal matrices of left and right singular vectors and S is a diagonal matrix of singular values. Each combination of a left singular vector, a right singular vector, and a singular value constitutes a latent variable. The elements of each singular vector weight the contribution of individual features to the overall multivariate pattern. In the present analysis, these weights correspond to a spatial pattern of cortical epicenters and clinical phenotypes that optimally covary with each other. To estimate the extent to which individual patients express these atrophy or behavioral patterns, patient-specific brain and behavioral scores are calculated. Scores are computed by projecting the original data onto the respective singular vector weights, such that each individual is assigned a brain and a behavioral score, indicating the degree to which a patient expresses each atrophy pattern and behavioral phenotype97.

The proportion of covariance accounted for by each latent variable is a measure of effect size and is quantified as the ratio of the squared singular value to the sum of all squared singular values. The statistical significance of each latent variable is estimated by permutation testing. This involves randomly permuting the order of observations (i.e., rows) of data matrix X for a total of 1000 repetitions, followed by constructing a set of “null” brain-behavior correlation matrices for the permuted brain and unchanged clinical data matrices. These “null” correlation matrices are then subjected to SVD to generate a distribution of singular values under the null hypothesis that there is no association between brain epicenters and behavioral measures. A non-parametric p-value can be estimated for a given latent variable as the probability that a permuted singular value exceeds the original, non-permuted singular value.

The contribution of individual features to the model is estimated using bootstrap resampling. Participants (rows of data matrices X and Y) are randomly sampled with replacement (1000 repetitions), resulting in resampled correlation matrices that are then subjected to SVD. This bootstrapping procedure generates a sampling distribution for each singular vector weight. A bootstrap ratio for each behavioral measure is then computed, defined as the ratio of its singular vector weight and its bootstrap-estimated standard error. High bootstrap ratios are indicative of features that make a large contribution to the latent variable and are consistent across samplings.

Finally, we use cross-validation to evaluate the out-of-sample correlation between cortical epicenter patterns and behavioral features. We use 100 random divisions of the dataset, allocating half of the data for training and the other half for testing. In each repetition, we apply PLS to the training data and estimate singular vector weights. Subsequently, each realization of the test data is projected onto the derived weights derived from the training set. We then estimate patient-specific scores and their correlation in the test sample. The procedure is repeated 100 times to establish a distribution of out-of-sample correlation values. To assess the statistical significance of these out-of-sample correlation values, we conduct permutation tests (100 repetitions). During each permutation, we shuffle the epicenter matrix rows and repeat the analysis to create a null distribution of correlation coefficients between epicenter and clinical scores in the test sample. This null distribution is then used to estimate a non-parametric p-value.

Somatotopic map from the HCP movement task

To localize the cortical activation boundaries associated with different body part movements, we utilize the Human Connectome Project’s group-average activation maps from the HCP-S1200 release. The group-average activation maps include the average strength of functional activation across 997 healthy young adults (532 female; age: 22–35 years) who completed 3T task fMRI runs (for detailed information on fMRI acquisition parameters and the preprocessing steps, refer to refs. 159,218,219). Here, we incorporate a set of group-average motor task activation maps, which reveal the somatotopic organization of the sensorimotor cortex. These motor tasks are originally developed by Bucker et al.220 and Yeo et al.221 and involve participants responding to visual prompts to execute specific movements. These movements include finger tapping (left and right), toe squeezing (left and right), and tongue moving. Each movement type is performed in a 12 s block, encompassing 10 movements, and is preceded by a 3 second cue. There are two runs per participant in total, each containing 13 blocks: two for tongue movements, four for hand movements (split evenly between right and left), four for foot movements (also evenly split), and three 15-second fixation blocks. To associate Schaefer-400 parcels with specific motor functions, we first threshold the group-average Cohen’s d maps at the vertex level to highlight regions with significant effect sizes (Cohen’s d greater than 1) in each contrast map of interest. Next, we identify parcels in which over 50% of vertices surpassed this threshold value. This method enabled us to link distinct cortical parcels to distinct motor functions.

Contrasting spinal- and bulbar-onset ALS

To compare cortical epicenter likelihood maps between spinal- and bulbar-onset ALS subtypes, we conduct a t-test for each cortical brain parcel. This analysis allows us to identify cortical areas that differ across the two disease subtypes in terms of their capacity to spread the disease. Here, individuals are assigned to the spinal subtype (N = 140) when the onset region is specified as either “upper motor neurons”, “lower motor neurons”, or a combination of both. Conversely, individuals reporting an onset region of “bulbar”, “bulbar-speech”, or “bulbar-speech and bulbar-swallowing” are categorized under the bulbar ALS subtype (N = 38) (see Table S5). We use individual epicenter likelihood maps as inputs for the t-test models, resulting in a map of t-statistics that illustrates the cortical epicenter differences between ALS subtypes. To ensure the reliability of the differences in epicenter locations between the two groups, we apply FDR correction to the p-values from the t-tests (n = 400). This correction signifies three cortical parcels in the right motor/premotor area, involved in the tongue movement, as the regions that have greater epicenter likelihood in the bulbar-onset group.

We also examine the behavioral/clinical differences across the disease subtypes. We compare groups (bulbar- and spinal-onset ALS) using Mann-Whitney U test for each metric, followed by adjustment for multiple comparisons using FDR. 62 different metrics are included for assessment in total, after removing the metrics which are not reported for more than 20 participants. Among the included subjects, 8 had incorrect ECAS administration. For these subjects, we retained them in this analysis by assigning “Not a Number” values for the corresponding ECAS measures, as they still provided useful information for other clinical and behavioral measures. If missing data is available for a measure, it is not incorporated in the t-statistic calculation (see Table S8 for the number of included subjects per measure for each subtype). We find significant differences in metrics including “ALSFRS-Speech” (FDR corrected; two-sided p = 2.58 × 10−14), “ALSFRS-Salivation” (p = 4.88 × 10−8), “ALSFRS-Swallowing” (p = 2.16 × 10−10), “ALSFRS-Dressing&hygiene” (p = 3.63 × 10−2), “ALSFRS-Walking” (p = 2.81 × 10−2), “ALSFRS-Climbingstairs” (p = 1.91 × 10−2); “Reflexes-Jaw” (p = 1.13 × 10−3) and “Reflexes-RightArm” (p = 4.37 × 10−3) (Fig. 6b).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.