Introduction

In Parkinson’s disease (PD), the progressive neurodegeneration of dopaminergic neurons in the pars compacta of the substantia nigra (SNc) leads to the characteristic motor symptoms. Diagnosis primarily relies on clinical criteria, oftentimes aided by nuclear imaging of the dopamine system. While magnetic resonance imaging (MRI) is predominantly used to rule out other potential diagnoses, quantitative MRI techniques have been increasingly used in research to assess nigral pathology in PD. These MRI techniques show strong potential as future biomarkers to aid diagnosis, track disease progression and gain further insight into the relation between individual symptomatology and the spatial pattern of neuropathology1,2.

Quantitative structural MRI can in patients with PD visualize two key features of the neurodegenerative process in the SNc. The first feature is the structural loss of neuromelanin (NM) containing dopamine neurons which give the region its characteristic macroscopic appearance as the “black body” (i.e., substantia nigra)3,4,5. NM-sensitive MRI techniques thus provide in-vivo markers for the integrity of dopaminergic SNc neurons6. The relationship between NM-sensitive MRI and both nigral NM content as well as striatal dopamine innervation has been confirmed in post-mortem and in-vivo studies in healthy individuals, but also PD patients6,7,8,9.

Another structural feature of PD is the excessive regional accumulation of iron in the SNc10,11,12,13. MRI can detect local iron concentration because iron influences the magnetic susceptibility and relaxation behavior14,15,16. Postprocessing of T2*-weighted images to R2*-maps and quantitative susceptibility maps (QSM) allows for quantitative in-vivo assessment of iron tissue concentration in Parkinson´s disease17,18,19,20,21. Although both, R2* and QSM, are iron-sensitive, they are affected differently by tissue and MRI properties. For instance, R2* also reflects regional myelination and is more sensitive to non-local field disturbances. Thus, their magnitude and distribution are correlated, but not identical22.

These two features of PD, the decrease in NM and the increase in iron concentration, occur at the same time, but are also interconnected. While being essential for cell function in normal amounts, excessive iron concentrations generate harmful reactive oxygen species. There exists a direct dependency between iron and NM where iron is crucial for NM synthesis and NM chelates iron, acting as the primary iron storage site in the SNc10,11,12,13. When iron exceeds the chelation capacity of NM, it leads to increased free radical formation, contributing to nigrostriatal neurodegeneration13,23. This creates a detrimental cycle in PD, where the loss of NM-containing neurons releases chelated iron, exacerbating its neurotoxic effects. The regional patterns of iron accumulation and NM loss and their link to clinical symptoms, remain poorly understood23,24. One aim of this study was therefore to map the difference and overlap of the spatial pattern of NM- and iron-sensitive MRI in the SNc in PD.

One key structure where these processes co-occur is the nigrosome-1 (N1) sub-region in the SNc which has a high density of dopaminergic neurons and shows prominent cell loss in Parkinson’s disease3,4. NM-sensitive MRI has previously revealed signal reduction in the postero-lateral SNc, corresponding to N120,24,25,26. In iron-sensitive MRI, the N1 appears as a hyperintense “swallow tail sign” in healthy individuals because of its relatively low iron content27. The “swallow tail sign” disappears in PD, indicating iron accumulation in the dorsolateral SNc27,28,29,30. The observed correlations between NM- and iron-based MRI signals in N1 and the more ventromedially placed N2 suggest a spatial overlap of NM loss and iron accumulation in Parkinson´s disease in the nigrosomes20,24,25. However, in other parts of the SN one might expect differences in the regional expression of NM loss and iron accumulation. For instance, the substantia nigra pars reticulata (SNr) lacks NM-containing dopamine neurons but shows high iron concentrations in Parkinson’s disease31,32.

Structural changes in the SN, as revealed by MRI, are closely linked to the severity of parkinsonism. Numerous studies have shown that the severity of motor symptoms, assessed in the OFF-medication state, correlates with reduced NM signal or increased iron content in the SN9,17,21,31,33,34,35. However, studies on the relationship between motor symptoms in the ON-medication state and nigral NM loss and iron accumulation showed mixed results, with some finding significant correlations31,34,36,37,38 and others not20,39,40,41. In this study, we used ultra-high field 7T MRI to compare the spatial patterns of NM loss and iron accumulation in the SNc and explored which of the MRI modalities best reflected the severity and asymmetry of motor impairment in the ON-medication state.

We recruited 49 patients with PD and 27 healthy, age-matched controls and performed NM-sensitive magnetization transfer-weighted (MTw) and iron-sensitive R2* and QSM MR images and computed voxel-wise difference maps between the two groups. We then tested for associations between individual motor symptom severity and MR signal change in the N1 as well as the rest of the SN. Finally, we explored voxel-wise correlations between the three acquired MR-modalities in each group separately.

Results

Clinical and imaging data

Demographical and clinical characteristics of the study population are summarized in Table 1. Of the 49 patients recruited, 3 were excluded due to contraindications to 7T MR-scanning, 1 due to unreported neurological conditions, 1 due to incidental findings on structural MRI and 1 due to generally poor scan quality across all modalities. Two of the 27 healthy controls were excluded due to initially unreported neurological conditions and another healthy control participant dropped out before the MRI scan session. Thus, the resulting study population consisted of 43 patients and 24 healthy controls.

Table 1 Demographics and clinical characteristics

The PD group and the healthy control group did not differ in age (Wilcoxon rank-sum, Z = 550, p = 0.661) or gender (χ2(1) = 0.37, p = 0.543) (see Table 1 for distribution in sub-samples). The PD group spanned a large range in terms of disease duration and time since symptom onset (0–17 years and 1–18 years respectively) and included patients with mild to moderate disease (Hoehn & Yahr stages 1–3). In the PD group, motor laterality was well balanced with 18 and 25 patients most severely affected in the right and left side respectively. The proportion of scan data available following exclusion of imaging data of poor quality did not differ between groups (NM imaging available in 42 patients and 24 healthy controls (χ2(1) = 2.395, p = 1), R2* imaging available in 32 patients and 21 healthy controls (χ2(1) = 0.901, p = 0.342), QSM imaging available in 36 patients and 23 healthy controls (χ2(1) = 1.152, p = 0.283).

Between-group differences in local neuromelanin and iron content

Voxel-wise analyses of between-group differences of both the NM-maps (MT-weighted neuromelanin contrast-to-noise ratio (MT CNR) maps, Fig. 1a) and the iron-sensitive QSM maps revealed NM loss and iron accumulation centered around the nigrosomes. The QSM maps showed a much broader extent of significant group-differences across the SNc compared to the NM maps. However, there was a clear overlap of NM loss and iron accumulation in a bilateral region corresponding to the location of N1 (Fig. 2b, c, z-coordinates 116-124). The assumption that this region corresponds to the N1 was corroborated by the group template that was constructed from MT-weighted images from healthy controls. This template displayed the well described clustering of NM-containing dopamine neurons into nigrosomes (Fig. 1b). The NM-loss was accordingly located within the hyperintense dorsolateral cluster corresponding to the location of the N1. Furthermore, patients displayed voxels with higher quantitative susceptibility ventromedially in a cluster corresponding to the location of N2 (Fig. 2c, z-coordinates 112-124). The NM contrast was significantly decreased in a similar, albeit smaller area and only in the right SNc (Fig. 2b, z-coordinates 112-116). Additionally, patients had lower NM contrast in a bilateral area just medial to the N1 in the dorsal tier of SNc (Fig. 2b, z-coordinate 116) corresponding to the location of N4. R2* values were also increased in regions corresponding to N1 and N2, but the clusters showing a between-group difference were more confined and differences did not survive correction for multiple comparisons (Fig. 2d).

Fig. 1: Reference regions and regions of interest.
figure 1

a A midbrain search-space (green) and lateralized cylindrical reference regions (blue and red) were manually defined on the MNI-template and used to calculate lateralized MT CNR maps of the midbrain. b For the ROI analysis in patients, the ROIs were defined manually on MT-weighted, high-resolution QSM and low-resolution R2* templates constructed by averaging data from all healthy controls. For each modality an ROI was defined for the right (blue) and left (red) SN and N1 (based on dorsal nigral hyperintensity, low quantitative susceptibility and low R2* for the MTw, QSM and R2* modalities respectively). Mean MT CNR, R2* and QSM values were extracted from each lateralized ROI for the assessment of correlations with contralateral motor severity and motor asymmetry.

Fig. 2: Threshold free cluster enhancement (TFCE) of the midbrain.
figure 2

TFCE analyses in a midbrain search-space (ROI) a of the between group difference revealed a decrease in MT-weighted contrast to noise ratio (MT CNR) in patients with PD compared to healthy controls in areas corresponding to the bilateral nigrosomes N1 and N4 and right N2, b Patients had increased QSM values in areas corresponding to bilateral nigrosomes N1 and N2, c No significant voxel-wise difference was found for R2*, d Statistical maps in b and c are thresholded at family-wise error corrected p < 0.05. In d, the statistical map is thresholded at t > 2 for visualization purposes, none of the differences are significant after correction for multiple comparisons.

Associations between the MRI-based read-outs and measures of disease severity

Given the predominant cell loss in N1 in PD and its clear delineability in the templates of all imaging modalities in the healthy control group (Fig. 1), we analyzed associations between disease severity and MRI readouts separately for the N1 and the rest of the substantia nigra (Table 2 and Fig. 3a). We found significant associations between iron-sensitive modalities and residual motor severity where higher contralateral R2* values in the SN and N1 as well as higher QSM values in the N1 were associated with higher motor severity. More specifically, higher UPDRS-III scores were significantly associated with higher contralateral R2* values in the SN (t(30.84) = 4.06, adj.p = < 0.001, 0.51 (±0.12) 1/s increase in R2* for each one-point increase in UPDRS-III) and N1 (t(34.49) = 5.14, adj.p < 0.001, 1.02 (±0.20) 1/s increase in R2* for each one-point increase in UPDRS-III). Similarly, higher UPDRS-III scores were significantly associated with increased contralateral quantitative susceptibility in the N1 (t(40.66) = 2.85, adj.p = 0.024, 1.22 (±0.43) ppb increase in quantitative susceptibility for each one-point increase in UPDRS-III). No significant associations were found between UPDRS-III scores and NM in the SN (t(56.38) = −0.89, adj.p = 0.506) or N1 (t(53.98) = −1.76 adj.p = 0.145).

Fig. 3: Associations between MRI measures and motor severity.
figure 3

Plots show standardized beta-coefficients and confidence intervals for each ROI/modality combination for each mixed linear model. a Associations between MRI-measures in the SN and N1 and motor severity (contralateral UPDRS-III in the ON-medication state), b Associations between the asymmetry of MRI-measures and the asymmetry of motor severity. c Associations between MRI-measures in the substantia nigra and N1 and LED. * Denotes significant association.

Table 2 Relationship between imaging measures and clinical variables assessed using linear mixed models

Evaluating associations between MRI modalities and LED, a trend was observed where lower SN NM contrast was related to higher dopaminergic deficiency, but this did not survive correction for multiple comparisons (t(36.52) = −2.26, p = 0.024, adj.p = 0.081) and no effects on LED dose were found in the remaining models (Fig. 3c).

Since clinical symptoms might be susceptible to the variability in disease duration, we also tested models including disease duration as additional factor. This did not change the reported above (Supplementary Table 2).

UPDRS-III scores and LED were not significantly correlated (Supplementary Fig. 2). Further visualization of the data for each hemisphere is provided as scatterplots in Supplementary Fig. 1a, and the full models are presented in Supplementary Table 1.

In an exploratory analysis, we also tested relationships between the asymmetry of motor symptoms and the asymmetry of MRI measures in the SN and N1 for all three modalities (Fig. 3b). Asymmetry in R2* in the N1 and SN were significantly related to asymmetry in residual motor severity (R2* in N1: t(27) = −4.41, adj.p = 0.002, effect size = −0.949 (±0.215); R2* values in SN: t(27) = −4.01, adj.p = 0.003, effect size = −0.434 (±0.108)). After correcting for multiple comparisons, neither asymmetry in the N1 nor in the SN in QSM- or NM-imaging were related to asymmetry in motor symptoms. Further information can be found in Supplementary Table 3, a visualization of the data is provided as scatterplots in Supplementary Fig. 1. We again explored whether adding disease duration as a factor changed the results, which again it did not (Supplementary Table 4).

Two-sample t-tests were used to test for differences between patients and healthy participants in midbrain NM (one-tailed testing for lower NM in PD than HC) and iron measures (one-tailed testing for higher susceptibility in PD than HC). The models included mean-centered regressors for age and sex. Significance was established based on a randomization procedure42 based on voxel-wise threshold-free cluster enhancement (TFCE) statistics43.

Voxel-wise between modality correlations

Voxel-wise MT CNR was strongly negatively correlated with R2* in antero-medial aspects of the substantia nigra in both patients with PD and healthy controls (Fig. 4A). MT CNR and QSM were also found to correlate negatively in the SN and red nucleus in patients, but voxel-wise correlation was only significant in a few voxels in healthy controls (Fig. 4B). Finally, R2* and QSM correlated positively in the majority of the SN and in the red nucleus (Fig. 4C). However, even though both the spatial distribution and the strength of the correlation patterns appears to be different between PD patients and healthy controls (in healthy controls, correlations between MT CNR and QSM appear stronger and more widely spread, but weaker and more confined for the correlations between the other modality pairs), we found no statistical difference between the correlation values in any of the pairwise correlations (Supplementary Fig. 3).

Fig. 4: Voxel-wise between modality correlations.
figure 4

Voxel-wise correlations between MT CNR, R2* and QSM in patients and healthy controls. Significance was accepted at a threshold of family-wise error rate adjusted p < 0.05. Thresholded maps of voxel-wise Pearson’s correlation coefficients are presented for A MT CNR & R2*, B MT CNR & QSM, and C R2* & QSM for patients and healthy controls respectively. Maps are overlayed on study specific templates for spatial reference.

Discussion

In this study, we leveraged the increased signal-to-noise ratio of ultra-high field MRI to investigate the sensitivity of MTw, QSM and R2* maps to detect dopamine neuron loss and iron accumulation in the SN in PD and their relation to motor disability. Disease-induced structural changes in NM and iron signal overlapped in the N1 and N2. NM signal was also reduced in the N4 region, while QSM showed a larger spatial extent of group differences. While R2* maps did not reveal differences in iron signal between groups, this modality showed the strongest relationship with residual motor symptoms.

Patients displayed a bilateral NM loss and increased iron accumulation in the SN. NM loss was mostly confined to the nigrosomes 1, 2 and 4. R2* maps displayed visually comparable patterns of increased iron signal confined to N1 and N2 regions in PD patients, but these differences did not survive correction for multiple comparisons. The reduced NM and increased quantitative susceptibility in nigrosomal subregions are consistent with the findings of recent studies using voxel-wise analysis of iron sensitive MRI20,25. We confirm and extend these studies by showing a reduced NM signal in a region corresponding to N4. The loss of NM contrast in N1 matches histopathological findings described by Damier et al.3 as well as previous neuroimaging studies that subdivided the SNc into subregions24,26,37. Together, our results indicate that N1 but also N2 and N4, with their high density of dopaminergic neurons, are the most sensitive regions for MRI mapping of structural changes44. Although significant changes in nigrosomes beyond N1, such as N2 and N4, can be detected at field strengths below 7 tesla25, the majority of significant findings regarding these regions come from ultra-high field MRI studies20,24,45. These reports, together with our findings, underscore the potential of ultra-high field MRI systems with their increased signal-to-noise ratio to detect structural disintegration in nigrosomes other than N1.

The spatial overlap of NM reduction and iron accumulation in the N1 and N2 regions supports a link between regional neuronal loss and iron accumulation in PD. Iron is essential for dopamine synthesis, and NM is an end product of the dopamine-iron complex chain involving the oxidization of dopamine by iron23. Since NM stores iron, the death of dopaminergic neurons releases iron, leading to a co-localization of NM loss and iron accumulation in PD23. This notion is supported by recent post-mortem studies that showed a concomitant loss of dopaminergic neurons in N1 and an increase in both cellular iron and extracellular iron, as well as NM-bound iron46. Some in vivo studies also reported this regional co-occurrence of NM loss and iron accumulation20,24,25,47. Of note, one study found no correlation9, but methodological differences make comparisons among studies challenging.

We computed voxel-wise correlations to explore spatial co-expression of NM and the two MR-derived iron signals in patients and healthy controls. Both patients and healthy controls showed high correlations between the NM signal and R2* values in the SNc, suggesting that the two modalities probe similar tissue properties in the healthy and degenerated SN. However, the NM contrast correlated more strongly with regional quantitative susceptibility in patients than in healthy controls, indicating that QSM may better reflect disease-related iron accumulation than R2*. However, none of the differences in correlations between groups were statistically significant (Supplementary Fig. 3), so these interpretations remain speculative and require further investigation with larger datasets.

Beyond the overlap in the N1 and N2 regions, there were some notable differences in the spatial expression of disease-related changes in NM and iron-sensitive MRI measures. Regional quantitative susceptibility increases were more widespread than NM losses, suggesting iron accumulation may precede dopamine neuron loss in the SNc24. The different spatial patterns could be due to varying MR sensitivity to NM loss versus iron accumulation. However, our finding of significant NM decrease without iron accumulation in the N4 region argues against lower NM-sequence sensitivity as the sole factor.

While NM-sensitive MTw and QSM maps showed significant group differences, voxel-wise R2* values did not differ between patients and healthy controls. In agreement with previous studies20,31,38, R2* values in N1 and N2 increased in patients with PD, but these changes were not statistically significant. In our study, R2* maps appeared to be noisier and more prone to artifacts compared to QSM maps. Despite including only R2* maps of acceptable quality, noise in the fitted R2* values likely contributed to the lack of significant differences. Accordingly, previous studies comparing R2* to QSM suggest that QSM might be more sensitive for detecting alterations in nigral iron content31,40.

While R2* values had low sensitivity for detection of between-group differences, regional R2* maps were more consistently related to residual motor symptoms in the ON-medication state. Both contralateral motor symptoms and motor symptom asymmetry were significantly associated with R2* values in the SN and the N1 region. We therefore infer that R2* maps may be more suited to explore inter-patient differences and their relation to motor dysfunction rather than detecting a deviation in nigral iron content at the group level relative to healthy controls. The finding that nigral iron accumulation is associated with motor symptom severity aligns with some previous studies, both OFF31,34 and ON medication38,48, though others found no association20,35,39,41. While R2* and QSM maps are both sensitive to iron, outside N1 only R2* was significantly associated with residual motor severity. When interpreting this finding, it is worth to point out that R2* is also sensitive to other tissue properties, such as myelination and non-local field disturbances22. This results in smaller signal differences between patients and healthy controls and reduces the sensitivity for between-group differences. The observed structure-dysfunction relationship might thus reflect a combined effect of PD on iron concentration and other tissue properties in the SN that may render the nigral R2* signal sensitive to variations in residual motor symptoms in the ON-medication state. This hypothesis aligns with the idea that residual motor symptoms ON medication relate to structural changes beyond dopaminergic cell death, but more studies are needed to investigate how disease symptoms map onto these different imaging modalities.

While MRI measures of regional iron accumulation were significantly correlated with greater residual motor impairment in the ON-medication state, no significant association was found between decreased NM signal and residual motor impairment in the ON-medication state. It is possible that UPDRS-III scores would have shown a negative correlation with the nigral NM signal if motor impairment had been measured in the OFF-medication state49. Previous studies on motor severity and the nigral NM signal yielded conflicting results, with some finding associations OFF9 and ON medication36,37 and others finding no associations OFF33,50 or ON medication48. In our data, the NM signal showed a trending association with LED, an indirect measure of dopaminergic degeneration, suggesting that iron accumulation and dopamine cell loss might reflect different aspects of dopaminergic degeneration. However, this association did not survive correction for multiple comparisons, so this interpretation remains speculative.

We used high-resolution 7T MRI to map NM loss and iron accumulation in PD. Imaging of the SN was performed at an unmatched spatial resolution, with MT-weighted images acquired at 0.4 mm in-plane resolution and 1 mm slice thickness. This higher resolution allowed us to create a detailed template demonstrating the anatomical details of the SNc, surpassing previous studies using 2–3 mm slice thickness7,26,33,37,50,51,52,53,54,55. However, the study still has a number of limitations:

First, while right-left field inhomogeneity has been previously reported for Philips MRI scanners56,57, we did not observe any right-left difference in image quality. The apparent between-group differences in the three MRI modalities with seemingly more pronounced differences in the right hemisphere are not reflecting significant site-specific differences between groups.

Second, our multimodal mapping approach did not include a diffusion weighted sequence. Especially diffusion measures of unrestricted diffusion, reflecting extracellular space, have been shown to be increased in the SN in PD2,58,59. This increase in free water has been attributed to several neurobiological causes60 such as cell loss61,62 or neuroinflammation63. The implementation of diffusion-weighted imaging with diffusion measures such as mean diffusivity or apparent diffusion coefficient as well as free water modeling might have provided valuable information about micro-and macrostructural features to gain further insights into the neurobiological causes underlying the differences in spatial distribution and sensitivity to clinical s of our MRI measures.

Third, lacking post-mortem data, we cannot claim with certainty that the hyper- (MTw images)/hypointense (R2* and QSM) region in the posterolateral SNc that we used for delineating the N1 in healthy controls exclusively reflects the N1. This approach assumes that N1 is a region containing a high density of pigmented neurons and low tissue iron concentration. Healthy controls show a dorsal hyperintense area in the SN on T2*-weighted or QSM scans, creating the characteristic swallow-tail sign, representing a subregion with low iron concentration27,29,64,65. It is a matter of ongoing debate, whether this region represents the cluster of dopaminergic neurons termed nigrosome-144. A recent post-mortem study defined the N1 region on histological sections and then applied biophysical modeling to R2* and QSM maps of the SN66. The main contributor to R2* relaxation in the N1 was the NM-bound iron, suggesting that a main source of the swallow-tail sign is related to the high concentration of NM containing neurons in the N1. It was argued that the reduced iron signal in N1 might be driven by several counteracting mechanisms with different contributions over the course of PD. Potential mechanisms include an increase in NM-bound iron in surviving dopamine neurons, a reduction in the number of NM positive neurons as well as an increase in ferritin-bound iron outside dopaminergic neurons66. Regardless of the terminology applied, the posterior lateral aspect of the SN is an important subregion to study in PD, because histopathological and imaging studies have independently shown that changes in this region scale with clinical severity and distinguish patients with Parkinson’s disease from healthy persons9,17,18,25,28,29,30,37,45,46,67,68,69,70,71,72,73,74.

Fourth, analyses of associations between clinical scores and MRI measures were conducted in ROIs delineated using healthy control group templates for each modality. Accordingly, we focused on left and right N1 ROIs, as these were clearly delineated on all modalities prior to performing any group-level analyses. Interestingly, cluster-level inferences revealed group differences not only in N1 but also in N2 and N4. Future studies should investigate the relationship between symptom severity and structural changes in N2 and N4. Furthermore, while the N1 regions were similarly located and sized in all three modalities, the spatial extent of the SN differed between modalities. While it would have been possible to define the ROIs using a single modality (e.g., based on the MT-weighted template) and then derive average signal intensities across all three modalities, we chose to test the regions as defined by the acquired images. This approach directly addresses the more clinically relevant question of whether each acquired image modality is sensitive to PD symptoms. Importantly, ROIs were delineated on group templates from healthy controls before conducting the analyses, thus avoiding potential circularity issues.

Fifth, the ROIs on MT-weighted, R2*, and QSM templates likely include different SN subregions. Since the SNr has high iron concentrations, regions defined on R2* and QSM templates likely include SNr voxels. To account for this, we repeated correlation analyses using regions of interest defined on the MT-weighted template. Results were consistent with the main analysis, confirming our conclusions.

Structural 7T MRI of the SN is a valuable tool for high-resolution mapping of NM loss and iron accumulation in Parkinson’s disease. Disease-related microstructural changes occur in clusters, with the most prominent changes localized in the nigrosomes. Although dopamine neuron disintegration and iron accumulation are closely related in PD, MR-based mapping of NM loss and iron content provides complementary insights into SN damage. Our cross-sectional results highlight the potential of multimodal 7T MRI for tracing nigral involvement and delineating structure-dysfunction relationships in PD.

Methods

We recruited 49 patients with PD and 27 healthy, age-matched controls as part of a larger study investigating brainstem changes in Parkinson’s disease. Patients were recruited from the Movement Disorders outpatient clinic at the Department of Neurology, Bispebjerg Hospital (Copenhagen, Denmark) and private practice neurology clinics based in the Copenhagen Region. All participants in the Parkinson’s disease group were required to have a clinical diagnosis of Parkinson’s disease and were additionally required to meet the Movement Disorders Society Clinical Diagnostic Criteria for “Clinically Established Parkinson’s disease” or “Clinically Probable Parkinson’s disease”75. Exclusion criteria were pregnancy or breastfeeding, history of other neurologic or psychiatric disease, pacemaker or other implanted electronic devices and claustrophobia. Healthy control participants were recruited by online advertisements on www.forsoegsperson.dk and were required to be 18 years or older with no history of neurologic or psychiatric disease and fulfill none of the exclusion criteria mentioned above. The study was approved by the Regional Committee on Health Research Ethics of the Capital Region of Denmark (Record-id: H-18021857). All participants gave their written informed consent to participate in the study which was registered on ClinicalTrials.gov (Identifier: NCT03866044).

Study procedures

Participants underwent a full neurological examination to exclude participants that exhibited symptoms of any neurological condition other than Parkinson’s disease. On the day of clinical examination, patients had taken their usual dopaminergic treatment and motor severity was assessed around 10–11 AM in the ON-medication state using the Unified Parkinson’s Disease Rating Scale (UPDRS)76. UPDRS-III items representing lateralized symptom severity were summed for the right and left extremities respectively. Additionally, we registered age and sex, as well as patients’ disease duration, time since onset of motor symptoms and medication status and calculated patients’ levodopa equivalent daily dose77.

Magnetic resonance imaging

Structural MRI data were collected with a Philips Achieva 7T scanner (Philips, Best, The Netherlands) equipped with a 32-channel Nova head coil (Nova Medical, Inc., MA, USA). To assess the integrity of pigmented (neuromelanin containing) SNc neurons, we acquired MT-weighted images using a 3D high-resolution (voxel size = 0.4 × 0.4 × 1.0 mm) ultra-fast gradient echo sequence aligned to the AC-PC line with a field-of-view covering the midbrain and rostral pons (640 × 640 × 34 voxels), echo time/repetition time = 4.1/8.1 ms, flip angle = 7 degrees, 2 averages. MT-saturation was achieved by applying 16 block-shaped pre-pulses at a frequency offset of 2 kHz (flip angle = 278 degrees, duration = 10 ms), acquisition time = 8 min. In MTw imaging, radiofrequency pulses saturate spins of protons in the “macromolecule-bound” pool, thereby enhancing contrast78,79. MTw GRE sequences avoid the long scan durations and high specific absorption rates associated with turbo spin echo based NM sensitive imaging at ultra-high field MRI78. For coregistration purposes, we also acquired T1-weighted images with identical acquisition parameters to the MT-weighted sequence, but with a 0 degree flip angle for the off-resonance pre-pulses. For the calculation of QSM maps, a high-resolution (voxel size = 0.4 mm isotropic), single-echo, T2*-weighted 3D fast field echo (FFE) sequence was acquired aligned to the AC-PC line with a field of view covering the midbrain and basal ganglia (640 × 640 × 125 voxels), EPI factor 5, 125 axial slices, echo time/repetition time = 15/31 ms, flip angle = 15 degrees, acquisition time = 7 min. A multi-echo, 3D FFE sequence with a lower resolution (voxel size = 1 mm isotropic) was acquired for the calculation of R2*-maps with a field-of-view covering the entire head (164 × 224 × 224 voxels), 164 sagittal slices, 1 signal average, echo time/repetition time = 3.2, 6.6, 10, 13.4, 16.8 & 20.2 ms/23 ms, flip angle = 15 degrees, acquisition time = 14 min. In addition, we acquired T1-weighted, high-resolution (voxel size = 1 mm isotropic) Magnetization Prepared RApid Gradient Echo (MPRAGE) images with a field-of-view covering the entire head (200 × 288 × 288 voxels) realized from a 3D-T1 TFE sequence with a non-selective inversion pulse, echo time/repetition time/inversion time = 2.2/4.9/1300 ms, acquisition time = 3:30 min. Patients were scanned in the ON-medication state to limit tremor-related movement while inside the scanner.

All images and derived quantitative maps were visually inspected and images of low quality due to subject related (e.g., head movements) or technical artefacts (e.g., susceptibility artefacts involving the brainstem and adjacent brain regions) were excluded from further analysis.

Image processing

Quantitative susceptibility maps were calculated from the magnitude and phase images of the single-echo T2*-weighted acquisition using the Multi-Scale Dipole Inversion method implemented in the QSMbox toolbox in Matlab (https://gitlab.com/acostaj/QSMbox), which was extended to also do a dipole inversion on angulated scans80. The postprocessing consisted of brain masking (BET2) using the magnitude image, phase unwrapping, Laplacian Boundary Value filter (LVB), followed by a variable spherical mean value filter (vSHARP), and finally a multi-scale dipole inversion using default settings apart from the regularization term, which was set to \(\gamma ={10}^{2.7}\)80,81,82,83. Voxel-wise quantitative R2* maps were calculated from the multi-echo T2*-weighted magnitude images using the Auto-Regression on Linear Operations (ARLO) algorithm for fast mono-exponential fitting84.

Imaging data were co-registered and normalized to template space using the ANTs software (v. 2.2.0)85 following a procedure similar to the pipeline used in our previous study86: T1-weighted MPRAGE images were up-sampled to a resolution of 0.5 mm isotropic and served as anatomical reference images. These images, as well as high resolution MT-weighted and T1-weighted images were corrected for bias using the ANTs software (N4 bias field correction, 5 resolution levels, number of iterations per level: 50 × 50 × 30 × 20, convergence threshold: 1 × 10−6, isotropic sizing for b-spline fitting: 200)87. Within-subject co-registration was achieved by rigid-body registration of the high resolution T1-weighted images to the T1-weighted reference scan and by rigid-body registration of the high resolution MT-weighted images to high resolution T1-weighted images and subsequently to the reference images. Within subject co-registration of susceptibility and R2* maps was achieved through rigid-body registration of the first echo from the multi-echo T2*-weighted acquisition to the reference images and the single echo T2*-weighted magnitude image to a combined magnitude image from the multi-echo T2*-weighted acquisition (root sum of squares image created in QSMbox).

A study specific T1-weighted template was created from individual MPRAGE images from 31 patients and 22 healthy control participants using rigid, affine, and nonlinear diffeomorphic registration (greedy symmetric normalization, SyN) with five resolution levels. All participants MPRAGE reference images were normalized to this study specific template using an identical approach as the one used for creating the template. The study specific T1-weighted template was registered to the MNI 0.5 mm ICBM152 (International Consortium for Brain Mapping) T1-weighted, non-linear, asymmetric template again using rigid, affine and non-linear registration to bridge the normalization of individual T1-weighted images88. Finally, we concatenated transformations from within-subject registration and to-template normalization to normalize all images to MNI template space for analysis, and created MT-weighted, R2* map and quantitative susceptibility map templates by averaging normalized images from all healthy participants (Fig. 1b, top row).

Calculation of MT-weighted neuromelanin contrast-to-noise ratio (MT CNR) maps was performed in template space in FSL (v. 6.0.4) using the following procedure. A midbrain search-space was manually segmented on the T1-weighted MNI template following the description provided by Iglesias et al.89 and divided into right and left midbrain search-spaces and two 3.5 × 4 mm cylindrical reference regions were placed in the right and left crus cerebri using the ITK-SNAP software (v. 3.8.0) (Fig. 1a) similar to the procedure described by Rua et al.90.

Neuromelanin maps were calculated as contrast to noise ratio (CNR) from normalized MT-weighted images by subtracting the mean signal intensity of the lateralized reference regions (\({\mathrm{mean}}_{\mathrm{reference}}\)) from the voxel signal intensity (\(\mathrm{Signal}\,{\mathrm{Intensity}}_{\mathrm{voxel}}\)) and dividing by the standard deviation of these reference regions (\({\mathrm{SD}}_{\mathrm{reference}}\)):

$${\mathrm{CNR}}_{\mathrm{voxel}}=\frac{\mathrm{Signal}\,{\mathrm{Intensity}}_{\mathrm{voxel}}-{\mathrm{mean}}_{\mathrm{reference}}}{{\mathrm{SD}}_{\mathrm{reference}}}$$

The right and left normalized neuromelanin maps were then masked by the right and left midbrain search-spaces and combined into one volume with neuromelanin values relative to their respective lateralized reference regions.

To test our hypotheses that loss of NM signal and increase in R2* and QSM values should indicate higher disease severity, we assessed the relationships between our two measures of disease severity (levodopa equivalent daily dose (LED) and residual motor severity (UPDRS-III scores in the ON-medication state) and signal intensity across modalities in manually defined regions of interest (ROIs). Regions of interest were manually defined on the three templates (MT-weighted, R2* and QSM) constructed from healthy participants’ imaging data. On the MT-weighted template, the SN was segmented based on the visual boundary between the neuromelanin rich hyperintense voxels of the SNc and the surrounding midbrain (25 slices in z-direction), and a bilaterally symmetric area located in the lateral tier of the SNc was defined as a putative nigrosome-1 region-of-interest (12 slices). Similarly, on the R2* and QSM templates, the hyperintense substantia nigra was segmented manually (25/26 slices, respectively), and the bilateral dorsolateral hypointensity, corresponding to the dorsolateral nigral hyperintensity (i.e., the swallow tail sign) on conventional T2*- and susceptibility weighted images, was defined as putative nigrosome-1 regions of interest (18/8 slices respectively). This resulted in a set of SN and nigrosome-1 ROIs specific to each modality defined on an independent set of healthy control data to avoid circularity. Because the ROIs were defined based on high/low intensities in the HC group template, we did not use them for between-group comparisons to avoid bias toward detecting differences. Using HC-derived boundaries to compare voxel intensities between the PD and HC groups would have inherently skewed the analysis toward larger differences.

Statistical analysis

Two-sample t-tests were used to test for differences between patients and healthy participants in midbrain NM (one-tailed testing for lower NM in PD than HC) and iron measures (one-tailed testing for higher susceptibility in PD than HC). The models included mean-centered regressors for age and sex. Significance was established based on a randomization procedure42 based on voxel-wise threshold-free cluster enhancement (TFCE) statistics43. Data were spatially smoothed by convolution with a 1 mm full width at half maximum isotropic Gaussian kernel. Permutation testing was used to control for multiple comparisons over space using TFCE and maximum permutation statistics with 10,000 permutations randomly shuffling group labels. Significance was accepted at a threshold of familywise error-rate adjusted p < 0.05. We also explored to what degree the MRI measures were correlated with one another. To this end, we computed Pearson’s correlation coefficient across MRI measures across subjects for each voxel. This was done separately for patients and healthy controls. The correlation coefficients were subsequently transformed using Fisher’s Z-transform followed by TFCE. A non-parametric null distribution was formed using 10,000 random permutations by randomly shuffling participant labels for one modality. Again, multiple comparisons adjustment was achieved by maximum permutation statistics and significance was accepted at a threshold of familywise error-rate adjusted p < 0.05. The analyses were implemented in Python (Python Software Foundation, v. 3.8.3).

We used separate linear mixed-effects models for each MRI modality (Fig. 1b and Supplementary Table 3). The models were fit using the R (R Core Team 2019, v4.0.4) package lme4. We used the mean MRI signal from each ROI from each hemisphere separately as dependent variable. Explanatory variables included fixed effects of interest (UPDRS-III scores contralateral to the ROI and LED), fixed effects of no interest (age, sex and hemisphere) as well as random intercepts for subjects. Patients’ age and sex were included since the Parkinson’s disease phenotype is linked to both age and sex91,92,93. p values were obtained using the lmerTest package94. We also tested whether adding disease duration as fixed effect would affect the results (Supplementary Table 4). Visual inspection of residual plots did not reveal any obvious deviations from homoscedasticity or normality. Statistical tests were two-tailed, and t-statistics and p values for model coefficients were calculated using Satterthwaite’s method. Significance was accepted at false discovery-rate adjusted p < 0.05.

Additionally, we explored whether the degree of asymmetry in our imaging measures (asymmetry of signal changes in the SN and nigrosome-1 in neuromelanin, R2* and QSM MR-images) was related to the asymmetry of motor symptoms. Asymmetry of motor severity was calculated by subtracting the sum of UPDRS-III items from the left side of the body (left upper and lower extremity scores) from the sum of UPDRS-III items from the right side of the body (right upper and lower extremity scores). These models included the same regressors as above, but with ∆ mean ROI signal (mean signal in right ROI minus mean signal in left ROI) instead of mean ROI signal as dependent variable and excluding the factor ‘hemisphere’ (Supplementary Table 1). We also tested whether adding disease duration as fixed effect would affect the results (Supplementary Table 2). For all statistical tests, significance was accepted at a threshold of family-wise error rate adjusted p < 0.05.