Introduction

Multiple sclerosis (MS) is one of the most common causes of non-traumatic disability among young adults1. It is a chronic inflammatory, demyelinating, and neurodegenerative disease of the central nervous system, with neurodegenerative processes, which manifests, among other changes, as progressive accumulation of T2-hyperintense white matter lesions, microstructural abnormalities in normal-appearing white matter, and atrophy of multiple brain structures, detectable through magnetic resonance imaging (MRI)2. Besides being classically defined by motor and sensory symptoms, MS also affects cognition in 40–70% of patients3, with cognitive impairment contributing to reduced employment, lower quality of life, and poorer adherence to treatment4,5,6,7,8. While cognitive deficits such as those affecting processing speed, executive functions, attention, working memory, verbal episodic memory, and visuospatial episodic memory have been consistently studied in MS3,9, other functions, particularly within the social cognitive domain, remain comparatively understudied. Social cognition refers to the set of cognitive functions that enable individuals to process, store, recognize, manipulate, and apply socially relevant information10,11. In MS, some impairments in social cognition have been documented11, including difficulties in facial emotion recognition12 and theory of mind - the ability to infer others’ thoughts, feelings, and beliefs12,13. Increasing evidence has linked these deficits to both grey matter pathology13,14,15,16 and white matter damage12,13,17,18,19,20, and studies using resting-state functional imaging have highlighted the involvement of frontal and temporal networks in these alterations21,22,23,24. However, other components of social cognition, such as social memory, remain largely unexplored in this population.

Social memory is a key component of declarative memory, defined as the ability to encode and retrieve the identity and previous interactions with others25,26. This capacity is essential for navigating complex social environments, sustaining interpersonal bonds, and adapting behavior based on past experience27. Importantly, deficits in social memory could plausibly contribute to the other well-documented impairments in social cognition observed in MS.

Overcoming the prior absence of suitable tools to assess social memory27,28, a social navigation task was developed to mimic real-life social interaction in a dynamic process in which participants make choices during encounters with virtual characters that reflect their evolving perception of each character29. These choices are used to construct an individual “social map” structured along two dimensions, power (reflecting dominance or submission) and affiliation (reflected in sharing private information or physical touch), the two main factors that influence relationships29. Derived metrics from this map have been shown to predict activity in the hippocampus and in the precuneus/posterior cingulate cortex (PPCC)29,30. At the end of the task, a multiple-choice questionnaire is administered to assess recall of the characters involved in specific social events encountered during its progression, providing a behavioral measure of episodic social memory retrieval29. Together, the navigation task and recall questionnaire provide complementary behavioral indices of social memory, encompassing both the dynamic mapping of social relationships and the explicit retrieval of socially relevant episodes.

In the present study, we aimed to assess social memory function in people with MS using a behavioral framework that included a social navigation task performed during task-based functional MRI acquisition, and an episodic social memory recall questionnaire. We hypothesized that MS patients would show impaired social memory. In addition, we sought to investigate whether differences in social memory performance were associated with functional or structural brain changes. Specifically, we predicted that lower social memory performance in MS would be associated with reduced volume (as measured on T1-weighted imaging) or altered activity of the hippocampus or PPCC, higher T2-hyperintense white matter lesion burden, and greater microstructural abnormalities in normal-appearing white matter as assessed by diffusion tensor imaging metrics. We report novel evidence suggesting that MS patients exhibit impairments in episodic social memory recall, which correlate with right PPCC volume and white matter lesion load.

Results

Behavioral data: social navigation and episodic social memory recall

We first characterized participants’ behavioral profiles, focusing on two distinct components of social memory: social navigation and episodic social memory recall. Demographic and clinical characteristics, including age, sex, educational level, and Hospital Anxiety and Depression Scale (HADS) scores, are presented in Table 1. For the MS group, we additionally report time since first symptom, total number of relapses, Expanded Disability Status Scale (EDSS) score, and Modified Fatigue Impact Scale (MFIS) score. No significant group differences were found in any of these variables.

Table 1 Demographic and clinical characteristics of the MS and control groups.

To assess implicit social navigation, we analyzed the final cosine and vector length values for each character (Fig. 1), using two-way ANOVAs with group (MS vs. healthy controls) as a between-subject factor and character identity (C1–C5) as a within-subject factor. Analyses revealed a significant main effect of character identity for both metrics (Fig. 2), but no significant main effect of group (cosine: p = 0.986; vector length: p = 0.726) or group-by-character interaction (cosine: p = 0.512; vector length: p = 0.536). Multiple post-hoc pairwise differences between characters were identified. Similarly, when participants were asked to explicitly place each character on a two-dimensional affiliation–power map at the end of the task, their placements did not differ between groups (Supplementary Fig. S1).

Fig. 1
Fig. 1
Full size image

Schematic representation of the social navigation task performed during fMRI. (a) Overview of MRI acquisition sequence and structure of the social navigation task. MRI scans were acquired in the chronological order shown from left to right: T2-weighted FLAIR imaging, T1-weighted imaging, task-based fMRI, and diffusion tensor imaging (DTI). fMRI was acquired during a structured, narrative-driven task in which participants assumed the role of a person relocating to a new city. The task included three types of slides: narrative slides, which presented contextual information or character dialogue; decision slides, in which participants chose between two response options; and black slides, which served as a “baseline” for fMRI analysis. Each decision slide was followed by a black screen upon the participant’s response; together, these lasted a total of 12 s. (b) Conceptual representation of the latent social space underlying the task. This two-dimensional space was defined by orthogonal axes of affiliation (x-axis; related to intimacy, empathy, or emotional closeness) and power (y-axis; related to hierarchy, dominance, or authority). Prior to the experiment, all possible interaction options were pre-classified as reflecting either affiliation or power, with each choice scored as + 1 or − 1 depending on whether it reinforced or opposed the respective dimension. These cumulative scores determined the evolving position of each character (C) in the social space, relative to a fixed central position representing the participant (6,0). The resulting position of each character was represented as a vector (V), with the angle (θ) encoding the relative weighting between power and affiliation in the relationship, and the length indicating the character’s distance in this latent space. (c1, c2) Example social maps from two participants. Each map illustrates the trial-by-trial trajectory of the five main characters through the two-dimensional power-affiliation space. Colored lines represent the cumulative path of each character, beginning at the neutral origin (0, 0) and evolving according to the participant’s decisions across 12 interactions. The maps demonstrate individual variability in how social relationships were shaped throughout the task.

Fig. 2
Fig. 2
Full size image

Group comparison of social navigation metrics and episodic social memory recall. (a) Cosine of the final vector angle associated with each of the 5 main characters (C1-C5), reflecting the direction of the social vector in the power-affiliation space at task end. A two-way ANOVA revealed a significant main effect of character identity (F(3.46, 159) = 5.71, η2p = 0.11, p < 0.001), while the effect of group (F(1, 46) = 0.00, η2p = 0.00, p = 0.986) and the interaction term (F(4, 184) = 0.82, η2p = 0.12, p = 0.512) were not significant. Post-hoc Tukey’s multiple comparisons test showed significant differences between C1 and C2 (p < 0.001) and C1 and C4 (p = 0.004), while other comparisons were not significant. (b) Final vector length, representing social distance, for each character. A two-way ANOVA again revealed a significant main effect of character (F(3.82, 176) = 6.60, η2p = 0.13, p < 0.001), with no significant effect of group (F(1, 46) = 0.12, η2p = 0.00, p = 0.726) or interaction (F(4, 184) = 0.79, η2p = 0.02, p = 0.536). Post-hoc Tukey’s multiple comparisons test revealed shorter distances for character C5 relative to C1 (p = 0.009), C2 (p < 0.001), and C3 (p = 0.018), as well as a difference between C2 and C4 (p = 0.019). All other comparisons were not significant. (c) Performance on the episodic social memory recall questionnaire, in which participants were asked to identify which of the 6 characters was involved in specific social events of the storyline. MS patients correctly identified fewer characters than healthy controls, reflecting lower memory performance (U = 154, Z = − 2.93, r = − 0.42, p = 0.003, Mann–Whitney test). (d) Residual memory scores derived from a linear regression model in which age, sex, educational level, and HADS score were included as predictors of the number of correct responses on the episodic social memory questionnaire. The residuals, representing the portion of variance not explained by these covariates, were then compared between groups. A significant difference between groups remained (U = 168, Z = − 2.62, r = − 0.37, p = 0.009, Mann–Whitney test). Panels a and b show group means and standard deviations, with individual data points overlaid. Panels c and d display boxplots representing medians and interquartile ranges. C1–C5 correspond to the five main characters in the storyline, labeled according to the order in which they were introduced. MS: n = 26; Controls: n = 22. p * < 0.05; ** p < 0.01; *** p < 0.001.

In contrast, MS patients scored significantly lower than healthy controls on the questionnaire designed to assess episodic social memory recall - defined as the conscious retrieval of specific social interactions and events (p = 0.003; MS median = 25; controls median = 21) (Fig. 2). To determine whether this difference could be explained by demographic variables or symptoms of anxiety and depression, we used a linear regression model to predict questionnaire scores based on age, sex, educational level, and HADS scores. We then compared the residuals - representing performance unexplained by these variables - between groups. The difference in performance remained significant after this adjustment (p = 0.009) (Fig. 2). Furthermore, exploratory analyses revealed that worse episodic social memory recall was associated with longer disease duration (time since first symptom), greater number of relapses, and higher EDSS scores within the MS group, while no significant correlation was found with MFIS score (Supplementary Fig. S2). No significant group differences were found in any of the additional cognitive or self-report questionnaires, including global cognition (Montreal Cognitive Assessment - MoCA), social anxiety and avoidance (Liebowitz Social Anxiety Scale), general self-efficacy (General Self-Efficacy Scale), personality traits (Neuroticism, Conscientiousness from the NEO Five-Factor Inventory), or subjective social status across three domains (MacArthur Scale for society, neighborhood, and employment) (Supplementary Fig. S1).

Neuroimaging correlates of social memory dysfunction

Given the behavioral evidence for deficits in social memory, we next investigated whether neural correlates in the hippocampus and PPCC - regions implicated in social navigation29,30 - might underlie this impairment. Prior studies had shown that the cosine of the social vector angle correlates with hippocampal activity, while vector length correlates with activity in the PPCC.

Group comparisons of blood oxygenation level dependent (BOLD) signal revealed no significant differences, using both region-of-interest (ROI)-based analyses (with cosine angle as predictor for hippocampal BOLD and vector length for PPCC BOLD) or whole-brain approaches. Similarly, within-group analyses did not identify significant associations in these regions or elsewhere. In these analyses, no clusters emerging at uncorrected thresholds (p < 0.001) survived correction for multiple comparisons using family-wise error (FWE).

Structurally, MS patients showed significantly reduced hippocampal volumes bilaterally compared to healthy controls (Fig. 3). While smaller hippocampal volumes in MS patients tended to be associated with worse episodic social memory recall, these correlations were not statistically significant (Fig. 3). No group differences were observed in PPCC volume (Fig. 3); however, within the MS group, smaller right-sided PPCC volumes were significantly associated with lower episodic social memory recall scores (p = 0.032), with a similar but non-significant trend on the left (p = 0.063) (Fig. 3). These associations remained directionally consistent, although not statistically significant, when volumetric measures were correlated with residual memory scores derived from a model controlling for age, sex, education, and HADS scores (Supplementary Fig. S3).

Fig. 3
Fig. 3
Full size image

Volume of the hippocampus and PPCC - comparison between groups and association with episodic social memory recall in MS. (a1, a2) Comparison of hippocampal volume adjusted to eTIV between MS and control participants. MS patients exhibited significantly reduced right hippocampal volume (a1; U = 187, Z = − 2.24, r = − 0.32, p = 0.025, Mann–Whitney test) and left hippocampal volume (a2; U = 156, Z = − 2.87, r = − 0.41, p = 0.004, Mann–Whitney test). (b1, b2) Association between hippocampal volume/eTIV and performance on the episodic social memory questionnaire in the MS group. Both right (b1; ρ = 0.11, p = 0.578, Spearman correlation) and left (b2; ρ = 0.14, p = 0.494, Spearman correlation) hippocampal volumes showed a non-significant trend toward lower volume being associated with fewer correct responses. (c1, c2) No significant group differences were found in PPCC volume/eTIV, either in the right (c1; U = 256, Z = − 0.86, r = − 0.12, p = 0.389, Mann–Whitney test) or left hemisphere (c2; U = 276, Z = − 0.46, r = − 0.07, p = 0.645, Mann–Whitney test). (d1, d2) In the MS group, right PPCC volume/eTIV was significantly and positively correlated with the number of correct responses on the episodic social memory questionnaire (d1; ρ = 0.42, p = 0.032, Spearman correlation). A similar, non-significant trend was observed for the left PPCC (d2; ρ = 0.37, p = 0.063, Spearman correlation). Boxplots display medians and interquartile ranges; scatterplots represent individual MS patients. Regression lines and 95% confidence intervals are shown for panels b1, b2, d1, and d2. P-values from each Spearman correlation are displayed in the bottom-left corner of each scatterplot; significant values (p < 0.05) appear in bold. MS: n = 26; Controls: n = 23. * p < 0.05; ** p < 0.01; eTIV, estimated total intracranial volume.

Given the central role of white matter pathology in MS - manifesting both as visible lesions and more diffuse microstructural abnormalities - we investigated whether white matter integrity was associated with episodic social memory recall. Lesion burden was first quantified using a segmentation tool for T2-hyperintense white matter lesions visible on T2-FLAIR images, yielding estimates of total lesion count and lesion volume. As expected, MS patients exhibited significantly higher lesion counts and lesion volume compared to healthy controls (Fig. 4). Importantly, in the MS group, higher lesion count (p = 0.017) and greater lesion volume (p = 0.002) were both significantly associated with poorer performance on the episodic social memory recall questionnaire (Fig. 4). After adjusting for age, sex, educational level, and HADS score, the association with lesion volume remained statistically significant (p = 0.002) (Supplementary Fig. S4).

Fig. 4
Fig. 4
Full size image

T2-FLAIR hyperintense white matter lesion load and its association with episodic social memory recall in MS. (a) Total number of T2-FLAIR hyperintense white matter lesions. MS patients exhibited significantly higher lesion counts than healthy controls (U = 537, Z = 5.17, r = 0.75, p < 0.001, Mann–Whitney test). (b) Total volume of T2-FLAIR hyperintense white matter lesions. Lesion volume was significantly greater in the MS group (U = 557, Z = 5.56, r = 0.80, p < 0.001, Mann–Whitney test). (c) Association between total lesion count and the number of correct responses on the episodic social memory questionnaire in the MS group. A significant negative correlation was observed (ρ = − 0.47, p = 0.017, Spearman correlation), indicating that higher lesion count was associated with poorer memory performance. (d) Lesion volume was also significantly and negatively correlated with the number of correct responses on the episodic social memory questionnaire (ρ = − 0.60, p = 0.002, Spearman correlation). Boxplots display medians and interquartile ranges; scatterplots represent individual MS patients. Regression lines and 95% confidence intervals are shown in panels c and d. P-values from each Spearman correlation are displayed in the bottom-left corner of each scatterplot. MS: n = 25; Controls: n = 23. *** p < 0.001.

To assess the contribution of microstructural damage in normal-appearing white matter, we analyzed diffusion tensor imaging metrics. As expected, MS patients showed reduced fractional anisotropy and increased mean diffusivity in white matter regions without T2-FLAIR hyperintensity compared to healthy controls (Fig. 5). In the MS group, both lower fractional anisotropy (p < 0.001) and higher mean diffusivity (p = 0.019) (Fig. 5) in normal-appearing white matter were significantly associated with worse performance on the episodic social memory recall task. These associations remained statistically significant after adjusting for age, sex, educational level, and HADS score (fractional anisotropy: p = 0.007; mean diffusivity: p = 0.009) (Supplementary Fig. S5).

Fig. 5
Fig. 5
Full size image

Normal-appearing white matter diffusion metrics and their association with episodic social memory recall in MS. (a) Fractional anisotropy in normal-appearing white matter. MS patients exhibited significantly lower FA compared to healthy controls (U = 109, Z =- 3.41, r = −0.50, p < 0.001, Mann–Whitney test). (b) Mean diffusivity in normal-appearing white matter. Mean diffusivity was significantly higher in the MS group (U = 421, Z = 3.45, r = 0.51, p < 0.001, Mann–Whitney test). (c) Association between fractional anisotropy and performance on the episodic social memory questionnaire in the MS group. A significant positive correlation was observed (ρ = 0.65, p < 0.001, Spearman correlation), indicating that lower fractional anisotropy was associated with poorer memory performance. (d) Mean diffusivity was significantly and negatively correlated with the number of correct responses on the episodic social memory questionnaire (ρ = − 0.48, p = 0.019, Spearman correlation), indicating that higher MD was associated with worse memory performance. Boxplots display medians and interquartile ranges; scatterplots represent individual MS patients. Regression lines and 95% confidence intervals are shown in panels c and d. P-values from each Spearman correlation are displayed in the bottom-left corner of each scatterplot. MS: n = 24; Controls: n = 22. *** p < 0.001.

Discussion

In addition to motor and sensory symptoms, cognitive impairment is highly prevalent in MS, affecting approximately 40–70% of patients3 and contributing to higher rates of unemployment, poorer quality of life, and reduced treatment adherence4,5,6,7,8. Nonetheless, certain cognitive functions - including social memory - remain understudied, often due to the lack of reliable assessment tools27,28. In this context, our study provides novel evidence of a deficit in episodic social memory in MS, along with insights into its neural correlates.

Our findings are consistent with previous reports of declarative episodic memory impairment in MS, particularly in verbal and visuospatial domains3,9, and it remains unclear whether social memory deficits reflect a distinct phenomenon or part of broader episodic memory dysfunction. Importantly, in our study, while recall of social episodes was significantly impaired in the MS group, social navigation performance was comparable between groups, suggesting the retrieval of discrete social experiences and the construction of broader “social space” representations may rely on at least partially distinct cognitive and neural mechanisms. Although our social memory questionnaire assessed recall performance, the observed deficit may reflect impairments during encoding, retrieval, or both; future work should explicitly disentangle these processes, similar to what has been done in animal models31 and in studies of verbal episodic memory in MS32. Social memory deficits in MS may also reflect the influence of other impaired domains, consistent with network-based analyses in healthy individuals showing that various cognitive functions, including social cognition, executive function, working memory, and language, are interrelated, forming components of a cognitive network33.

These findings also expand the understanding of social cognitive deficits in MS, adding to the established literature on theory of mind11. Notably, while theory of mind deficits are not consistently associated with disease duration or disability13, we found that episodic social memory recall performance was significantly associated with disease duration, number of relapses, and EDSS scores, suggesting that it may reflect cumulative disease burden. Moreover, the presence of these deficits in patients with otherwise preserved global cognition mirrors previous findings showing a dissociation between theory of mind and general cognitive performance13, positioning episodic social memory recall as a potentially sensitive marker of disease progression.

Although previous studies using this task have reported associations between social navigation metrics and activity in the hippocampus and PPCC in healthy individuals29,30, we did not replicate those findings in our control group. This divergence may reflect technical and sample-related differences. While earlier studies employed relatively homogeneous samples, our participants were recruited from a more naturalistic, community-based population, with broader variability in age and educational level (Table 1), in a different linguistic and cultural background. In particular, variations in how participants interpret social cues or assign meaning to the narrative may have influenced neural responses to the task. Furthermore, the naturalistic structure of the task, while enhancing ecological validity, introduces behavioral variability that may obscure subtle neural effects in smaller samples. Additionally, although the cosine and vector length metrics were used based on prior studies, they may not fully capture the complexity of social mapping mechanisms. Importantly, the task nonetheless demonstrated behavioral sensitivity in the differentiation between characters observed across both groups. Whether preserved social navigation in MS is maintained through undetected compensatory neural mechanism remains unclear and could be further explored using complementary approaches, such as with resting-state fMRI.

Whereas functional activation of the hippocampus and PPCC did not differ between groups, we observed bilateral hippocampal atrophy in MS patients, consistent with previous reports34, along with a significant association between reduced right PPCC volume and poorer recall performance. Our volumetric data suggest that the PPCC - and possibly the hippocampus - may contribute to episodic social memory retrieval. This interpretation is supported by prior evidence implicating the posterior cingulate cortex35 and the precuneus35,36 in other episodic memories retrieval. In addition, reduced precuneus volume has been associated with impaired social cognition37, while posterior cingulate gyrus volume has been specifically linked to theory of mind performance13,38. Likewise, hippocampal atrophy has been related to deficits in verbal34,39 and visuospatial episodic memory39. While MS is classically characterized by demyelinating white matter lesions, it is also increasingly recognized as a condition of widespread neurodegeneration and grey matter pathology, including parenchymal atrophy40,41,42,43, potentially reflecting both neuronal44,45 and synaptic loss45. Importantly, grey matter damage has consistently shown stronger associations with cognitive impairment than white matter abnormalities46,47. Our results therefore contribute to the growing literature implicating cortical atrophy in cognitive decline in MS48,49. Given the heterogeneity of the hippocampus along its longitudinal axis in terms of gene expression, electrophysiology, connectivity, and function50,51, future studies could explore whether subregional volumetric differences, particularly along the anterior-posterior axis52, might serve as more sensitive markers of social memory dysfunction in MS. Moreover, reductions in the volume of subcortical grey matter structures, including the thalamus53,54 and basal ganglia54, have also been associated with cognitive impairment in MS and warrant investigation in the context of social memory. Finally, while our findings rely solely on cross-sectional structural data, longitudinal changes in regional brain volume may provide complementary insight into cognitive dysfunction in MS55.

Our findings also underscore the relevance of white matter pathology in episodic social memory dysfunction. Although traditionally underappreciated, white matter integrity is now increasingly recognized as fundamental to cognition56, and more specifically to social cognition57. In MS, both T2-FLAIR hyperintense lesions and microstructural abnormalities in normal-appearing white matter represent hallmarks of its pathology and contribute to cognitive dysfunction3. In particular, T2-hyperintense lesions have been linked to deficits in theory of mind and facial emotion recognition12. In the present study, both greater lesion volume and higher lesion count were associated with poorer episodic social memory recall. However, only the association with lesion volume remained statistically significant after adjusting for age, sex, education, and symptoms of anxiety and depression. This supports the notion that lesion volume may more accurately reflect the functional impact of white matter damage on distributed cognitive networks than lesion count, which may be underestimated due to lesion confluence. These findings are consistent with prior studies linking lesion volume to cognitive impairment58,59. Interestingly, there is considerable histopathological heterogeneity in demyelination, and it remains unclear whether distinct patterns of demyelination60, axonal injury61, remyelination62, or the presence of chronic active lesions63 differentially impact social memory function. Beyond T2-FLAIR hyperintense lesions, normal-appearing white matter exhibits diffuse damage, including widespread inflammation, microglial activation, and axonal loss64. In line with this, MS patients in our study exhibited significantly reduced fractional anisotropy and increased mean diffusivity in normal-appearing white matter, reflecting diffuse microstructural damage. Crucially, both lower fractional anisotropy and higher mean diffusivity were significantly associated with poorer episodic social memory recall performance, and these associations remained robust after adjusting for age, sex, education, and symptoms of anxiety and depression. These findings align with prior studies reporting associations between fractional anisotropy and mean diffusivity metrics of white matter and verbal or visuospatial episodic memory performance in MS65. Reductions in fractional anisotropy particularly in normal-appearing white matter have also been associated with cognitive impairment66. Furthermore, reduced fractional anisotropy and increased mean diffusivity across widespread white matter tracts have been associated with social cognitive deficits in theory of mind in MS18. Although our analysis focused on global diffusion metrics, future studies with larger samples employing tract-specific approaches could provide more precise insights into disconnection mechanisms underlying social memory deficits in MS.

While our findings indicate macroscale neural correlates of social memory impairment in MS, the specific circuits involved remain undefined. In rodent models, social memory has been shown to depend on the activity of distinct hippocampal subregions, particularly dorsal CA2 and ventral CA1, as well as projections from the latter to the nucleus accumbens31,67. Future studies could specifically investigate whether alterations in these hippocampal subfields or their downstream projections contribute to episodic social memory deficits in MS. Hippocampal CA2 subregion, in particular, displays a distinctive molecular profile68, including high expression of vasopressin receptor 1b and oxytocin receptor, positioning it as a potential hub for neuromodulatory regulation of social memory69,70. Moreover, the unique molecular signature of CA2 also makes it an appealing therapeutic target - for instance, through pharmacological modulation of TREK-1 channels using agents such as selective serotonin reuptake inhibitors71,72.

This was a unicentric, observational, and cross-sectional study, which limits the extent to which causal inferences can be made. The relatively small sample size reduces statistical power and limits the generalizability of our findings. Moreover, the inclusion of only mildly impaired patients (EDSS < 4) was necessary to ensure the feasibility of the task but restricts extrapolation to individuals with more advanced disease. Additionally, the social memory deficit we report may be influenced by other cognitive domains that are frequently affected in MS, as discussed above. Although the MoCA did not reveal significant group differences, this may reflect the limited scope and sensitivity of the screening tool. Future studies with larger samples and more comprehensive cognitive assessment will be needed to better disentangle the contribution of other cognitive processes to social memory impairment. Finally, although the task captures relevant aspects of social interaction, it does not fully reflect the complexity and personal specificity of real-world social experiences. More immersive or individually tailored paradigms may be better suited to detect subtler deficits.

In conclusion, this study provides seminal evidence of a novel cognitive deficit in MS - impaired episodic social memory - and identifies associated structural correlates involving both grey and white matter. While these findings should be interpreted with caution given the study’s limitations, they contribute to refining the cognitive characterization of MS. This approach may also be extended to other neuropsychiatric conditions in which social memory may be affected.

Methods

Participants and study design

MS patients were recruited from Neurology outpatient consultations at Hospital de Braga, Portugal. Inclusion criteria were: age between 18 and 55 years, and a diagnosis of relapsing-remitting or secondary progressive MS according to the 2017 McDonald criteria73. Exclusion criteria were: EDSS score ≥ 4 (used to exclude individuals with relevant physical disability that could compromise task execution or introduce confounding variability in cognitive performance); corticosteroid use within 8 weeks prior to cognitive and neuroimaging assessment; a history of other neurodegenerative or systemic diseases affecting the central nervous system; history of head trauma with loss of consciousness; a history of substance use disorders; current use of medications with potential cognitive effects (trospium chloride, oxybutynin, tricyclic antidepressants, amphetamines, or modafinil); illiteracy or inability to understand written informed consent; and any contraindication to MRI. Control participants were recruited via patient referral, typically acquaintances or family members, and were required to meet the same inclusion and exclusion criteria, except those specifically related to MS. Demographic and clinical data collected included age, sex, educational level (dichotomized as up to mandatory schooling - i.e., completion of 12 years of formal education - versus higher education), HADS score, and MFIS score. For the MS group, additional variables were recorded: time since first symptom, number of relapses, and EDSS score. All participants were subsequently contacted to schedule the behavioral and neuroimaging assessment session. Importantly, all participants who met inclusion criteria and were recruited successfully completed the cognitive and neuroimaging assessments. No exclusions occurred between recruitment and evaluation. Missing data in some analyses were solely due to technical issues during MRI acquisition or task execution, as detailed below. The session began with MRI acquisition and completion of the social navigation task, followed by the episodic social memory recall questionnaire, explicit spatial mapping of social relationships, global cognitive testing, and psychosocial self-report measures (described below). All participants provided written informed consent prior to participation. The study was approved by the Ethics Committee of Hospital de Braga (reference 173_2022). All procedures were conducted in accordance with relevant guidelines and regulations, and the research was performed in accordance with the Declaration of Helsinki. A total of 49 individuals participated in the study, including 26 MS patients and 23 healthy controls. Due to a hardware malfunction during the social navigation task, response data and corresponding fMRI measures were unavailable for one control participant, resulting in a final sample of 22 healthy controls for those analyses. Additionally, for one MS patient, the 3D T2-FLAIR sequence was not acquired due to a protocol error; consequently, T2-hyperintense lesion burden metrics were computed for 25 MS patients. Regarding diffusion imaging, data were excluded for two MS patients - one due to the missing 3D T2-FLAIR sequence and another due to incomplete acquisition - as well as for one control participant due to severe image artifacts; the final diffusion analysis thus included 24 MS patients and 22 healthy controls.

Social navigation task

The first behavioral assessment consisted of the social navigation task performed during fMRI acquisition. The task was adapted from the original paradigm described by Tavares et al.29. Although not a game in the strict sense, the task followed a role-playing format: participants assumed the protagonist role of a person relocating to a new town, aiming to secure employment and housing through interactions with local residents. The task was implemented using PsychoPy 3.0 (https://www.psychopy.org), which controlled stimulus presentation and response collection during scanning. The total task duration was 32 min and 51 s (during which 1350 functional volumes were acquired).

Participants were presented with slides showing cartoon characters “speaking” through word bubbles. These included only character images and written text, with no spatial or contextual visual cues. Each character had distinct traits that implicitly conveyed aspects of their social role. While lying in the scanner, participants used response boxes to choose one of two predefined replies to each interactive slide. These choices were designed to shape the evolving relationship with each character and were preclassified as modulating either the power dimension (i.e., giving or receiving instructions, orders, or demands) or the affiliation dimension (i.e., engaging in personal conversation or accepting/initiating physical contact). Although the task structure resembled a “choose-your-own-adventure” narrative in terms of participant engagement, the storyline and sequence of interactions were fixed and not contingent on previous choices. Participants were instructed to respond naturally and intuitively, as they would in real-life social situations.

Each of the 5 main characters appeared in 12 interactive trials (6 affecting power, 6 affecting affiliation), and these interactions cumulatively determined the character’s trajectory within an abstract, participant-centered two-dimensional social space defined by power (vertical axis) and affiliation (horizontal axis) (Fig. 1). A sixth, neutral character was also included and appeared in 3 socially neutral interactions that did not involve shifts in power or affiliation. This character remained fixed at the center of the social space (coordinates 0,0) and served to simulate neutral social interactions, as in previous implementations of the task. At the beginning of the task, all main characters were assigned the same neutral starting position on both axes, at coordinates (0, 0). The participant’s reference point was fixed at coordinates (6, 0). The evolving relationship between participant and character was represented as a vector originating from the participant’s fixed position and pointing toward the character’s current location. With each new decision, the character moved one unit along one of the axes (up/down for power; toward/away for affiliation). These updates yielded a dynamic vector whose angle cosine represented the orientation in power-affiliation space, and whose length quantified the absolute social distance.

These parameters were computed continuously and later used to model neural activity across trials. The final position of each character reflected the cumulative outcome of interactions and served as a behavioral signature of that character’s location in the participant’s implicit social map (Fig. 1).

The task was translated into Portuguese by two independent researchers. To assess clarity and alignment with the original classifications, 10 volunteers independently labeled each of the 60 interactive trials as targeting either power or affiliation. Overall agreement with the predefined categories was approximately 98%, with 590 out of 600 classifications matching the original. Specifically, two participants disagreed on three trials each, and another two on two trials each, with discrepancies distributed heterogeneously across items. Volunteers also rated the polarity of each response (+ 1 or − 1) based on whether the option was expected to increase or decrease power/affiliation, achieving 100% agreement with the predefined scoring system. This preliminary testing also served to estimate adequate reading times: narrative slides were displayed for 2 to 19 s, while decision slides were shown for up to 12 s (Fig. 1). Once a participant responded, the decision slide was replaced by a blank black screen, which remained visible until the 12-second window had elapsed. This allowed for a variable inter-trial interval that served as “baseline” for fMRI analysis.

Eight task versions were created to accommodate participant age and linguistic adaptations to match the participant’s gender. Four versions used older-looking characters for participants aged 35 and above, while four others used younger-looking characters. These versions also ensured balanced character gender representation.

Episodic social memory questionnaire

Immediately after the task, participants completed a 30-item multiple-choice questionnaire assessing episodic social memory recall. As no explicit instruction to memorize was given, encoding was assumed to occur implicitly during the social navigation interactions. Each item referred to a specific social event that had occurred during the narrative (e.g., “Who invited you to dinner?”), and participants were asked to identify which character had been involved. All 6 characters from the task were included as response options for each question. Each item had a single correct answer. The questionnaire version was matched to the specific storyline used for each participant to ensure consistency in character roles and response accuracy across adapted versions. The total number of correct responses was used as a behavioral index of episodic social memory recall.

Explicit spatial evaluation of social relationships, global cognition, and psychosocial self-report measures

Participants subsequently completed an explicit spatial evaluation of social relationships, as well as global cognitive and psychosocial questionnaires. They were asked to position each of the 6 characters on a two-dimensional grid defined by power and affiliation axes, based on how they perceived each character in relation to themselves. For each character, the cosine of the social vector angle and the vector length were computed and used as metrics of participants’ explicit spatial representation of social relationships. Global cognitive functioning was assessed with the MoCA. The remaining instruments were self-report measures: the Neuroticism and Conscientiousness subscales of the NEO Five-Factor Inventory were used to assess personality traits; the Liebowitz Social Anxiety Scale measured social fear and avoidance; the General Self-Efficacy Scale assessed perceived general self-efficacy; and the MacArthur Scale of Subjective Social Status evaluated subjective social rank across society, neighborhood, and employment domains.

MRI data acquisition

All scans were acquired on a 3 T Siemens Magnetom Avanto scanner using a 32-channel head coil. The protocol began with the acquisition of structural images: a 3D T2-weighted FLAIR sequence (repetition time/echo time [TR/TE] = 5000/392 ms; inversion time [TI] = 1600 ms; voxel size = 0.56 mm isotropic; 288 sagittal slices; field of view [FOV] = 250 × 250 × 161 mm), followed by a 3D T1-weighted magnetization-prepared rapid acquisition with gradient echo sequence (TR/TE = 2450/4.13 ms; TI = 1100 ms; flip angle = 9º; voxel size = 1 mm isotropic; 176 sagittal slices; FOV = 256 × 256 × 256 mm). Functional data were collected during performance of the social navigation task using a T2*-weighted echo-planar imaging sequence (TR/TE = 1460/12 ms; flip angle = 90º; voxel size = 3 mm isotropic; 44 interleaved transverse slices; anterior-to-posterior encoding; FOV 200 × 200 mm). A gradient field map with matching geometry was also acquired (TR = 700 ms; TE1/TE2 = 5.19/7.65 ms; flip angle = 54°; voxel size = 3 mm isotropic; 44 interleaved transverse slices; right-to-left encoding; FOV = 200 × 200 mm). The first 8 s of each BOLD acquisition were discarded prior to preprocessing to allow for MR signal stabilization. Diffusion-weighted data were acquired using a single-shot spin-echo echo-planar imaging sequence (TR/TE = 5600/93 ms; voxel size = 2.0 × 2.0 × 3.5 mm; 44 interleaved transverse slices; anterior-to-posterior encoding; FOV = 220 × 220 mm), with 64 non-collinear diffusion directions and b-values of 0 and 1000 s/mm². To facilitate correction of susceptibility-induced distortions, an additional image with reversed phase-encoding direction (posterior-to-anterior) was acquired using identical parameters.

MRI preprocessing and analysis

DICOM files were converted to NIfTI format using dcm2niix74. Structural and functional MRI preprocessing was performed using fMRIPrep 22.1.075.

T1-weighted images underwent intensity non-uniformity correction, skull stripping, tissue segmentation, cortical surface reconstruction, and normalization to Montreal Neurological Institute space. Regional brain volumes were then extracted using FreeSurfer utilities76. For volumetric analysis, bilateral hippocampal and PPCC volumes were normalized to estimated total intracranial volume (eTIV) by computing the volume-to-eTIV ratio.

T2-hyperintense white matter lesion segmentation was conducted using the Lesion Prediction Algorithm from the Lesion Segmentation Toolbox v3.0.077 in Statistical Parametric Mapping (SPM) software78. The algorithm was applied to 3D T2-FLAIR images to generate lesion probability maps and binary masks, yielding estimates of total lesion count and lesion volume.

Each BOLD run underwent skull-stripping, head motion and slice-timing correction, and co-registration to the T1-weighted image. Confound regressors were estimated, non-steady-state volumes were removed, and Independent Component Analysis Automatic Removal of Motion Artifacts was applied. The data were smoothed with a 6 mm FWHM Gaussian kernel, normalized to Montreal Neurological Institute space and projected to surface space. First- and second-level general linear model analyses were conducted in SPM12. For each participant, behavioral and timing files were generated from the social navigation task using a custom Python script. Behavioral files included interaction type, character identity, event onsets, reaction times, and parametric modulators (cosine of the vector angle and vector length). Timing files specified the onsets, durations, and types of task events. Two separate first-level general linear models were modeled per participant: both included regressors for narrative and decision trials, and were additionally modulated by either vector angle cosine or vector length. The six motion parameters were included as nuisance regressors in both models. All regressors and their temporal derivatives were convolved with the canonical hemodynamic response function and contrasted against baseline. Second-level (group-level) analyses included both ROI and whole-brain approaches. ROI analyses tested for BOLD signal modulation by vector angle cosine in the hippocampus and by vector length in the PPCC, based on prior literature29. Whole-brain exploratory analyses were also conducted. All functional analyses, including those conducted within the predefined ROI, were performed on a voxel-wise basis. One-sample t-tests were used to assess within-group effects, and two-sample unpaired t-tests were used to assess between-group differences. Statistical maps were thresholded at a voxel-wise level of p < 0.001 (uncorrected), with cluster-level correction for multiple comparisons using FWE at p < 0.05.

Diffusion tensor imaging data were processed with the FMRIB Software Library Diffusion Toolbox79. Geometric distortions were corrected using reversed-phase‐encoding images, followed by eddy-current and head motion correction with gradient reorientation. Nonbrain tissue was removed, and the diffusion tensor model was fitted within a subject-specific white matter mask to generate fractional anisotropy and mean diffusivity maps. A lesion mask from 3D T2-FLAIR was co-registered to diffusion space and subtracted from the white matter mask, which was further refined by removing voxels with fractional anisotropy ≤ 0.2. Mean fractional anisotropy and mean diffusivity were then extracted from the resulting normal-appearing white matter mask.

Statistical analyses

Normality of continuous behavioral, clinical, and demographic variables was assessed using the Shapiro–Wilk test, except for vector angle cosine and vector length metrics, which were analyzed using parametric two-way ANOVA. As detailed in Supplementary Table S1, the variables age, total HADS score, MacArthur–Employment, total number of relapses, EDSS score, and most importantly, number of correct answers in episodic social memory recall questionnaire did not follow a normal distribution. Therefore, nonparametric tests were consistently used, with the exception of cosine and length of social vectors.

Between-group comparisons for continuous variables were conducted using the Mann–Whitney U test, and categorical variables were analyzed using Fisher’s exact test. Associations between continuous variables were evaluated with Spearman’s rank correlation coefficient. For social navigation metrics (cosine of vector angle and vector length), we conducted two-way ANOVAs with group (MS vs. healthy controls) as a between-subject factor and character identity (C1–C5) as a within-subject factor, followed by Tukey’s post-hoc tests.

To control for potential confounders - specifically, age, sex, education level, and total HADS score - we used linear regression models to predict episodic memory recall scores. The residuals from these models, representing variance unexplained by those covariates, were then compared between groups using the Mann–Whitney U test and used in correlation analyses.

Descriptive statistics for each variable were selected to reflect the nature of the distribution and are specified in the figure legends. Statistical significance was defined as p < 0.05 (two-tailed).

All Mann–Whitney U tests were performed in IBM SPSS Statistics version 29.0, and other statistical tests and data visualizations were conducted in Prism version 10 (GraphPad Software).