Abstract
Body size estimation—the ability to judge the size and shape of one’s own body—is a key perceptual component of body image. However, its neural basis, and the basis for inter-individual differences in accuracy, remain poorly understood, partly due to limitations in existing assessment tools. We used an adapted, fMRI-compatible version of Somatomap 3D, an interactive task in which participants manipulate a rotatable 3D avatar by adjusting the size and shape of 26 individual body parts to match their perceived body. Twenty-eight healthy male and female adults completed the task during fMRI. Brain activity in a priori regions of interest from previous studies of body processing was modeled using a general linear model incorporating event-specific parameters and parametric modulators related to task performance. Inter-individual differences in body size estimation accuracy were calculated using multidimensional scaling of body part estimation errors, and scores were correlated with BOLD signal eigenvariates from regions of interest. Task engagement was associated with significant activation in hypothesized body-selective and multisensory regions, including bilateral extrastriate body area, right fusiform body area, right superior parietal lobule, and bilateral premotor cortex. Multidimensional scaling identified a primary subdimension reflecting distortions in body part girths, which was significantly associated with neural responses in bilateral superior parietal lobule. No other brain regions showed significant associations with inter-individual differences in estimation accuracy. These results suggest that body size estimation engages a distributed network of visual, motor, and parietal regions. Among these, only the superior parietal lobule showed a significant association with inter-individual variation in body size estimation accuracy for body part girths, supporting its role as a candidate neural substrate for altered body representation in psychiatric conditions such as eating disorders and body dysmorphic disorder.
Introduction
Perception of one’s body is a multidimensional construct shaped by visual, somatosensory, proprioceptive, and vestibular systems, integrated across distributed brain networks1. A key perceptual component of body image is body size estimation (BSE)—the ability to visually judge the size and shape of one’s own body2,3,4. Disruption in BSE is common in psychiatric conditions such as eating disorders (e.g., anorexia nervosa)5 and body dysmorphic disorder6. Yet, despite its clinical relevance, current tools to assess BSE often lack ecological validity and spatial specificity, and the neural mechanisms supporting BSE remain incompletely understood.
BSE tasks typically use metric and depictive techniques7. Metric methods involve direct size estimations using tools such as tape measures or calipers. Depictive methods—more commonly used—require participants to compare their perceived body size to visual representations, such as body drawings8 or size-modified silhouettes9. While informative, these methods typically use static, two-dimensional stimuli that lack the depth, orientation, and perspective of naturalistic visual body perception. Other paradigms using digitally distorted photographs10,11 or virtual avatar comparisons12,13 offer improved visual realism and body shape control, but often remain static and lack participant interaction. Interactive 3D approaches, such as manipulable avatars, allow participants to actively engage with a dynamic body model, better approximating real-world self-perception like viewing oneself rotating in front of a mirror. If the goal is to understand how visual processing of the body may be altered in conditions involving body image disturbance, it is essential to use stimuli that engage the visual system in a more dynamic manner.
To address these limitations, Somatomap 3D was developed. It is a digital tool that allows participants to interactively rotate and manipulate a 3D avatar by adjusting the size and shape of 26 body parts to match their perceived current body14. Unlike prior avatar-based paradigms that algorithmically applied fat distributions that scale larger or smaller across the whole body according to population averages7,15, Somatomap 3D enables localized estimation and adjustment of body parts. This feature is important given BSE distortions in conditions like anorexia nervosa often affect specific areas such as the abdomen, hips, or thighs5. Prior studies using Somatomap have identified body-part-specific over- and underestimations in nonclinical samples14 and in clinical samples of individuals with body dysmorphic disorder and anorexia nervosa5,6, and shows high test-retest reliability16. However, no study to date has combined body-part-specific 3D avatar manipulation with neuroimaging to investigate the neural basis of BSE. Identifying these mechanisms is essential for refining models of body perception and informing interventions for body image disturbances.
Prior research has implicated several brain regions in visual body processing and possibly BSE. These include the extrastriate and fusiform body areas (EBA and FBA), which show selective responses to human bodies and are implicated in body part recognition, self-referential processing, and the perception of body size and shape17,18,19,20. Also involved in body processing are the temporo-parietal junction (TPJ), a hub for multisensory integration, spatial perspective-taking, and body ownership10,21; the superior parietal lobule (SPL), associated with visuospatial attention and metric body representation10,22; the premotor cortex (PMC), given its role in motor imagery and mental body transformations12,23,24; and the primary visual cortex (V1), given its role in early visual processing and size perception25. Together, these regions formed the basis of our a priori regions of interest (ROI), selected based on theoretical and empirical links to perceptual and visuospatial aspects of the Somatomap 3D task.
The goal of this study was to characterize neural activation patterns associated with engagement in BSE, and those associated with its accuracy, using fMRI. We hypothesized that the EBA, FBA, TPJ, SPL, PMC, and V1 would be engaged during avatar manipulation, and that neural activity would scale with trial-by-trial variation in body part estimation accuracy. To address these complementary components of the primary aim, we examined activation during avatar engagement and performed parametric modulation analyses using BSE percent error, the trial-wise percent discrepancy between estimated and actual body part size, divided by the actual body part size. We also applied multidimensional scaling (MDS) to participants’ estimation error profiles across the 26 body parts. This unsupervised machine learning approach was motivated by the goal of characterizing inter-individual differences in how people perceive and represent their own bodies. MDS allowed us to reduce data complexity by summarizing participants’ perceptual profiles into a smaller number of latent dimensions that reflect systematic patterns in estimation errors across the body. These data-driven dimensions provide a compact, multivariate representation of BSE performance that captures how each individual’s pattern of errors is positioned within the structure of estimation patterns identified across participants. Using these MDS dimensions, as a secondary aim, we then tested how inter-individual differences along these dimensions are associated with BSE-related neural activation in the ROIs. We hypothesized that differences in participants’ overall patterns of body part estimation errors would be associated with activation in the same ROIs during body estimation. Finally, we conducted exploratory whole-brain event-related analyses and parametric modulation to explore broader neural responses related to task features. Together, these approaches aimed to map the functional neural activation patterns underlying BSE and those associated with BSE accuracy.
Methods
Participants
This study was conducted in accordance with the ethical principles of the Declaration of Helsinki and was approved by the Centre for Addiction and Mental Health Research Ethics Board (075–2021, 2022). Thirty participants (12 males, mean age = 24.43 ± 5.15, age range = 18–40, mean years of education = 15.83 ± 3.36) from the Greater Toronto Area were recruited as part of a larger study of visual processing. Participants met with a clinician who performed a structured diagnostic interview and were excluded if they met criteria for any current DSM-5 disorders, neurological disorder, or any major medical disorder that may affect cerebral metabolism (e.g., diabetes, thyroid disorders), were pregnant, could not safely be scanned (e.g., ferromagnetic implants or other devices), or had visual acuity (corrected or uncorrected) worse than 20/35 in each eye as verified with a Snellen close vision eye chart. All provided informed consent prior to enrolment. Two participants were excluded due to technical errors during data acquisition. The final analyzed sample comprised 28 participants (12 males and 16 females, mean age = 24.25 ± 5.28, age range = 18–40, mean years of education = 15.61 ± 3.36).
Procedures
Somatomap 3D task
Participants used Somatomap 3D to engage with a three-dimensional avatar on a computer screen, as previously described5,6,14. Participants used sliders to adjust 26 individual body parts on a rotatable 3D avatar to reflect their perceived current body shape (see Fig. 1). They were instructed to “configure the body (from the neck down) to best match your current body shape.” Adjustments were made one body part at a time, and participants were allowed to revisit and modify any previously adjusted parts until satisfied with the overall configuration. The body parts included: neck length and girth, shoulder width, bust girth, chest girth, abdomen protrusion, upper arm length and girth, elbow girth, lower arm length and girth, wrist girth, hand size and length, torso length, waist size, hip size and width, thigh length and girth, knee girth, lower leg length and girth, ankle girth, and feet width and length. Participants were given 10 min to complete the task and were given a warning when they had 3 min remaining.
Participants first completed the task on a desktop computer to get accustomed to using a trackball, and then performed the task again in the scanner. All data presented here corresponds to the scanner session, completed using an MR-compatible trackball (Current Designs, Inc., Philadelphia, United States). For this study, the Somatomap 3D task was packaged in Unity (v2021.1.5f1; Unity Technologies, San Franscisco, CA, United States) as a standalone Windows Desktop application. The WebLink software (v2.2.161; SR Research Ltd., Ottowa, ON, Canada) was utilized to capture screen recordings, eye movements, and mouse and keyboard events, including the fMRI triggers used to synchronize events with the scan (see Supplementary Materials S1 for more details).
Body measurements
Research staff obtained physical measurements of the corresponding 26 body parts after the Somatomap 3D fMRI task procedures. Physical measurements were taken using a tape measure and were completed twice for the left and right side (where applicable) and then averaged. See ref.26 for a detailed description of how each body part was measured. Height and weight measurements were recorded using a stadiometer and a calibrated scale, respectively.
Example of the Somatomap 3D interface. Participants selected either a masculine-appearing or feminine-appearing avatar and could adjust 26 different body parts using the slider located on the right. Body parts were navigated using the left and right buttons on the bottom panel, which corresponded to the previous and next body part, respectively. The currently selected body part was displayed centrally. Avatars could be freely rotated along the x- and y-axes to allow inspection and modification from multiple viewing angles.
Data acquisition
The MRI data were acquired using a Siemens MAGNETOM Prisma 3 T MRI scanner (Siemens Healthineers, Erlangen, Germany) at the Toronto Neuroimaging Facility located at the University of Toronto. Imaging was conducted with a 32-channel head coil. Structural images were acquired using a T1-weighted MPRAGE sequence (TR/TE: 2300/2.27 ms; flip angle: 8°; 256 × 256 matrix; voxel size: 1 mm3; 192 slices). Functional images were acquired using a multi-band echo planar imaging (EPI) sequence (TR/TE: 1000/30 ms; multi-band acceleration factor: 5; flip angle: 60°; 104 × 104 matrix; voxel size: 2 mm³; 62 slices). Additionally, field maps were acquired in opposite phase encoding directions as echo planar spin-echo (epse) sequences (TR/TE: 6629/60 ms; flip angle: 90°; 104 × 104 matrix; voxel size: 2 mm3; 65 slices) to estimate the displacement map for susceptibility distortion correction.
Preprocessing
Images were processed using fMRIPrep v22.0.227. For more detailed information about the fMRI preprocessing, please refer to the Supplementary Materials S2. Briefly, spatial normalization of the T1-weighted image to standard MNI space was performed through nonlinear registration. The processing of the BOLD timeseries consisted of head-motion estimation, slice time correction, and susceptibility distortion correction utilizing two spin echo field maps of opposite phase encoding directions. The processed BOLD timeseries were then resampled in their native space in a single interpolation step, and finally, resampled into standard MNI space, generating the spatially normalized, preprocessed BOLD runs. Spatial smoothing was performed with a Gaussian kernel of 6 mm FWHM prior to automated removal of motion artifacts with independent component analysis (ICA-AROMA)28. Volumes with excessive motion, as determined by DVARS from the FSL motion outliers’ tool (with default settings), were included as confound regressors in the first-level analyses described below.
ROI and mask creation
ROIs were defined using term-based meta-analytic activation maps from Neurosynth (www.neurosynth.org), further constrained using probabilistic anatomical masks from the Harvard-Oxford and Jülich atlases (thresholded at 50%) to improve anatomical specificity. This process yielded bilateral ROIs for EBA, FBA, TPJ, PMC, SPL, and V1. Final ROI hemisphere-specific voxel counts, MNI-space centres of mass, Neurosynth search terms, and thresholded atlas regions are provided in Table 1. All 12 ROIs were then merged into a single mask for subsequent analyses (see Fig. 3).
Data analysis
BSE measurements
Body measurements from Somatomap 3D were converted into centimetres from arbitrary units using a piecewise linear interpolation approach based on pre-measured physical dimensions of the reference 3D avatar, as previously described14. Specifically, each body-part slider was mapped to centimetre values using independent linear scaling functions defined between the minimum, midpoint, and maximum slider positions. For each body part, a BSE percent error score was calculated by subtracting the actual physical measurement from one’s estimated body part size obtained from Somatomap and dividing by the actual measurement. This method allows for direct comparisons across individuals and body parts of varying sizes by expressing estimation error relative to the actual measurement. This is desirable given some body parts are larger, and thus, are prone to larger deviations in raw centimetre error according to general psychophysical principles like Weber’s Law29.
fMRI modelling and within-mask analyses
Using mouse click timestamps, BSE size adjustment events were defined in the time-series as the periods when the participant was adjusting the avatar body size. Rotation events were defined in the time-series as periods when the participant was rotating the position of the avatar. The contrast included intervening periods between these rotations and size adjustments, when participants used the trackball to navigate between body parts (see Supplementary Materials S1 for additional details). These intervals control for low-level motor activity and visuospatial processing demands associated with body part selection, isolating perceptual and decisional components of the BSE task.
Preprocessed fMRI data were analyzed using FSL FEAT (v6.0.4). At the first level, a single general linear model (GLM) was constructed that included all task events and several parametric regressors. Events included (i) BSE size adjustment and (ii) rotation events, both modeled with durations equal to the event-specific response time (RT), which were combined at the second level to construct the (iii) ‘avatar engagement’ events. As part of our primary aim, we focused on the avatar engagement periods, as the self-paced, continuous nature of Somatomap 3D often results in BSE and rotation processes unfolding in close succession, making a combination of the two an integrated measure of task engagement.
Parametric regressors included (i) BSE size adjustment events modulated by the demeaned BSE percent error for the event (ii) BSE size adjustment events modulated by the demeaned RT, and (iii) rotation events modulated by the demeaned RT. All parametric regressors were modeled as event-related and with durations matched to trial-specific RT. The BSE percent error regressor was included as the primary hypothesis-driven regressor, whereas RT-based parametric regressors were included to account for variability in trial duration. Because these RT- and rotation-based modulators served only as nuisance regressors to account for task-structure variability, rather than testing specific hypotheses, their associated results are reported in the Supplementary Materials for completeness. Together, the avatar-engagement contrast and the BSE percent error parametric modulator addressed the two components of our primary aim: characterizing neural activation during BSE engagement and testing whether activation scaled with estimation accuracy.
At the group level, analyses were conducted using FSL’s FLAME 1 (FMRIB’s Local Analysis of Mixed Effects, stage 1), with sex included as a covariate to account for differences in the avatar model across sexes. (All participants were presented with the avatar model that aligned with sex assigned at birth.) Within-mask analyses constituted our primary analytic approach. Contrast estimates from the first-level GLM were entered into the group-level analysis, and statistical inference was performed using a cluster-based thresholding approach, with clusters defined at Z > 3.1 and a family-wise error (FWE)-corrected cluster significance threshold of p < .05 applied across the entire mask to control for multiple comparisons.
Multidimensional scaling (MDS) of BSE accuracy calculations
To capture inter-individual differences in BSE across 26 body parts in a parsimonious manner, we applied MDS, an unsupervised, data-driven machine learning technique that identifies latent dimensions underlying latent patterns of dissimilarity—in this case, differences in BSE percent error—by positioning each participant in a low-dimensional space based on their distance from others. We used the smacof package in R30,31. The smacof algorithm was chosen for its flexible implementation and reliable convergence properties, offering improved stability and fit compared to classical MDS algorithms32. Dissimilarity was quantified as pairwise Euclidean distances across body part errors, and the MDS algorithm positioned participants to best preserve these distances in the output space. This yielded latent dimensions that captured dominant patterns of variation in BSE distortion. Because participants interacted with gender-specific base avatars (i.e., male- or female- presenting), and prior work has indicated that physical characteristics like height and weight can explain unique variance in BSE beyond body mass index alone14, participants’ height, weight, body mass index, and gender were regressed out of percent error scores to control confounding effects prior to MDS. MDS was then conducted on the residuals, producing participant coordinates along each dimension.
Associations between MDS and BSE-related brain activation
As part of our secondary aim to test whether inter-individual differences in BSE accuracy were associated with BSE-sensitive neural responses, we derived subject-level, ROI-specific estimates of task-related activation to serve as input for correlational analyses with MDS subdimension scores. We extracted the mean of the first principal eigenvariate of the BOLD time series associated with BSE size-adjustment events (i.e., the body-part estimation periods) from voxels within each ROI, separately. We reasoned that these events, excluding rotation events, more directly index the perceptual estimation process captured by the MDS dimensions. Eigenvariates captures the dominant pattern of task-related signal change within each ROI, providing a summary index of activation intensity across voxels during body estimation. We selected the eigenvariate approach because alternatives such as percentage signal change are based on homogenous averaging across voxels33, which could “dilute” the signal of interest when an ROI contains both activated and deactivated voxels. Pearson correlations were performed between MDS subdimension scores and eigenvalues, with the false discovery rate (FDR) adjusted using the Benjamini-Hochberg procedure for six comparisons (FBA, EBA, SPL, PMC, TPJ, and V1).
Exploratory whole-brain analyses
Complementing the within-mask analyses, we conducted an exploratory whole-brain analysis using the same regressors described above. Group-level statistical maps were generated with the same cluster-based correction (Z > 3.1; FWE p < .05), applied across the entire brain. These results were interpreted descriptively to identify additional task-related regions outside the a priori ROIs.
Results
Behavioural results
Across all participants’ BSE trials (n = 2,852 total), the mean trial duration was 3.28 s (SD = 2.43), with a median of 2.68 s and an interquartile range (IQR) of 2.64 s. Interstimulus interval (ISI) durations were defined as the time elapsed between consecutive events. The mean ISI duration was 4.14 s (SD = 3.24), with a median of 3.82 s and an interquartile range (IQR) of 3.47 s. For further details concerning behavioural performance, see Supplementary Materials S3. As can be visualized in radar plots (see Fig. 2), participants tended to overestimate some body parts (16/26), and underestimated others (10/26).
Radar chart depicting mean body size estimation (BSE) inaccuracy (percent error) across 26 body parts, arranged clockwise in the order of overestimation to underestimation. Positive values indicate mean overestimation of body part size, negative values indicate mean underestimation, and zero (dotted line) denotes perfect accuracy.
Within-mask results
The ROI analysis showed significant activation for avatar engagement in the right FBA, right SPL, bilateral PMC, and bilateral EBA (see Table 2; Fig. 3). No significant activity was detected in the TPJ or V1.
Parametric modulation analyses within the ROI mask showed no significant effects for BSE percent error. Results from other event-related and parametric regressors are reported in the Supplement (see Supplementary Materials S4).
Statistical maps showing significant activation clusters and ROI mask visualizations. Panel (A) Colour-coded ROI mask: red = extrastriate body area, orange = fusiform body area, blue = primary visual cortex, gold = temporoparietal junction, pink = superior parietal lobule, brown = premotor cortex. These regions were merged into a single mask for the ROI analysis. Panel (B) Regions showing increased BOLD response during avatar engagement, thresholded at z > 3.1. A dark red-to-yellow colour scale indicates increasing z-score magnitude. Maps are displayed on a glass brain.
MDS results
A three-dimensional MDS solution was selected to balance model fit and interpretability. Stress values, which index how well the low-dimensional configuration preserves the original inter-region dissimilarities (with lower values indicating better fit), decreased from 0.40 in one dimension to 0.25 in two dimensions and 0.17 in three, with further reductions tapering off thereafter (e.g., stress = 0.13 in 4D, 0.10 in 5D), supporting a three-dimension solution as a reasonable approximation (see Supplementary Figure S6). We interpreted the three dimensions by computing Pearson correlations between each participant’s subdimension scores and their estimation error for each body part. Because MDS subdimensions do not have intrinsic directionality, meaning that the axis can be flipped in sign without altering the MDS solution, this step was used to determine whether any dimension aligned systematically with a specific error profile. Subdimension 1 was positively correlated with errors for girth-related body parts across the avatar (e.g., neck girth, hip size, arm girth etc.; r range 0.30 to 0.86) (see Supplementary Figure S7). In contrast, subdimensions 2 and 3 showed more heterogeneous patterns, with moderate correlations involving both length and height-based body parts. Given its relevance for potential future clinical applications in eating disorders and body dysmorphic disorder, often characterized by concerns about body parts girths being too large (i.e. “fat”), we used subdimension 1 for the analysis with BSE-related brain activation.
MDS associations with BSE-related brain activation results
A significant negative correlation was detected between SPL activity and MDS subdimension 1 values (r = −.50, p =.006, FDRadjusted q = 0.038) (see Fig. 4). No significant correlations were detected between subdimension 1 values and EBA (r = −.29, p =.130, FDRadjusted q = 0.260), FBA (r = −.24, p =.212, FDRadjusted q = 0.318), PMC (r = −.19, p =.342, FDRadjusted q = 0.410), TPJ (r = −.03, p =.880, FDRadjusted q = 0.599), or V1 (r = −.30, p =.126, FDRadjusted q = 0.260) activity.
Scatterplot depicting a negative relationship between SPL mean BOLD signal (first eigenvariate) during BSE and MDS subdimension 1 values. On the right, three views of the SPL ROI from which the BOLD signal (averaged right and left) was extracted. MDS = multidimensional scaling, SPL = superior parietal lobule, BOLD = blood-oxygen-level dependent, L = left, R = right.
Exploratory whole-brain analyses
The whole-brain analysis for the avatar engagement contrast revealed significant activation across motor, visual, somatosensory, fronto-insular and cerebellar areas, spanning 9 clusters (see Table 3; Fig. 5). Clusters were interpreted based on the Harvard-Oxford and Jülich Histological Atlas labels. The largest clustered in the right parietal operculum and extended dorsally into the superior and inferior parietal lobules. The second largest peak was observed in the right IFG and encompassed the ventral PMC, insula, and frontal operculum. A smaller cluster was identified in the same cortical territory in the left hemisphere. Bilateral occipitotemporal clusters were detected: one spanning right EBA, FBA, and another confined to left EBA. Activation was also observed in the medial superior frontal gyrus, including peaks in the supplementary motor area (SMA) and dorsal PMC. Additional clusters were detected in left IPL and left cerebellum, with peaks in lobule VI and lobule VIIb. No significant activity was associated with BSE percent error. Results from other event-related and parametric regressors are reported in the Supplement (see Supplementary Materials S4).
Statistical maps showing significant activation clusters from exploratory whole-brain analysis. Regions showing increased BOLD response during avatar engagement, thresholded at z > 3.1. A dark red-to-yellow colour scale indicates increasing z-score magnitude. Maps are displayed on a glass brain. The ROI-mask (light blue) is overlaid to show regions in whole-brain analysis that were outside the a-priori mask.
Discussion
This study aimed to characterize the neural activation patterns associated with BSE, and with BSE accuracy, in healthy individuals, using an interactive 3D avatar fMRI task. Consistent with our hypotheses, avatar engagement was associated with bilateral EBA, bilateral PMC, right FBA, and right SPL activation, but, contrary to our hypotheses, not TPJ or V1 activation. While no ROI, or region from the whole-brain analysis, showed significant modulation by trial-wise BSE accuracy, BOLD signal in the SPL was significantly associated with inter-individual differences in BSE error patterns, as indexed by MDS. Exploratory whole-brain analyses revealed broader task-related engagement across visuospatial, sensorimotor, and salience regions, and recapitulated the ROI results, demonstrating those regions’ robustness to whole-brain FWE correction. This study provides evidence for a functional map of brain regions engaged during whole, and body-part specific, size estimation, providing a reference point for future clinical research.
Task-evoked ROI engagement
We hypothesized that six ROIs, previously implicated in visual and multisensory body representation, EBA, FBA, SPL, PMC, TPJ, and V1, would be activated during BSE-related engagement with the avatar. Four of these—bilateral EBA, right FBA, right SPL, and bilateral PMC—were significantly engaged. This activation pattern aligns with the task’s dual demands: participants needed to make localized adjustments to specific body parts while simultaneously maintaining a coherent global body configuration to accurately contextualize proportions of sizes and shapes. The EBA and FBA support part-based and whole-body form perception, respectively34,35,36. The PMC, in the context of this task, may facilitate internal motor simulation during avatar manipulation24, while the SPL may support visuospatial mapping of body features37.
By contrast, the TPJ and V1 were not significantly activated during the task. The TPJ, frequently implicated in perspective-taking, spatial self-location, and multisensory integration21,38, may not have been substantially engaged due to the third-person (allocentric) vantage point from which participants viewed the avatar. Rather than adopting the avatar’s perspective (egocentric)—critical for evoking TPJ activity21,39,40,41, participants adjusted its form from an external viewpoint. The primary visual cortex processes the retinotopic and perceived size of objects, spatial frequency, and other low-level features such as contrast and orientation42,43,44. While V1 likely contributes to initial size encoding, its activity may not have been prominent in the Somatomap 3D task, which emphasizes higher-order visual comparisons and spatial reasoning. These cognitive demands may be more effectively supported by downstream regions involved in body perception (EBA, FBA), visuospatial integration (SPL), and motor processes (PMC).
Inter-individual and trial-wise associations between BSE accuracy and neural activity
A key finding was that bilateral SPL activity during BSE was significantly associated with inter-individual differences in BSE accuracy, as indexed by MDS. This result stands in contrast to the nonsignificant findings from parametric modulation analyses, which assess trial-by-trial fluctuations in BOLD signal relative to behavioural metrics. This suggests a dissociation between the neural correlates of stable, trait-like differences in body representation and the transient processing of accuracy at the level of individual body parts.
The SPL–MDS association reflects a between-subjects relationship: the degree to which an individual consistently engages the SPL during BSE relates to the structure of their estimation error profile across body parts, rather than to estimation accuracy per se. This could be due to the SPL’s involvement in global integrative functions— such as constructing and updating a spatial model of the body—rather than encoding transient estimation errors. Consistent with this role, the SPL integrates proprioceptive, tactile, and visual inputs45 to maintain a dynamic, body-centered reference frame46,47 and is implicated in spatial cognition and multisensory body schema37. Within the context of Somatomap, SPL engagement may reflect efforts to align internally stored body representations with visual feedback from the manipulated avatar48.
The absence of significant modulation by trial-wise BSE accuracy may be due to the heterogeneity of neural processing across different body parts. Prior studies suggest that distinct cortical territories represent specific body parts in a topographically organized manner (e.g., the somatosensory homunculi), and even body-selective visual regions may exhibit differential sensitivity depending on body part identity49,50. Thus, treating estimation accuracy across all body parts as repeated instances of the same phenomenon may have obscured region- or body-part-specific relationships. An alternative, future design could involve presenting multiple trials for each body part to allow body-part-specific modelling of accuracy-related signal, enabling more precise tests of regionally distinct neural contributions.
Notably, none of the other regions found to demonstrate significant task engagement—EBA, FBA, or PMC—showed significant correlations with individual differences in perceptual accuracy. This suggests that while these regions contribute to shared perceptual and motor imagery processes required during avatar engagement, they may not process stable, individualized representations of body size in the same manner as SPL.
Exploratory whole-brain analyses
Although the present study included hypothesis-driven ROI analyses motivated by prior work, exploratory whole-brain analyses suggest engagement of distributed networks and regions during the task. These included components of a visuospatial network (e.g., SPL)51, components of a sensorimotor network (e.g., PMC, cerebellum)24,52, body-selective visual regions (e.g., EBA, FBA)35,36, and components of the salience network (e.g., insula, frontal operculum)53. Several of these regions—including the SPL, PMC, EBA, and FBA—overlapped with predefined ROIs, lending convergent support for their involvement in avatar engagement during the task.
Widespread activation was observed across parietal and frontal regions associated with both visuospatial and sensorimotor networks. Visuospatial regions included IPL, SPL, and postcentral gyrus—components of the multisensory body schema network, theorized to be involved in integrating proprioceptive, tactile, and visual input37. Sensorimotor-related activation included the dorsal PMC and SMA, regions associated with internally guided motor simulation and visuomotor transformation23,24,54. Posterior cerebellar lobules VI and VIIb were also engaged, consistent with their roles in sensorimotor coordination and cross-network integration involving attention and visual processing55,56.
Body-selective and higher-order visual areas were engaged, including bilateral EBA, and right-lateralized FBA, largely overlapping with ROI analyses.
Lastly, activation was observed in regions commonly associated with salience-related processing, including the insula and frontal operculum. The insula, particularly its anterior portion, supports interoceptive awareness and affective salience57,58, while the adjacent frontal operculum has been identified in sustained vigilance and monitoring59. These areas may support self-monitoring processes engaged during avatar adjustment, such as evaluating the match between visual input and internal body representation.
Taken together, from these exploratory results, we might speculate that avatar engagement recruits a distributed network that integrates perceptual, spatial, motor, and self-monitoring processes. While interpretations should remain cautious due to the exploratory nature of these analyses, the overlap with key ROI findings—particularly in the SPL, PMC, and EBA—provides convergent support for the core regions implicated in body estimation.
Somatomap task performance
Participants showed systematic variation in both the magnitude and direction of estimation error across body parts. Percent error tended to be positive, indicating a general tendency to overestimate body part sizes, with the most pronounced overestimation observed for upper arm girth. However, consistent underestimation was observed for regions near the midsection, including torso length, abdomen protrusion, waist size, and hip size. This diverges from prior findings in clinical populations. For example, individuals with anorexia nervosa have demonstrated marked overestimation of midsection size using an earlier version of the Somatomap tool14. In contrast, both Ralph-Nearman and colleagues14 and Karsan and colleagues6 found that healthy controls tended to underestimate waist and hip size while overestimating torso length. Thus, underestimation of waist and hip size appears to be a consistent finding across non-clinical samples.
Several methodological factors may account for these differences in body part estimation accuracy across studies that used Somatomap. First, the present study examined percent error, rather than centimetre error, which may better reflect perceptual accuracy given that just-noticeable differences for size scale proportionally with the magnitude of the stimulus29. Second, our study used an updated avatar that is smoother, has higher resolution, is more realistic, and has 26 adjustable body parts compared with 23 in earlier versions—including separately defined hip size and hip width. Additionally, the previous Somatomap 3D but not the current version matched skin and hair colour. Presenting a more minimal and abstract representation of the body, as in the previous Somatomap avatar, may alter reference cues used during estimation.
On average, participants spent approximately 3.3 s adjusting each body part, with interstimulus intervals averaging 4.1 s (see Supplementary Figure S5 for full distributions). These durations suggest that participants had sufficient time for deliberate perceptual judgments. These patterns suggest that healthy individuals make body-part-specific adjustments that likely reflect both perceptual tendencies—for example, greater salience or uncertainty around the torso—and structural features of the task, such as its top-down progression from head to feet.
Limitations, future directions, and clinical implications
Several limitations in this study should be noted. The Somatomap 3D fMRI task was adapted from the previously-developed non-fMRI behavioural task, whose utility in quantifying BSE accuracy had been demonstrated in non-clinical samples (fashion models and non-fashion model controls14, as proof-of-concept) and had been shown to have sensitivity to detect patterns of aberrant BSE accuracy in individuals with anorexia nervosa5 and body dysmorphic disorder6 compared with healthy controls. In retaining most of the structure of the original task, it remains a continuous task (rather than having experiment-set discrete trials) and is self-paced. Yet, this partially unconstrained structure reduces experimental control and limits the ability to isolate discrete cognitive processes, e.g., perception vs. decision-making, which are likely intermingled during each body part trial. However, we partially mitigated concerns about variable trial and rotation durations and by using trial-wise response time and rotation duration as regressors. This allowed us to account for time-on-task effects and better isolate signal associated with estimation accuracy.
Similarly, participants completed a brief practice version of the Somatomap task prior to scanning to ensure they could navigate the interface and perform the size-adjustment procedure. Although such familiarization is necessary for complex interactive fMRI tasks of this kind, it raises the possibility that some activation may reflect re-engagement of recently practiced perceptual or visuospatial strategies in addition to task-evoked processes. Because practice-related retrieval effects are difficult to fully eliminate in tasks requiring some motor–visuospatial proficiency, this should be considered an inherent limitation of the paradigm. Future work could vary the extent or timing of pre-scan familiarization to assess its influence on neural activation patterns. Additional considerations include the relatively modest sample size (N = 28), which may have limited statistical power. While our analytic approach addressed sources of variability, the sample size may have constrained sensitivity to detect smaller magnitude effects.
Future studies may benefit from other data-driven approaches to characterize the full neural architecture underlying BSE. Although the present study used univariate analyses to establish the neural correlates of BSE, multivariate approaches such as representational similarity analysis60 or machine learning could provide additional insight into whether distinct cortical patterns encode specific body parts or estimation tendencies (e.g., over- vs. underestimation). Implementing such methods in the current dataset poses several analytic challenges, notably the single-trial structure for each body part, heterogeneity in event durations, and limited sample size. Thus, developing multivariate models for Somatomap data is a promising future direction, with a different task design, which could potentially leverage body-segment grouping or representational geometry, to complement the present, proof-of-principle findings.
In terms of potential clinical applications, results from this study suggest that regions including the SPL, FBA, EBA, and PMC could be of interest in investigating neurobiological substrates of body image disturbances in those with eating disorders and body dysmorphic disorder. Among these, the SPL stands out for potential clinical relevance as its activity was associated with individual differences in BSE error profiles. This aligns with prior work implicating the posterior parietal cortex in body image distortion in anorexia nervosa22,61. Future studies could build on these findings by having individuals with anorexia nervosa complete this version of the Somatomap 3D task during fMRI. Alternatively, variations of the task could be developed to help discern specific cognitive processes. For instance, one variant could present brief, experimenter-controlled trials with fixed durations to better isolate perceptual and decision-related components. Outcomes such as atypical degree, spatial location, or timing of activation patterns, differences in sensitivity to perceptual uncertainty, or differences related to spatial transformation demands could help clarify how body image disturbances emerge and persist. Finally, future studies that characterize the neural correlates of BSE in large populations could help establish normative models, which could aim to quantify individual deviations from typical brain function and that could be stratified by, e.g. demographic or body morphometric factors such as height and weight62.
Conclusions
This study used a novel 3D avatar task to probe functional activation patterns associated with BSE, identifying a distributed neural system including visual, parietal, premotor, and self-monitoring regions. The SPL emerged as a key region associated with inter-individual differences in estimation accuracy, which potentially could contribute to perceptual body image distortions, suggesting significance for future clinical investigations. These findings advance efforts toward understanding the brain basis of BSE and offer specific targets for investigating altered body representation in clinical populations characterized by body image disturbances.
Data availability
The data that support the findings of this study are openly available in the NIMH data archive at [https://nda.nih.gov/edit_collection.html?id=3607], reference number [3607].
References
Feusner, J. & Cazzato, V. Body Image. in The Handbook of the Neurobiology of Eating Disorders (eds Wierenga, E. by C. E. & Steinglass, J. E.) (Oxford University Press, Oxford, New York).
Cash, T. F. & Deagle, I. I. I. The nature and extent of body-image disturbances in anorexia nervosa and bulimia nervosa: A meta-analysis. Int. J. Eat. Disord. 22, 107–126 (1997).
Farrell, C., Lee, M. & Shafran, R. Assessment of body size estimation: a review. Eur. Eat. Disorders Rev. 13, 75–88 (2005).
Gardner, R. M. & Brown, D. L. Body size Estimation in anorexia nervosa: a brief review of findings from 2003 through 2013. Psychiatry Res. 219, 407–410 (2014).
Ralph-Nearman, C. et al. Visual mapping of body image disturbance in anorexia nervosa reveals objective markers of illness severity. Sci. Rep. 11, 12262 (2021).
Karsan, S., Diaz-Fong, J. P., Ly, R., Hellemann, G. & Feusner, J. D. Quantifying body size Estimation accuracy and body dissatisfaction in body dysmorphic disorder using a digital avatar. Compr. Psychiatr. 135, 152529 (2024).
Mölbert, S. C. et al. Depictive and metric body size Estimation in anorexia nervosa and bulimia nervosa: A systematic review and meta-analysis. Clin. Psychol. Rev. 57, 21–31 (2017).
Uher, R. et al. Functional neuroanatomy of body shape perception in healthy and Eating-Disordered women. Biol. Psychiatry. 58, 990–997 (2005).
Hamamoto, Y. et al. Neural mechanisms of perceptual and affective body-image disturbance during own-body and ideal-body Estimation. Behav. Brain. Res. 444, 114349 (2023).
Castellini, G. et al. Looking at my body. Similarities and differences between anorexia nervosa patients and controls in body image visual processing. Eur. Psychiatry. 28, 427–435 (2013).
Miyake, Y. et al. Brain activation during the perception of distorted body images in eating disorders. Psychiatry Research: Neuroimaging. 181, 183–192 (2010).
Gao, X. et al. My body looks like that girl’s: body mass index modulates brain activity during body image Self-Reflection among young women. PLoS ONE. 11, e0164450 (2016).
Owens, T. E., Allen, M. D. & Spangler, D. L. An fMRI study of self-reflection about body image: sex differences. Pers. Indiv. Differ. 48, 849–854 (2010).
Ralph-Nearman, C. et al. A novel mobile tool (Somatomap) to assess body image perception pilot tested with fashion models and nonmodels: Cross-Sectional study. JMIR Mental Health. 6, e14115 (2019).
Cornelissen, K. K., McCarty, K., Cornelissen, P. L. & Tovée, M. J. Body size Estimation in women with anorexia nervosa and healthy controls using 3D avatars. Sci. Rep. 7, 1–15 (2017).
Ralph-Nearman, C. et al. Test-retest reliability of a 2-dimensional and 3-dimensional visual assessment of body image disturbance in anorexia nervosa. MedRxiv 2023.06.14.23291397 https://doi.org/10.1101/2023.06.14.23291397 (2023).
Downing, P. E. & Peelen, M. V. The role of occipitotemporal body-selective regions in person perception. Cogn. Neurosci. 2, 186–203 (2011).
Hummel, D. et al. Neural adaptation to thin and fat bodies in the fusiform body area and middle occipital gyrus: an fMRI adaptation study. Hum. Brain Mapp. 34, 3233–3246 (2013).
Urgesi, C., Candidi, M., Ionta, S. & Aglioti, S. M. Representation of body identity and body actions in extrastriate body area and ventral premotor cortex. Nat. Neurosci. 10, 30–31 (2007).
Vocks, S. et al. Neural correlates of viewing photographs of one’s own body and another woman’s body in anorexia and bulimia nervosa: an fMRI study. J. Psychiatry Neurosci. 35, 163–176 (2010).
Blanke, O. et al. Linking out-of-body experience and self processing to mental own-body imagery at the temporoparietal junction. J. Neurosci. 25, 550–557 (2005).
Gaudio, S. & Quattrocchi, C. C. Neural basis of a multidimensional model of body image distortion in anorexia nervosa. Neurosci. Biobehavioral Reviews. 36, 1839–1847 (2012).
Haggard, P. Human volition: towards a neuroscience of will. Nat. Rev. Neurosci. 9, 934–946 (2008).
Hanakawa, T., Dimyan, M. A. & Hallett, M. Motor planning, imagery, and execution in the distributed motor network: a time-course study with functional MRI. Cereb. Cortex. 18, 2775–2788 (2008).
Schwarzkopf, D. S., Song, C. & Rees, G. The surface area of human V1 predicts the subjective experience of object size. Nat. Neurosci. 14, 28–30 (2011).
Groe, K., Calahong, B., Diaz-Fong, J. P., Karsan, S. & Feusner, J. Twenty-Six Body Measurements for the Somatomap App. protocols.io (2025). https://www.protocols.io/view/twenty-six-body-measurements-for-the-somatomap-app-c8zvzx66
Esteban, O. et al. fMRIPrep: a robust preprocessing pipeline for functional MRI. Nat. Methods. 16, 111–116 (2019).
Pruim, R. H. R. et al. ICA-AROMA: A robust ICA-based strategy for removing motion artifacts from fMRI data. NeuroImage 112, 267–277 (2015).
Fechner, G. Elements of Psychophysics. Vol. I.New York, Holt, Rinehart and Winston, (1966).
Leeuw, J. D. & Mair, P. Multidimensional scaling using majorization: SMACOF in R. J Stat. Soft 31, 1-30 (2009).
Mair, P., Groenen, P. J. F. & de Leeuw, J. More on multidimensional scaling and unfolding in R: Smacof version 2. J. Stat. Softw. 102, 1–47 (2022).
de Leeuw, J. Application of Convex Analysis to Multidimensional Scaling. Recent Developments in Statistics (1977).
Friston, K. J., Rotshtein, P., Geng, J. J. & Sterzer, P. Henson, R. N. A critique of functional localisers. Neuroimage 30, 1077–1087 (2006).
Downing, P. E., Jiang, Y., Shuman, M. & Kanwisher, N. A cortical area selective for visual processing of the human body. Science 293, 2470–2473 (2001).
Taylor, J. C., Wiggett, A. J. & Downing, P. E. Functional MRI analysis of body and body part representations in the extrastriate and fusiform body areas. J. Neurophysiol. 98, 1626–1633 (2007).
Peelen, M. V. & Downing, P. E. The neural basis of visual body perception. Nat. Rev. Neurosci. 8, 636–648 (2007).
Medendorp, W. P. & Heed, T. State Estimation in posterior parietal cortex: distinct Poles of environmental and bodily States. Prog Neurobiol. 183, 101691 (2019).
Tsakiris, M., Hesse, M. D., Boy, C., Haggard, P. & Fink, G. R. Neural signatures of body ownership: a sensory network for bodily self-consciousness. Cereb. Cortex. 17, 2235–2244 (2007).
Martin, A. K., Huang, J., Hunold, A. & Meinzer, M. Dissociable roles within the social brain for Self-Other processing: A HD-tDCS study. Cereb. Cortex. 29, 3642–3654 (2019).
Martin, A. K., Kessler, K., Cooke, S., Huang, J. & Meinzer, M. The right temporoparietal junction is causally associated with embodied Perspective-taking. J. Neurosci. 40, 3089–3095 (2020).
Wang, H., Callaghan, E., Gooding-Williams, G., McAllister, C. & Kessler, K. Rhythm makes the world go round: an MEG-TMS study on the role of right TPJ theta oscillations in embodied perspective taking. Cortex 75, 68–81 (2016).
Murray, S. O., Boyaci, H. & Kersten, D. The representation of perceived angular size in human primary visual cortex. Nat. Neurosci. 9, 429–434 (2006).
Ng, J., Bharath, A. A. & Zhaoping, L. A Survey of Architecture and Function of the Primary Visual Cortex (V1). EURASIP J. Adv. Signal Process. 1–17 (2006). (2007).
Sperandio, I., Chouinard, P. A. & Goodale, M. A. Retinotopic activity in V1 reflects the perceived and not the retinal size of an afterimage. Nat. Neurosci. 15, 540–542 (2012).
Schwoebel, J. & Coslett, H. B. Evidence for multiple, distinct representations of the human body. J. Cogn. Neurosci. 17, 543–553 (2005).
Longo, M. R. & Haggard, P. An implicit body representation underlying human position sense. Proc. Natl. Acad. Sci. U S A. 107, 11727–11732 (2010).
Zacks, J. M. Neuroimaging studies of mental rotation: a meta-analysis and review. J. Cogn. Neurosci. 20, 1–19 (2008).
Blanke, O., Slater, M., Serino, A. & Behavioral Neural, and computational principles of bodily Self-Consciousness. Neuron 88, 145–166 (2015).
Bracci, S., Caramazza, A. & Peelen, M. V. Representational similarity of body parts in human occipitotemporal cortex. J. Neurosci. 35, 12977–12985 (2015).
Orlov, T., Makin, T. R. & Zohary, E. Topographic representation of the human body in the occipitotemporal cortex. Neuron 68, 586–600 (2010).
Whitlock, J. R. Posterior parietal cortex. Curr. Biol. 27, R691–R695 (2017).
Stoodley, C. J., Valera, E. M. & Schmahmann, J. D. Functional topography of the cerebellum for motor and cognitive tasks: an fMRI study. NeuroImage 59, 1560–1570 (2012).
Schimmelpfennig, J., Topczewski, J., Zajkowski, W. & Jankowiak-Siuda, K. The role of the salience network in cognitive and affective deficits. Front Hum. Neurosci 17, 1-9 (2023).
Nachev, P., Kennard, C. & Husain, M. Functional role of the supplementary and pre-supplementary motor areas. Nat. Rev. Neurosci. 9, 856–869 (2008).
Ji, J. L. et al. Mapping the human brain’s cortical-subcortical functional network organization. Neuroimage 185, 35–57 (2019).
Xue, A. et al. The detailed organization of the human cerebellum estimated by intrinsic functional connectivity within the individual. J. Neurophysiol. 125, 358–384 (2021).
Craig, A. D. B. How do you feel–now? The anterior Insula and human awareness. Nat. Rev. Neurosci. 10, 59–70 (2009).
Simmons, W. K. et al. Keeping the body in mind: Insula functional organization and functional connectivity integrate interoceptive, exteroceptive, and emotional awareness. Hum. Brain Mapp. 34, 2944–2958 (2013).
Sadaghiani, S. et al. Intrinsic connectivity Networks, alpha Oscillations, and tonic alertness: A simultaneous Electroencephalography/Functional magnetic resonance imaging study. J. Neurosci. 30, 10243–10250 (2010).
Kriegeskorte, N., Mur, M. & Bandettini, P. A. Representational similarity analysis - connecting the branches of systems neuroscience. Front Syst. Neurosci 2, 1-28 (2008).
Mohr, H. M. et al. Separating two components of body image in anorexia nervosa using fMRI. Psychol. Med. 40, 1519–1529 (2010).
Marquand, A. F., Rezek, I., Buitelaar, J. & Beckmann, C. F. Understanding heterogeneity in clinical cohorts using normative models: beyond Case-Control studies. Biol. Psychiatry. 80, 552–561 (2016).
Acknowledgements
We would like to thank Bea Calahong and Katalin Groe for their help collecting data for this study. This research was enabled in part by support provided by Compute Ontario (computeontario.ca) and the Digital Research Alliance of Canada (alliancecan.ca). The Software used in this research was created by Jamie D. Feusner, Armen C, Arevian, and Nanthia Suthana (UCLA); and Sahib S. Khalsa and Christina Ralph-Nearman (Laureate Institute for Brain Research). © 2023 UCLA. Originally published in JMIR Mental Health (http://mental.jmir.org), 29.10.2019. Ralph-Nearman, C., Arevian, A. C., Puhl, M., Kumar, R., Villaroman, D., Suthana, N., Feusner, J. D., & Khalsa, S. S. (2019). A Novel Mobile Tool (Somatomap) to Assess Body Image Perception Pilot Tested With Fashion Models and Nonmodels: Cross-Sectional Study. JMIR mental health, 6(10), e14115. https://doi.org/10.2196/14115.
Funding
Open access funding provided by Karolinska Institute. Funding for this study is provided by the National Institute of Mental Health (NIMH) grant number: 3R01MH121520-04S1 (Feusner) and the Centre for Addiction and Mental Health Discovery Fund (Feusner).
Author information
Authors and Affiliations
Contributions
**HJP** Formal analysis, Visualization, Writing – original draft, Writing – review and editing. **JPD** Data curation, Formal analysis, Investigation, Visualization, Writing – original draft, Writing – review and editing. **SK** Data curation, Writing – review and editing. **RK** Software, Writing – review and editing. **GH** Methodology, Writing – review and editing. **JDF** Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Writing – review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval
This study was approved by the Centre for Addiction and Mental Health Research Ethics Board (075-2021, 2022).
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Peel, H.J., Diaz-Fong, J.P., Karsan, S. et al. Functional brain mapping of body size estimation using a 3D avatar. Sci Rep 16, 4750 (2026). https://doi.org/10.1038/s41598-026-38383-0
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-026-38383-0




