Introduction

Diabetes mellitus is a pervasive and escalating global health challenge, currently affecting approximately 589 million adults aged 20–79 years, corresponding to 11.1% of the global adult population1. This number is projected to exceed 850 million by 2050, underscoring the urgent need for effective prevention and management strategies2. Among its numerous systemic complications, biomechanical alterations in plantar soft tissues are of particular clinical relevance due to their essential roles in weight-bearing, locomotion, and the prevention of foot ulceration3,4,5,6. These alterations are primarily driven by the pathological accumulation of advanced glycation end products (AGEs)7,8, which induce aberrant collagen cross-linking, reduced microvascular perfusion, and extracellular matrix remodeling, including changes in elastin fibers and glycosaminoglycans4,6,9. Collectively, these changes lead to increased tissue stiffness, impaired load distribution, and heightened susceptibility to pressure-related injuries3,10. This biomechanical deterioration is further exacerbated by peripheral neuropathy, which compromises protective sensation and significantly contributes to the development of diabetic foot ulcers, a major source of morbidity and healthcare burden11,12,13.

The human plantar surface comprises multilayered soft tissues, including skin, the heel fat pad, the plantar fascia, and the intrinsic foot muscles (IFMs). Each of these structures plays a distinct role in shock absorption, balance, and propulsion14,15. In individuals with diabetes, however, these tissues frequently undergo region-specific degeneration. Reduced skin elasticity and thinning are commonly observed, particularly in high-pressure regions such as the metatarsal heads and heel6,12,16. The fat pad may become fragmented and fibrotic, reducing its cushioning capacity17, while the plantar fascia often thickens and loses compliance18,19. Furthermore, IFMs, notably the abductor hallucis and flexor digitorum brevis, are prone to atrophy and fatty infiltration, leading to compromised arch support and postural control16,20,21,22,23,24,25. These cumulative structural changes disrupt plantar pressure distribution, contribute to gait abnormalities, and increase the risk of ulceration26.

Accurate and reliable assessment of plantar soft tissue stiffness is essential to improve the understanding and management of diabetes-related complications12,13,19,27,28,29. However, current in vivo techniques are limited by anatomical variability, restricted access to deeper layers, and operator-dependent inconsistencies, which reduce reproducibility and comparability across studies30. These challenges are particularly evident in anatomically complex regions such as the midfoot, where achieving consistent and reproducible in vivo measurements remains difficult31. Phantom models offer a robust solution by eliminating structural variability, enabling controlled replication of pathological changes such as diabetes-related stiffening, and providing standardized testing independent of operator expertise32. This advantage is especially relevant for midfoot assessment, where phantoms allow consistent and objective measurements under controlled conditions32,33,34,35,36,37. Unlike direct in vivo testing, which is confounded by inter-individual variability and clinical heterogeneity, phantom-based approaches ensure reproducibility, controllability, and reliability. They also remove patient-related risks and ethical concerns, thereby strengthening their role in preclinical research, device development, regulatory calibration, and educational applications34,38,39. Multilayer phantom models, in particular, enable reproducible simulation of pathological conditions such as increased stiffness in diabetes, supporting comparative device evaluation under standardized conditions. Although polyurethane-based phantoms are widely employed in imaging calibration and ultrasound validation studies32,40, their application to musculoskeletal and diabetic foot research remains limited. Importantly, polyurethane gels can be tuned to mimic the viscoelastic properties of soft tissues and layered to reflect the anatomical complexity of the plantar region41.

Several devices are available for assessing soft tissue stiffness, each employing distinct operational principles and penetrating to varying tissue depths. Shear Wave Elastography (SWE) is a non-invasive, ultrasound-based imaging modality that quantifies tissue elasticity by measuring shear wave velocity, and has emerged as a reference method in musculoskeletal research due to its ability to provide reproducible, depth-specific stiffness estimates without operator-induced compression42. In contrast, mechanical tools such as the Shore Durometer31, MyotonPRO43, IndentoPRO44, and Tissue Compliance Meter (TCM)45 offer portable and cost-effective alternatives for biomechanical evaluation. While these tools have demonstrated utility in both clinical31,43,44,45,46 and experimental settings35, there is a lack of comparative data under standardized conditions, particularly in diabetic-specific plantar models.

To address these challenges, the primary aim of this study was to construct anatomically accurate, multilayered polyurethane phantom models that replicate healthy and diabetic plantar soft tissues, incorporating region-specific configurations for the heel, midfoot, and forefoot. These phantoms, designed to replicate the layered structure of skin, fat pad, fascia, and intrinsic muscles, served as standardized platforms to evaluate the reliability and construct validity of four mechanical stiffness assessment tools (MyotonPRO, Shore Durometer, IndentoPRO, and TCM), using SWE as the reference standard. We hypothesized that the stiffness measurements obtained from each mechanical device would be significantly correlated with SWE-derived values, indicating their concurrent validity, and that all devices would demonstrate high intra- and inter-rater reliability under blinded, standardized conditions.

Materials and methods

Study design and setting

This experimental validation study was conducted between January and June 2025 at a university in Germany (institution blinded for review). As the study involved only synthetic models, ethical approval was not required.

Phantom model development

Phantom models were constructed to represent three anatomical regions of the plantar foot: calcaneal (heel), midfoot, and forefoot. Each region was modeled in both healthy and diabetic versions, resulting in six distinct phantom configurations. The layer structures were designed based on detailed anatomical literature47,48,49. Specifically, the heel region included skin, fat pad, and plantar fascia; the midfoot region included skin, fat pad, fascia, and the flexor digitorum brevis muscle; and the forefoot region included skin, fat pad, fascia, and the abductor hallucis muscle.

Target stiffness values for each anatomical layer were determined based on a comprehensive review of published in vivo elastography and cadaveric indentation studies, ensuring physiological relevance of the mechanical properties simulated (see Supplemental File 1). This allowed each phantom model to replicate region-specific stiffness characteristics observed in both healthy and diabetic populations. Technogel® was selected as the primary construction material due to its favorable viscoelastic behavior, biocompatibility, and established use in musculoskeletal phantom development. Compared to other elastomers such as silicone or agar-based gels, Technogel offers superior layering stability, consistent surface compliance, and customizable stiffness ranges within the Shore-OOO scale—traits critical for simulating the nuanced mechanical alterations associated with diabetic foot pathology under standardized experimental conditions.

Material fabrication and stiffness calibration

Each anatomical layer was fabricated using medical-grade polyurethane gel (Technogel GmbH, Germany), mixed with micro-silica particles to enhance both mechanical stability and ultrasound signal transmission. Layers were cast in standardized molds (15 × 10 cm) and sealed with a 25 μm elastic membrane to preserve hydration and surface characteristics during measurement (Fig. 1).

Fig. 1
figure 1

Polyurethane gel pads ınfused with micro-silica particles.

Initial mechanical calibration relied on Shore-OOO hardness ratings provided by the manufacturer (e.g., Shore-OOO 56), determined according to ASTM D2240, the standardized protocol issued by the American Society for Testing and Materials. Shore-OOO was specifically chosen over other durometer scales (e.g., Shore-A, Shore-OO) due to its higher sensitivity in the low-modulus range, which is most relevant for simulating soft biological tissues. Its fine resolution enables better discrimination of subtle stiffness differences among soft tissue analogs, particularly in diabetic models where small mechanical variations are clinically meaningful. Shore-OOO values were converted into estimates of Young’s modulus (E, in kPa) using established mathematical modeling methods50. In parallel, to ensure compatibility with elastographic measurements, E was also derived from shear wave velocity (V, in m/s) using the standard equation:

$$E = 3 \cdot p \cdot V^{2}$$
(1)

where ρ is the assumed tissue density (1000 kg/m3)30. This dual-calibration approach ensured consistency between mechanical and elastographic stiffness interpretations30,51. Final mechanical properties across all phantom configurations are summarized in Table 1.

Table 1 Mechanical properties by layer and region: shore-OOO, Young’s modulus, and shear wave velocity.

Layer variability and validation

Thickness and stiffness parameters for each tissue layer (skin, fat pad, fascia, and intrinsic muscle) were selected based on an extensive review of in vivo and cadaveric studies, as detailed in Supplemental File 1. For both healthy and diabetic models, each tissue type was constructed in three mechanical variants—representing minimum, average, and maximum stiffness values reported in the literature35,52. Accordingly, the heel region included 18 layers (9 healthy, 9 diabetic), while the midfoot and forefoot each comprised 20 layers (10 healthy, 10 diabetic).

All layers were evaluated using shear wave elastography (SWE) by two independent raters. Only those conforming to the literature-based target stiffness ranges were incorporated into the final multilayer assemblies. Each validated layer was assigned a coded label to ensure assessor blinding. Ultimately, six complete multilayer phantom models (three anatomical regions × two health conditions) were constructed. A total of 162 unique model configurations were generated by systematically altering one layer at a time. The full coding system and matrix of layer combinations are provided in Supplemental File 2.

Measurement procedures

All measurements were conducted on a rigid support surface under standardized laboratory conditions (22 ± 1 °C; 50% relative humidity) to minimize potential base interference. Assessors were blinded to both the phantom type and anatomical region to reduce bias.

For each device and anatomical layer, three consecutive measurements were obtained. Values with a coefficient of variation (CV) greater than 5% were excluded to ensure measurement reliability. The remaining two or three valid values were averaged to calculate the final stiffness value for that layer. This exclusion criterion and averaging procedure were applied uniformly across all devices and phantom configurations to maintain consistency and transparency (Table 2).

Table 2 List of phantom layers designed for the calcaneal, midfoot, and forefoot regions in healthy and diabetic models.

Shear wave elastography (SWE)

SWE measurements were performed using a high-resolution ultrasound device (Resona 7, Mindray Bio-Medical Electronics, China) equipped with a linear transducer (L11-3U, 3–11 MHz, 43 mm width). The system was operated under the musculoskeletal preset, with push pulse settings (Qgen) defined between 5.6 and 10 MHz. To ensure consistency and prevent compression artifacts, the transducer was mounted on a metallic holder in a parallel position relative to the phantom surface. A generous amount of transmission gel (Aquasonic 100, Parker Laboratories, USA) was applied to eliminate any probe pressure (Fig. 2A–C). All measurements were conducted in a no-contact configuration, which was selected to improve reliability by eliminating operator-induced variability and preventing deformation of the superficial gel layers, factors known to compromise the reproducibility of SWE data in contact-based protocols.

Fig. 2
figure 2

(AC) Surface view of the SWE setup: (A) immersion in transmission gel with no applied pressure; (B) probe placement parallel to the phantom layers; and (C) stabilization with metallic holders to prevent motion and compression artifacts.

The imaging depth was standardized at 25 mm, and the elastography color scale was set to E3 with a stiffness range limited to 0–800 kPa. The Motion Stability Index (M-STB) feature was activated to optimize acquisition quality. For each tissue-mimicking layer, three separate acquisitions were obtained, focusing on the central zone of the sample to avoid edge artifacts. The region of interest (ROI) was manually adjusted to fully cover the layer’s anatomical width without including boundary transitions.

For each acquisition, the system provided SWV and corresponding E. These values were recorded as mean, minimum, maximum, and standard deviation.

MyotonPRO measurements

The MyotonPRO (Myoton AS, Estonia) assesses dynamic stiffness (N/m) by delivering brief mechanical impulses through a 3 mm probe. Prior to each measurement, the probe was positioned perpendicularly to the phantom surface, and a stable preload of 0.18 N was ensured. This preload was standardized across operators using the device’s integrated visual cue: when the red light turned green, it indicated that the target preload had been consistently applied. The device then delivered a 0.4 N mechanical tap to induce local oscillations, which were captured by an internal accelerometer. Dynamic stiffness was automatically computed by the software based on the resulting oscillation waveforms27. Measurements were taken from three standardized locations (central, medial, and lateral) on each phantom layer (Fig. 3A). Data with a coefficient of variation (CV) > 3% were excluded to ensure measurement reliability, and the remaining valid values were averaged for analysis33.

Fig. 3
figure 3

(AD) Stiffness and compliance assessments of phantom tissues using four different devices. (A) MyotonPRO: triplicate measurements at middle, right, and left points. (B) Shore Durometer: triplicate readings from the same regions of each test site. (C) IndentoPRO: controlled-force stiffness assessment (10 N/s) with 10 mm indentations at three locations. (D) Tissue Compliance Meter (TCM): triplicate compliance measurements at the middle, right, and left regions.

Shore durometer measurements

Hardness measurements were obtained using the Shore Durometer (Type 1600-OO, Rex Gauge Co., Canada), with a 2 mm indenter tip. Three transverse locations (center, left, right) were tested per layer (Fig. 3B), with each indentation held for 1–2 s53. The device was applied vertically, under consistent manual pressure, and each indentation was allowed to stabilize for 1–2 s before recording the value. The device displays a stiffness value on an analog dial ranging from 0 to 100 (UI), where lower Shore-OOO values indicate softer materials54. Readings were averaged and recorded.

IndentoPRO measurements

The IndentoPRO device (Fascia Research Group, Ulm University; University of Chemnitz, Germany) features an 11.3 mm diameter compression probe, a Compression Load Cell FX1901 (TE Connectivity, Switzerland), and a ThinPot 10 kΩ membrane potentiometer (Spectra Symbol, USA)44. Force was applied manually at a controlled rate of 10 N/s until a 10 mm indentation was reached, indicated by an audible signal. Stiffness was calculated as the slope of the force–displacement curve and expressed in N/mm, with lower values reflecting softer tissue. Measurements were taken at the central, medial, and lateral regions of each phantom site. For each location, three consecutive trials were performed, and the average was used for analysis. Only trials with a coefficient of variation (CV) below 5% were included. Each phantom layer was tested at three locations (center, medial, lateral; Fig. 3C). Stiffness was calculated as the slope (N/mm) of the linear portion (2–8 mm) of the force–displacement curve. Trials with a coefficient of variation (CV) > 5% were excluded. The device was recalibrated every 45 min, and probe alignment was visually checked before each trial. Trained assessors, blinded to anatomical region and condition, completed > 30 pilot tests and followed a randomized measurement order35.

Tissue compliance meter (TCM) measurements

The Tissue Compliance Meter (TCM)55, is a validated instrument for assessing tissue compliance through standardized indentation. The system includes a 1 cm2 central probe within an 8 cm ring to ensure vertical alignment and reduce lateral force, which is crucial in fat-rich or uneven plantar areas. A gradual vertical load was applied until a 10 mm indentation depth was reached, while the required force (N) was continuously recorded using a built-in gauge (accuracy ± 0.05 N). This protocol is consistent with previous studies evaluating the plantar fat pad46. Each phantom layer was tested at three points (central, medial, lateral), and the mean was calculated. Trials with a CV > 5% were excluded. A 60-s rest was given between trials. All assessors performed over 30 pilot trials and were blinded to region and model. Measurement order was randomized. TCM’s large contact area and depth capacity make it suitable for assessing soft, fat-rich plantar tissues37,46.

Blinding and randomization

Each phantom layer and assembled model was assigned a unique alphanumeric code using R software56, which masked information regarding the anatomical region, health condition (healthy or diabetic), and stiffness level. All coded samples were sealed in opaque, unlabeled packaging to conceal their internal configuration. This coding system was managed by a researcher independent from the assessment and analysis phases.

To ensure rigorous blinding, a double-blind protocol was implemented: assessors were blinded to the phantom type, region, and mechanical characteristics, while device operators were blinded to measurements obtained by other tools. Measurements for each device (SWE, MyotonPRO, Shore Durometer, IndentoPRO, and Tissue Compliance Meter) were independently randomized using a block randomization scheme, and spatial test points (central, medial, and lateral) were randomly varied across trials to account for intra-layer heterogeneity.

All measurements were conducted by two assessors with doctoral-level training in biomechanics and clinical physiotherapy, each with a minimum of 5 years of experience using quantitative stiffness assessment tools in both research and clinical settings. Prior to data collection, the assessors received standardized training on all device-specific protocols to ensure consistency. While basic operation of the measurement tools is accessible to trained physiotherapists, reproduction of the observed precision may require advanced methodological expertise and adherence to rigorous calibration and blinding protocols as described in this study.

Statistical analysis

All statistical analyses were conducted using SPSS (v28.0; IBM) and R (v4.3.1; R Core Team, 2023), with significance set at p < 0.05. A blinded biostatistician performed all procedures independently. Data were screened for outliers (via z-scores and boxplots), missing values, and entry errors. No outliers were excluded, as all values fell within ± 3 SD. Normality was confirmed using Shapiro–Wilk tests, histograms, and Q–Q plots, allowing for parametric analyses. Intra- and inter-rater reliability were assessed using two-way mixed-effects intraclass correlation coefficients (ICC, absolute agreement), and agreement was visualized with Bland–Altman plots57. Concurent validity was evaluated using Pearson correlations between device-based stiffness and SWE-derived E. Predictive validity was assessed using linear regression analyses, reporting unstandardized beta coefficients (β), coefficient of determination (R2), standard errors (SE), and p-values. All regression assumptions (normality, homoscedasticity, independence of residuals) were satisfied; multicollinearity was excluded (variance inflation factor, VIF < 2). Between-group comparisons (healthy vs. diabetic) used independent-samples t-tests by region. One-way ANOVA with Bonferroni post hoc tests examined differences across stiffness configurations. Effect sizes were reported using Cohen’s d for pairwise comparisons58 and eta squared (η2) for ANOVA results. According to established guidelines, η2 values of 0.01, 0.06, and 0.14 correspond to small, medium, and large effects, respectively59. A multiple regression model also examined the combined effects of region, device, and configuration on stiffness.

Results

Inter- and ıntra-rater reproducibility

Table 3 presents the reproducibility results for the four stiffness measurement devices (MyotonPRO, Durometer, IndentoPRO, and TCM) based on inter- and intra-rater assessments. No statistically significant differences were observed between raters for any device (p > 0.05). Inter-rater reliability was excellent, with intraclass correlation coefficients (ICC) ranging from 0.979 (Durometer) to 0.994 (IndentoPRO). Similarly, intra-rater analyses revealed no significant test–retest differences across devices (p > 0.05), with consistently high ICC values: 0.966 (MyotonPRO), 0.969 (Durometer), 0.993 (IndentoPRO), and 0.992 (TCM), indicating strong reproducibility. The standard error of measurement (SEM) and minimal detectable change at 90% confidence (MDC90) are also reported in Table 3, further supporting the precision and stability of these tools.

Table 3 Assessment of intra- and inter-rater reproducibility for MyotonPRO, durometer, IndentoPRO, and TCM.

Bland–Altman plots (Fig. 4A–H) were used to assess agreement across raters and repeated measurements. Panels A–D illustrate inter-rater agreement, and Panels E–H display intra-rater agreement for each device. All plots demonstrated narrow limits of agreement and minimal bias, indicating high measurement consistency.

Fig. 4
figure 4

(AH) Agreement Analysis of Stiffness Measurement Tools Using Bland–Altman Plots: Inter-Rater and Intra-Rater Comparisons. *Panels (AD) show inter-rater, and Panels (EH) show intra-rater agreement for MyotonPRO, Durometer, IndentoPRO, and TCM. Each plot displays the mean bias and 95% limits of agreement. Narrow limits and low bias reflect strong measurement consistency within and between raters.

Regional comparison of stiffness in healthy and diabetic phantom models

Stiffness measurements obtained using SWE, MyotonPRO, Durometer, IndentoPRO, and TCM across the calcaneal, midfoot, and forefoot regions in healthy and diabetic phantom models are summarized in Table 4 and illustrated in Fig. 5.

Table 4 Comparison of SWE, MyotonPRO, durometer, IndentoPRO, and TCM measurements across plantar foot regions in healthy and diabetic phantom models.
Fig. 5
figure 5

Regional comparison of SWE and mechanical stiffness measurements in healthy and diabetic plantar models. Line graphs illustrating regional differences in shear wave elastography (SWE) values—Young’s modulus (E, kPa) and shear wave velocity (m/s)—as well as mechanical stiffness measurements obtained using the Durometer (Shore-OO), MyotonPRO (N/m), IndentoPRO (N), and TCM (N/mm) across the calcaneal, midfoot, and forefoot regions. Data are presented separately for diabetic (blue line) and healthy (green line) phantom models.

For SWE, shear wave velocity (SWV) values were significantly higher in diabetic models in the calcaneal (p < 0.001, η2 = 0.31) and midfoot (p < 0.001, η2 = 0.30) regions, with no significant difference in the forefoot (p = 0.139). Similarly, Young’s modulus (E) values were significantly elevated in diabetic models in the calcaneal and midfoot regions (both p < 0.001), but not in the forefoot (p = 0.141). Mixed ANOVA revealed significant main and interaction effects for both SWV and Young’s modulus (p < 0.001), with post hoc tests confirming greater stiffness in the calcaneal and midfoot regions compared to the forefoot.

For MyotonPRO, significant group differences were observed in the midfoot (p < 0.001, η2 = 0.41) and forefoot (p < 0.001, η2 = 0.16), but not in the calcaneal region (p = 0.075). Both within-group and interaction effects were significant (p < 0.001). Post hoc analysis showed that in healthy models, stiffness followed the pattern calcaneal > midfoot = forefoot, while in diabetic models, the pattern was calcaneal = midfoot > forefoot.

Durometer measurements showed significantly higher stiffness in diabetic models for the midfoot and forefoot regions (p < 0.001), but not for the calcaneal region (p = 0.440). Main and interaction effects were significant (p < 0.001). The regional pattern in healthy models was calcaneal > midfoot = forefoot, whereas in diabetic models it was calcaneal = midfoot > forefoot.

IndentoPRO-based force values (at 10 mm indentation depth) were significantly greater in diabetic models across all three regions (p < 0.001). Both regional differences and interaction effects were statistically significant (p < 0.001 and p = 0.027, respectively). The stiffness distribution was calcaneal > midfoot = forefoot in healthy models and calcaneal = midfoot > forefoot in diabetic models.

For TCM, all stiffness values were significantly elevated in diabetic models across regions (calcaneal: p = 0.003; midfoot and forefoot: p < 0.001). A strong grou p × region interaction was observed (p < 0.001), and regional patterns were consistent with other tools: calcaneal > midfoot = forefoot in healthy models, and calcaneal = midfoot > forefoot in diabetic models.

Regression analyses with SWE parameters

Table 5 presents the regression results evaluating the predictive value of each device’s stiffness measurements for SWV. All four devices showed statistically significant associations with SWV in the calcaneal region for both healthy and diabetic models (p < 0.001). Notably, TCM demonstrated the strongest association in the diabetic forefoot region (β = 0.155, R2 = 0.527). In the pooled analysis across all regions and models, TCM measurements explained nearly half of the variance in SWV (β = 0.097, R2 = 0.483), followed by IndentoPRO (R2 = 0.315), MyotonPRO (R2 = 0.307), and Durometer (R2 = 0.285).

Table 5 Regression analysis of stiffness measurement tools on shear wave velocity in plantar soft tissue.

Table 6 summarizes the regression outcomes for mechanical stiffness values in predicting SWE-derived E. All four devices demonstrated significant associations with E in the calcaneal region of both healthy and diabetic models (p ≤ 0.001). In the forefoot region, TCM again exhibited the strongest predictive relationship, particularly in the diabetic model (β = 5.55, R2 = 0.520). No statistically significant associations were observed in the midfoot region for any device. In the full regional model, TCM demonstrated the highest explanatory power (R2 = 0.508), followed by Durometer (R2 = 0.317), IndentoPRO (R2 = 0.315), and MyotonPRO (R2 = 0.315).

Table 6 Regression analysis of stiffness measurement tools on SWE-derived elastic modulus in plantar soft tissue.

Discussion

Validity and reliability of mechanical devices using phantom models

This study demonstrated that multilayer polyurethane foot phantoms can effectively simulate the mechanical behavior of healthy and diabetic plantar soft tissues. Constructed using literature-based stiffness values and anatomical segmentation, these models enabled a standardized evaluation of mechanical measurement tools. Consistent with our hypothesis, stiffness measurements obtained from all four mechanical devices were significantly correlated with SWE-derived values, supporting their concurrent validity. Additionally, all devices exhibited high intra- and inter-rater reliability under blinded and standardized testing conditions (ICC ≥ 0.96), indicating strong repeatability under phantom-based conditions. However, their direct applicability to biological tissues remains uncertain due to the greater variability in vivo.

These findings align with previous reports on the reliability of MyotonPRO33,36,43,60,61, IndentoPRO35,44,46, and TCM37,45,46. Minimal variability across repeated measures (< 5%) further supports the usability of these tools in consistent clinical or research settings. IndentoPRO exhibited the highest raw correlation (r ≈ 0.9) with SWE-derived stiffness, consistent with Bartsch et al.35, indicating that force–displacement metrics from controlled indentation closely mirror the elastic modulus provided by elastography. TCM also showed strong associations (r ≈ 0.84) and emerged as the best predictor of SWE-based E values, potentially due to its deeper, fixed-depth indentation. In contrast, MyotonPRO and Shore Durometer, primarily assessing superficial tissue properties31,36,46, demonstrated slightly lower but still acceptable correlations (r ≈ 0.78–0.87). Taken together, these results suggest that each device has specific strengths depending on the depth and anatomical location of the target tissue. IndentoPRO and TCM appear particularly sensitive to deeper structures such as the heel pad, while MyotonPRO and the Shore Durometer provide portable and practical options for superficial tissues like the forefoot skin and subcutaneous layers. In addition to technical accuracy, factors such as portability, cost, and ease of use should also inform device selection in applied settings.

Anatomical and regional variations in stiffness assessment

Our phantom models successfully reproduced region-specific stiffness patterns consistent with SWE findings in diabetic patients29,62. The heel and midfoot regions of diabetic phantoms exhibited higher stiffness than healthy models, simulating the known fibrotic remodeling and glycation-induced rigidity characteristic of diabetic foot pathology63. These regional stiffness increases are not the result of spontaneous biological change but are embedded by design, reflecting intentional differences in material modulus. Consequently, device-detected variations serve to verify measurement sensitivity and responsiveness rather than track disease progression. While these regional differences in stiffness are design-driven and not reflective of biological remodeling, they serve as a necessary foundation for evaluating the measurement accuracy and discriminatory capacity of the mechanical devices under standardized conditions. Although this outcome may appear expected, demonstrating that the devices can distinguish known, preconfigured stiffness gradients is essential for establishing their concurrent validity and reliability in detecting clinically relevant variations.

Interestingly, the forefoot region remained relatively compliant in diabetic phantoms, reflecting clinical findings that suggest later or less pronounced degenerative changes in this area64,65. Studies on the plantar fascia support this pattern, noting proximal stiffening near the heel and preserved compliance in the forefoot66,67,68,69,70. This region-specific approach (Fig. 6) reinforces the idea that diabetic plantar stiffening is not a uniform process and that device selection should be guided by anatomical and biomechanical context.

Fig. 6
figure 6

Region-specific measurement strengths of the four stiffness assessment tools across plantar foot. A comparative overview of the region-specific performance characteristics of MyotonPRO, Durometer, IndentoPRO, and TCM based on multilayer phantom model analysis. The figure summarizes optimal assessment zones, heel (calcaneal), midfoot, and forefoot, for each tool, considering their mechanical sensitivity and correlation with SWE. The visualization supports an integrated approach to diabetic foot stiffness assessment by guiding tool selection based on anatomical region and measurement capability.

The midfoot, in particular, emerged as a biomechanically complex region, where all devices exhibited weaker correlations with SWE and greater measurement variability. This finding is consistent with previous literature highlighting the anatomical intricacies and heterogeneous load-bearing structures of the midfoot, including intrinsic muscles and overlapping soft tissue layers6,9,68,71. Consistent with our findings, Tonna et al.31 also emphasized the difficulty of obtaining reliable stiffness measurements in this region. Our phantom models incorporated structural layering (skin, fat pad, fascia, muscle) but did not replicate internal anisotropy or directional fiber orientation, especially within the muscle layer. In addition, the flat phantom geometry limited the replication of the three-dimensional curvature and pressure distribution of the human foot, which may have contributed to estimation errors in the midfoot region. Future research should therefore focus on fabricating anatomically shaped phantom patches with tunable viscoelastic properties to enhance structural fidelity and improve translational relevance. Finite element modeling by Chatzistergos et al.72, supports the use of curved phantoms or imaging integration to improve measurement precision in this region.

Clinical ımplications, limitations, and future directions

Region-specific mechanical stiffness assessments offer promising applications for diabetic foot screening and early risk detection. The phantom models used in this study successfully simulated diabetic foot stiffness patterns, particularly increased stiffness in the heel and midfoot, supporting their relevance for preclinical evaluation. Devices such as the TCM and IndentoPRO appear well-suited for deeper tissue assessment, while the MyotonPRO and Shore Durometer offer practical utility for evaluating more superficial layers.

Nevertheless, several limitations must be acknowledged. Although the phantom models were carefully constructed, they cannot replicate key biological features such as vascular perfusion, anisotropic fiber architecture, or dynamic viscoelasticity. Their flat geometry and uniform density assumptions may have influenced load distribution, particularly in the midfoot, while the fixed measurement depths of some devices may have introduced estimation errors. A further limitation of this study is related to the midfoot region. Despite the use of anatomically realistic phantom models, the correlations between some mechanical devices and SWE were lower in this area. This finding is consistent with previous research, where the structural complexity and heterogeneous composition of the midfoot have been associated with reduced measurement stability and reproducibility. Our results therefore indicate that, while phantom-based models improve standardization, they do not fully resolve the challenges of midfoot assessment, and future studies should explore optimized protocols or device adaptations for this region. Another limitation is that the stiffness values used to fabricate the phantom models were derived from previous literature, without direct in vivo comparisons. Although these references provided essential ranges for simulating healthy and diabetic conditions, the absence of concurrent in vivo validation may restrict the immediate clinical generalizability of our findings.

Future work should therefore integrate direct in vivo assessments to confirm and expand upon the phantom-based results. To improve translational applicability, studies should also focus on the development of anatomically shaped phantoms with tunable viscoelastic properties and on evaluating device responsiveness to clinical interventions. Integrating mechanical stiffness data with plantar pressure mapping and neuropathy screening could provide a more comprehensive risk profile for diabetic foot complications. Combining experimental phantom studies with computational models may further improve predictive accuracy and translational value. The integration of numerical simulations with experimental data could provide more comprehensive insights into plantar tissue mechanics and enhance the clinical applicability of stiffness measurement tools.

Conclusion

This study demonstrated that four mechanical devices (MyotonPRO, Shore Durometer, IndentoPRO and TCM) are reliable and valid tools for assessing plantar soft tissue stiffness using multilayer polyurethane phantom models simulating healthy and diabetic foot conditions. All devices showed excellent intra- and inter-rater reliability and significant correlations with SWE-derived reference values, supporting their concurrent validity. Each device exhibited distinct strengths depending on tissue depth: the TCM and IndentoPRO were particularly effective in assessing deeper tissue stiffness, while the MyotonPRO and Shore Durometer were better suited for evaluating more superficial layers. Although phantom models cannot fully simulate the complexity of biological tissues, they offer a reproducible and standardized platform for evaluating device performance across anatomically relevant regions. This phantom-based approach may also support early-phase medical device evaluation, particularly in situations where in vivo testing is not immediately feasible. Moving forward, enhancing anatomical realism, validating findings in clinical settings, and integrating these tools with complementary diagnostics will be essential for advancing their clinical utility in diabetic foot care.