Multimodal inverse kinematics significantly improves IMU-based biomechanical analyses

Wechsler, Iris; Shanbhag, Julian; Schlechtweg, Niklas; Vossiek, Martin; Koelewijn, Anne D.; Wartzack, Sandro; Miehling, Jörg

doi:10.1038/s41598-025-33021-7

Download PDF

Article
Open access
Published: 23 December 2025

Multimodal inverse kinematics significantly improves IMU-based biomechanical analyses

Iris Wechsler¹,
Julian Shanbhag¹,
Niklas Schlechtweg²,
Martin Vossiek²,
Anne D. Koelewijn³,
Sandro Wartzack¹ &
…
Jörg Miehling¹

Scientific Reports volume 15, Article number: 44420 (2025) Cite this article

729 Accesses
Metrics details

Subjects

Abstract

In musculoskeletal simulations, IMU-based approaches are often compromised by errors such as joint angle drift and offset errors due to calibration inaccuracies. These errors can compromise the accuracy of both kinematic and dynamic outcomes. This study presents a simulation-based investigation that uses synthetic inertial and positional data to systematically assess the potential of integrating spatial reference information into IMU-driven inverse kinematics analyses. Optical motion capture data was captured and error-free kinematic and dynamic data was created based on the optical motion capture data. The error-free data was then used as a reference. Based on this reference data, synthetic orientation and position data was created, incorporating a range of error types and magnitudes (e.g., sensor noise, drift, misalignment). To create the IMU-based analysis results, we calculated relative quaternions based on the orientation data which were then converted into Euler angles. We then conducted a sensitivity analysis to determine the spatial accuracy required in the position data to effectively compensate for typical IMU errors. Across all error types and magnitudes, the multimodal inverse approach (using both synthetic IMU and positional data) yielded significantly more accurate results than solely IMU-based analyses. Specifically, the mean joint angle RMSE decreased by ${63}{\%} (\sim {5}^{\circ })$, the mean joint torque RMSE by ${80}{\%} (\sim {17}\,\hbox {Nm})$, the mean residual force RMSE by ${25}{\%} (\sim {9}\,\hbox {N})$, and the mean residual torque RMSE by ${70}{\%} (\sim {26}\,\hbox {Nm})$. Future research will evaluate the effectiveness of the multimodal inverse kinematics approach when applied to real-world measurement data.

Multibody kinematics optimization for motion reconstruction of the human upper extremity using potential field method

Article Open access 26 March 2025

Multimodal video and IMU kinematic dataset on daily life activities using affordable devices

Article Open access 22 September 2023

A frame orientation optimisation method for consistent interpretation of kinematic signals

Article Open access 14 June 2023

Introduction

Musculoskeletal simulations are used to estimate biomechanical variables that cannot (easily) be measured in reality, such as joint angle and joint velocity data during movement, or the muscle forces required to perform these movements. Simulations are used in a variety of fields to answer a range of (research) questions. For instance, in the field of medical technology they are used to assess the outcome of anterior cruciate ligament surgery^1,2, in ergonomics, to assess a worker’s workload³, and in sports science to optimize an athlete’s performance⁴. To analyze the motion data, it must first be captured. The most common approach to measure human motion data is optical motion capture. Small reflective markers are attached to specific points on the skin (usually on anatomical landmarks), and the trajectories of these markers are then measured using a set of high-resolution cameras while the participant moves within a calibrated measuring space. There are also alternative technologies available. Inertial measurement units (IMUs) are a frequently used alternative^5,6,7,8,9. One of the key advantages of IMU-systems is that they do not require a calibrated capture volume, which allows users to record motion data more flexibly, e.g. in outdoor environments¹⁰. Further, completely non-invasive and markerless approaches, meaning no markers or sensors have to be attached to the participants, have been developed that use either RGB cameras^11,12,13, depth cameras^14,15,16 or a combination of both to record human movement¹⁷. These approaches are a simple and cost-effective way of recording movement as no expensive measurement system is required. The most recent advancement are radar-based motion capture approaches^18,19. Although using radars for motion tracking is still in its early stages of development, results so far are promising. This is due to the fact that radars are not affected by lighting conditions and are capable of directly measuring speeds, distances and angles¹⁸.

Following the motion capture, the data is transferred to a musculoskeletal model. Musculoskeletal models are multibody systems that depict a person in the virtual world. The models consist of rigid bodies, joints and actuators, which respectively represent the bones, joints and muscles of the human body. A conventional approach to compute joint kinematics (joint angles, velocities and accelerations) and joint torques or muscle forces from a recorded motion is the inverse approach. It consists of inverse kinematics, followed by inverse dynamics and static or dynamic optimization. However, musculoskeletal simulations are simplifications of reality, thus the simulation results are error-prone. For the inverse approach, residual forces and torques (’residuals’) appear at the segment connected to the ground frame which is often the pelvis of a biomechanical model. Residuals have no physical cause and therefore are often called ’hand of god’ forces, making the simulation results physically inconsistent^20,21, as residual forces are not needed in reality. Therefore, deviations between reality (represented by the experimental data) and the simulation remain. We called this deviation the sim2real gap. In a previous study we identified and analyzed approaches how to handle this gap²². In short, both the level of detail of the musculoskeletal models as well as the tracking method used, affect the size of the sim2real gap. We decided to focus on and analyze available tracking methods in the review. Based on the identified and clustered results we concluded that one possible solution to handle the sim2real gap may be tracking multimodal sensor data, as multimodal approaches led to smaller deviations between model and measurement data (e.g. marker deviations or IMU-based body orientations). Pearl et al.²³ used trajectory optimization to fuse RGB camera and IMU data. The authors reported that the multimodal solution outperformed single-modality approaches using either IMU or video camera data alone. Additionally, trajectory optimization, alone²⁴ or combined with machine learning approaches²⁵, has been successfully used to compensate for or prevent IMU drift by using raw IMU data directly, avoiding sensor data integration and associated drift. However, applications have focused solely on two-dimensional gait and running movements thus far. Mallat et al.²⁶ used a Kalman filter to fuse RGB camera data and IMU sensor data. The authors reported reduced motion tracking errors as the sensor technologies compensated each other’s weaknesses (marker occlusion and IMU sensor drift respectively). In contrast, Atrsaei et al.²⁷ fused depth camera data and IMU sensor data for human motion tracking, also using a Kalman filter. The sensor fusion led to smaller deviations between computed joint angles and reference data calculated from marker-based motion data.

Even though Kalman filters have been successfully used for musculoskeletal simulations, they are not without limitation. The quality of tracking results computed using a Kalman filter are significantly influenced by the quality of chosen parameters e.g. covariances of the system and measurement noise. Further, specific assumptions are often taken to enhance convergence, including constant acceleration or jerk^28,29, periodicity of motions²⁶ or bilateral symmetry²⁶. The conventional inverse kinematics approach does not rely on such assumptions. Joint angles are calculated by minimizing the difference between experimental and model data (either marker positions or orientations) for each frame. However, a marker-based inverse kinematics approach requires a dedicated motion capturing lab as well as attaching the markers on the skin of a participant ideally, making the capturing inflexible, cumbersome, and invasive for the participants. For IMU-based motion data, the capture process is more flexible as well as user-friendly, as the sensors are mostly attached to the body using flexible bands which can be placed above the clothing of a participant. One disadvantage of IMU-based motion data is, that the data may be affected by various errors such as sensor drift³⁰ and calibration errors³¹, leading to inaccurate analysis results. Therefore, the approaches are not suited for use cases requiring high levels of accuracy and reliability, such as within a clinical context. A combination of both position and orientation data within the framework of the inverse kinematics approach should lead to more accurate biomechanical analysis results, as both data types complement each other. In the context of a simulation study, we want to investigate whether single position references, determined by alternative motion capture systems such as RGB cameras, depth cameras or radars, enhance the accuracy of IMU-based inverse kinematics results. The intention is thus not to use a complete set of highly precise marker position data that has been recorded by an optical measurement system. Instead, the focus is on investigating the impact of individual, less precise position data (one per segment).

The objective of this simulation study was to investigate the effect of adding position data of single references to IMU-based joint angle data within the framework of an inverse kinematics analysis. We investigated the effect on the calculated kinematics (joint angles) and dynamics (joint torques and residuals) based on the multimodal approach. As part of a sensitivity study, we wanted to find out the extent of error in the IMU-based joint angle data that can be compensated for by position data of varying accuracy. The approach is implemented in OpenSim – a framework for musculoskeletal simulations³². We created synthetic sensor data based on a reference simulation generated based on marker-based motion capturing. We investigated the effect of different errors and error sizes for both the IMU-based orientation data and position data on the resulting kinematics and dynamics.

Methods

Model description

In this study, we used the Rajagopal lower limb model³³. It consists of 20 bodies and has 37 degrees of freedom. Only experimental data for the lower body was captured. Consequently, the degrees of freedom of the upper body were locked. In addition, the metatarsophalangeal joint on each foot was locked. Therefore, the model had 16 free degrees of freedom in total. Virtual IMU sensors were placed on the pelvis, the upper and lower leg and the foot on each side. In addition to the markers corresponding to the experimental marker set, seven markers were placed at the origin of the virtual IMU sensor coordinate systems. To avoid uncertainties introduced by muscle modeling, muscles were removed and replaced with ideal torque actuators for each degree of freedom. This enabled us to assess the separate impact of the tracking method on joint kinematics and inverse dynamics results. In addition, point and torque actuators for each coordinate (x,y,z) were added to the origin of the left and right toes of the model to generate synthetic ground reaction forces using the reference data.

Experimental data collection and data pre-processing

Gait data was captured for one participant (female, 28 years old, ${1.6}\,\hbox {m}, {56}\,\hbox {kg}$) walking on a treadmill using the marker-based motion capture system OptiTrack³⁴. The motion capture procedure was initiated with the participant standing on the treadmill and continued for a duration of 30 seconds with a self-chosen walking speed. We analyzed the first ten seconds of motion encompassing the transition from standing to walking. The Rizzoli lower body marker set³⁵ comprising 22 markers was used in the motion capture process. Overall, 12 cameras (type: Flex 13) were used for capturing the passive markers (diameter: 14 mm). In addition to the collection of gait data, motion data of a static pose was captured for the purpose of model scaling. The motion data was captured with a measuring frequency of ${120}\,\hbox {Hz}$. An auto-labeling function was used to label the data. Unidentified or misidentified markers were manually labeled. We encountered problems with some markers due to marker occlusion. Therefore, we used interpolation methods to recalculate the missing data. The marker data was filtered using a third order Butterworth Filter with a cutoff frequency of ${6}\,\hbox {Hz}$. The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board (or Ethics Committee) of Friedrich-Alexander-Universität Erlangen-Nürnberg (protocol code 20-489_1-B). The participant provided written informed consent prior to inclusion in the study.

Creation of reference data

The lower body of the generic Rajagopal model was scaled based on marker data from the static pose. Using the OpenSim scaling tool, all 22 markers of the Rizzoli lower body marker set were used to scale the model. Using inverse kinematics, the reference kinematics data was computed. The computed results were filtered using a third-order Butterworth filter with a cutoff frequency set at ${6}\,\hbox {Hz}$. Based on the calculated generalized coordinates, static optimization was used to compute reference joint torques. Ground reaction forces were not measured using a force plate, but were instead estimated using the model and the reference generalized coordinates. Synthetic ground reaction forces were computed using the point and torque actuators added to the toes. To generate realistic ground reaction forces, the point and torque actuators on each foot were only activated when the respective foot was in contact with the ground. Heel strike and toe off time frames were estimated by analyzing the processed marker data. Heel strike time frames were determined through the identification of frames in which the velocity of the heel marker was found to be zero. The identification of toe-off time frames was achieved by the identification of peaks in the vertical acceleration data of the toe marker. As the reference data was used to calculate the synthetic ground reaction forces, the reference inverse dynamics results were free of residuals.

Creation of synthetic data

Based on the reference kinematics data, the synthetic position and orientation data for the sensitivity analysis was created. To create the noise-free position data, the position of each of the seven markers placed in the virtual IMU sensors was extracted for every frame. To generate noisy position data, two errors were added to the noise-free reference position data: a white Gaussian noise ($X \sim {\mathscr {N}}(\mu , \sigma ^2)$) depicting measurement noise and a marker offset depicting calibration errors. For both errors, three sizes of errors were chosen, see Table 1. The types and sizes of error for the position data were selected in reference to data accuracy for the determination of sensor positions using radars, as outlined in³⁶. Furthermore, the position data has been adjusted to include dropout rates (missing values). For each position reference, a random dropout rate of between ${20}\%\,\hbox {to}\,{40}{\%}$ has been implemented. The missing values were then filled using cubic splines. To generate synthetic orientation data that imitates experimental IMU-based orientation data, angular velocities were calculated using the virtual IMU-sensors. Four different error types were added to each component of the angular velocity vector: a bias, a misalignment error, a bias drift and a Gaussian noise defined by a noise density value. The bias and the misalignment error did not change in size. For the bias drift and Gaussian error, three levels of errors were chosen. Bias and misalignment were kept constant as they represent time-invariant, device-specific errors for a given setup. In contrast, bias drift and noise density were varied to simulate time-dependent effects and different sensor quality levels, respectively, enabling a broader evaluation of the robustness of the multimodal inverse kinematics approach. The types and sizes of errors were chosen with reference to the error values of the gyro sensor built into the ICM-20689 motion tracker, as documented in the technical datasheet³⁷. The noisy angular velocity data was then integrated to generate noisy orientation data. In addition to the four errors added directly to the angular velocities, a random but constant misalignment error between ${5}^{\circ }\,\hbox {to}\,{10}^{\circ }$ was added to each sensor data to model the effect of incorrect sensor placement on the body. To model the effect of magnetic disturbance, a yaw drift error of ${0.01}\,\hbox {rad/s}$ was added to the integrated angular velocities. The orientation data was expressed as quaternions. The size of the errors for every level is listed in Table 1. By combining all position and orientation error levels, 81 pairs of input files were created in total.

Table 1 Error sizes for all types of errors investigated in the sensitivity analysis.

Full size table

Creation of IMU-based motion data

The synthetic IMU-based motion data was created based on the synthetic orientation data. Joint angles were calculated based on the synthetic orientation data expressed as quaternions. The hip, knee and ankle angles were calculated by computing the relative quaternion between two quaternions expressing the orientation of adjacent segments. The relative quaternion is expressed in axis-angle form. For each joint angle, the axis angle form was then transformed to Euler angles corresponding to the degrees of freedom of the biomechanical model. As we generated orientation data based on the virtual gyro data, only data for the rotational degrees of freedom of the model were generated for the IMU-based solution. The computed joint angles were filtered using a third-order Butterworth filter with a cutoff frequency set at ${6}\,\hbox {Hz}$. No data was generated for the translational generalized coordinates (x-, y-, z-translation of the pelvis). Instead, fixed values were chosen to hold the model in place.

Multimodal inverse kinematics

The multimodal inverse kinematics approach used both position and joint angle data as input. Consequently, the IMU-based motion data described in the preceding paragraph is used. Analogous to the marker-based inverse kinematics approach, it solved a least-squares optimization problem for every timestep. However, in addition to minimizing model–marker deviations at each time step, deviations between model and experimental joint angles are also minimized. The multimodal inverse kinematics results are calculated by solving the following least squares problem for each timestep:

$$\begin{aligned} \min J = \sum _{i=1}^7 (w_{mi} \cdot (m_i - m_{di})^\top (m_i - m_{di})) + \sum _{j=1}^{13} (w_{qj} \cdot (q_j - q_{dj})^2) \end{aligned}$$

(1)

where $m_i$ and $m_{di}$ are the model and desired marker positions, $q_j$ and $q_{dj}$ are the model and desired generalized joint coordinates and $w_{mi}$ and $w_{qj}$ are the marker and generalized coordinates weightings. For the spatial input data, only the markers positioned at the origin of the virtual IMU-sensors were used. For the rotational input data, joint coordinate values for all free degrees of freedom of the model were used. By using both spatial and rotational data as input, joint coordinate values are computed for all translational and rotational degrees of freedom.

A systematic approach was employed to determine appropriate weightings for the marker and generalized coordinate data. Initially, the error-free reference dataset was used to compute the variance of both data types, enabling a variance-based normalization of the weighting factors. The global variance across all marker positions ($\sigma ^2_{m} = 0.1004$) and generalized coordinates ($\sigma ^2_{q} = 0.1019$) was calculated and the weighting factors were then determined accordingly ($w_{mi} = 9.81$ and $w_{qi} = 9.96$), see Equation 2 and 3 (with $\gamma = 1$). Kinematic results were then computed for all 81 analyses. Analysis of the resulting mean trajectories from both the IMU-based and the multimodal approaches revealed that the marker data had little to no influence on the multimodal inverse kinematics outcomes when using variance-based weightings. Consequently, a focused grid search was conducted to identify more effective weighting parameters. In this process, the weighting of the joint angle data was systematically reduced to evaluate the relative contribution of the marker information.

$$\begin{aligned} w_{mi} = \frac{1}{\sigma ^2_{m}} \end{aligned}$$

(2)

$$\begin{aligned} w_{qj} = \gamma \cdot \frac{1}{\sigma ^2_{q}}, \text { with } \gamma \in \{ 10^i | -8 \le i \le 0 \} \end{aligned}$$

(3)

To determine the optimal weighting configuration, kinematic data for all 81 datasets were computed for each tested weighting pair. The mean joint angle root mean square error (RMSE) between the reference and multimodal results was used as the evaluation metric to identify the most suitable weightings. The resulting mean RMSE values for all tested combinations are provided online as Supplementary Table S1. Based on this evaluation, final weightings of 9.81 for the marker data and $9.96 \times 10^{-6}$ for the generalized coordinates were selected. The computed generalized coordinates were filtered using a third-order Butterworth filter with a cutoff frequency set at ${6}\,\hbox {Hz}$. A range of pelvis marker weightings was investigated to determine whether different weightings of the spatial reference for the base segment affect the size of residual forces and torques. The original weighting was adjusted using a scaling parameter $\epsilon$, with $\epsilon \in \{ 0.01, 0.1, 10, 100 \}$. All other weightings remained the same. The different steps of the simulation study are illustrated in Fig. 1.

Statistical analysis

To assess whether the inclusion of single position data improves IMU-based kinematic results in the context of this simulation study, paired t-tests were conducted to compare the RMSE values of joint angles and joint torques between the IMU-based and multimodal analysis results, using the reference data as a benchmark. For statistical analysis of dynamic data, residual torques (pelvis rotations) were combined with joint torques. As no data was generated for the IMU-based solution for pelvic translations, no statistical analysis was performed on this data. To evaluate the effect of error types and sizes on the RMSEs, the t-tests were calculated on grouped data based on error type and size. As the orientation data-based method is not affected by the error types of the position data, paired t-tests were only calculated for both orientation errors. Therefore, to compare the joint angle and joint torque data, 6 paired t-tests were calculated. To avoid an inflated Type 1 error rate, we used the Bonferroni-Holm correction to adjust the alpha value for each t-test. The adjusted alpha values were calculated using the following equation:

$$\begin{aligned} \alpha _{\text {Holm},i} = \frac{\alpha }{m - i + 1} \end{aligned}$$

(4)

where $\alpha$ is the original significance level (0.05), m is the number of tests performed (6), and i is the rank of the test in order of p-values. 95% Confidence Intervals (CIs) were calculated on the mean difference between RMSEs for each group to further analyze the results. Mean differences were calculated by subtracting the RMSE for the IMU-based approach of the RMSE of the multimodal approach. CIs not including the zero value then indicate a significant mean difference between RMSEs. Before performing the t-test, all groups of data were tested for normality using the Shapiro-Wilk test.

In order to further analyze the impact of position-based error types on the quality of multimodal approach outcomes, the RMSEs for the multimodal approach are analyzed depending on orientation data-based errors and grouped by spatial error sizes. This enables the analysis of the influence of position data accuracy on the effect of orientation errors. To facilitate the analysis, a mean position error was calculated. For each marker offset level (low, medium, high), the mean value is calculated across all marker noise levels for each orientation level. This calculation is perfomed for both orientation errors.

Results

Comparison of kinematic data

We analyzed the computed generalized coordinates for the IMU-based and the multimodal inverse kinematics approach to determine whether the addition of single position references enhances the kinematic results of the IMU-based solutions. Figure 2 shows mean generalized coordinate values as well as standard deviations calculated for 81 analyses for all actuated degrees of freedom for both inverse kinematics analyses results. The reference data is depicted for comparison. In general, the mean curves for all joint angles correspond well to the reference curves for both approaches. However, a better match can be observed for the multimodal inverse kinematics solution. In addition, the standard deviations are larger for the IMU-based solution across all joint angles, with the exception of the hip rotation angles. With regard to the hip rotation and hip adduction angles, a drift in the data can be observed in the IMU-based solutions. With regard to the multimodal approach, such a drift is not observable. For the pelvis translations, a good match can be observed between the multimodal approach and the reference data for the x- and y-translation. For the z-coordinate, a steady offset can be observed. As we used fixed data for the IMU-based approach, large deviations can be observed. Table 2 lists the mean joint angle RMSE values for both approaches with regard to the reference data. The mean RMSE for the IMU-based solution was more than twice that of the multimodal approach. Mean and standard deviation RMSE values for each degree of freedom can be found online as Supplementary Table S2. Incorporating position data enhances the accuracy of computed kinematics, as reflected in both qualitative and quantitative analyses.

Table 2 Mean joint angle ($^{\circ }$), mean joint torque ($\hbox {Nm}$) RMSEs and mean residual force ($\hbox {N}$) and torque ($\hbox {Nm}$) RMSEs for both approaches in relation to the reference data.

Full size table

Next, we analyzed the mean difference between the joint angle RMSE values for both approaches to determine whether the addition of position data significantly enhances the computed kinematics. Normality was tested using the Shapiro–Wilk test and was not violated in any of the 6 groups ($p \, > \,0.05$ for all). The calculated paired t-tests showed significant differences for both groups and for all levels of error. Figure 3 shows the ${95}{\%}$ CIs for the mean differences between both approaches. For all orentation-based error types and levels of errors, the multimodal approach resulted in significantly smaller RMSEs as indicated by the negative values of the ${95}{\%}$ CIs. The size of the mean differences was consistent ($\sim {5^{\circ }}\,\hbox {to}\,{6^{\circ }}$) across the errors and levels of errors. An overview of the statistical analysis, showing mean differences, upper and lower bounds of the ${95}{\%}$ CIs, and p-values obtained from the t-tests can be found online as Supplementary Table S3.

Figure 5 shows the joint angle RMSEs of the multimodal approach depending on the orientation errors and grouped by the size of the position errors. The orientation errors have a limited impact on joint angle RMSEs, as demonstrated by the parallel and non-intersecting lines in the RMSE curves for bias drift and gyroscope noise erros. Furthermore, the level offset resulting from position errors remains relatively consistent across all levels of orientation error. However, a clear offset can be seen for the different levels of position error.

Comparison of dynamic data

We analyzed the computed torques and residuals for the IMU-based and the multimodal inverse kinematics approach to determine whether the addition of single position references enhances the dynamic results of the IMU-based solutions. Figure 4 shows mean joint torques as well as standard deviations calculated for 81 analyses for all actuated degrees of freedoms for both approaches. In addition, mean residual forces and torques as well as their standard deviations are depicted. In general, computed joint torques correspond well to the reference values for both approaches. However, a better match can be observed for the multimodal solution. The standard deviation values are larger for the IMU-based solution. The mean residual torque values ranged between ${-63.94}\,\hbox {Nm}$ and ${95.19}\,\hbox {Nm}$ for the IMU-based solution and ${-22.29}\,\hbox {Nm}$ and ${25.34}\,\hbox {Nm}$ for the multimodal approach. Mean residual forces ranged between ${-127.03}\,\hbox {N}$ and ${183.66}\,\hbox {N}$ for the IMU-based solution and ${-60.01}\,\hbox {N}$ and ${78.40}\,\hbox {N}$ for the multimodal approach. Table 2 lists the mean joint torque RMSE values for both approaches with regard to the reference data. For the IMU-based solution, the mean RMSE was about five times higher compared to the multimodal approach. In addition, mean residual force and torque RMSE values were smaller for the multimodal approach compared to the IMU-based solution. Mean and standard deviation RMSE values for each degree of freedom can be found online as Supplementary Table S2. The mean residual force and torque RMSE values for the swing and stance phase, per coordinate component, and per approach are listed in Table 3. The multimodal approach resulted in consistently smaller values in both phases and all coordinates. Different pelvis marker weightings did not positively affect residual force and torque RMSEs. While increasing the weighting by a factor of 10 produced consistent results, further values of the scaling factor $\epsilon$ led to higher residual RMSE values. Exact values for all investigated pelvis marker weightings can be found online as Supplementary Table S5. Incorporating position data enhances the accuracy of computed dynamics, as reflected in both qualitative and quantitative analyses.

Table 3 Residual forces and torques RMSE values for both approaches during stance and swing phase of gait, for each coordinate component.

Full size table

We analyzed the mean difference between the joint torque RMSE values for both approaches to determine, whether the addition of position data significantly enhances the computed dynamics. Normality was tested using the Shapiro–Wilk test and was not violated in any of the 6 groups ($p \, > \,0.05$ for all) The calculated paired t-tests showed significant differences for all groups and for all levels of error. Figure 3 shows the ${95}{\%}$ CIs for the mean differences between both approaches. For all orientation-based error types and levels of errors, multimodal inverse kinematics based results resulted in significantly better dynamic results (compared to the reference data) as well as to smaller residuals. The mean difference was consistent ($15~\text {Nm}-20~\text {Nm}$) across all error sizes for both errors. An overview of the statistical analysis, showing mean differences, upper and lower bounds of the ${95}{\%}$ CIs, and p-values obtained from the t-tests can be found online as Supplementary Table S4.

Figure 5 shows the joint torque RMSEs of the multimodal approach depending on the orientation errors and grouped by the size of the position errors. The orientation errors have a limited impact on the joint torque RMSEs, as demonstrated by the parallel and non-intersecting lines in the RMSE curves for bias drift and gyroscope noise errors. Furthermore, the level offset resulting from position errors remains relatively consistent across all levels of orientation error. However, a clear offset can be seen for different levels of position error.

Discussion

In this simulation study, we introduced a multimodal inverse kinematics approach to improve IMU-based motion data by incorporating single position reference data. We performed a sensitivity analysis using synthetic data to assess how much error in IMU-derived joint angle data can be offset by position data with varying accuracy levels. Two types of errors, each with three levels of severity, were introduced into the position and orientation data. In total, 81 pairs of input files and analyses were conducted. Across all error types and magnitudes, the multimodal inverse approach produced notably more accurate results compared to the reference data. For both kinematic (joint angle) and dynamic (joint torque and residuals) analysis results, this approach led to a substantial reduction in RMSEs. Specifically, the mean joint angle RMSE decreased by ${63}{\%}$, the mean joint torque RMSE decreased by ${80}{\%}$, the mean residual force RMSE was reduced by ${25}{\%}$, and the mean residual torque RMSE was reduced by ${70}{\%}$.

While our proposed multimodal approach yielded significantly enhanced kinematic and dynamic analysis results, it is important to acknowledge its limitations. One primary limitation of this simulation study is the reliance on synthetic data to evaluate the performance of the multimodal inverse kinematic approach. We have tried to create IMU-based data as realistically as possible by adding various artificial errors to the data. However, we cannot guarantee that the data will be equivalent to real-life sensor data. The influence of corrupted magnetic heading has been implemented by adding a yaw drift error to each virtual sensor. Further magnetic disturbances (e.g. variable misalignment) have not been implemented. Errors stemming from magnetic disturbance are not definitive and therefore difficult to model correctly and cohesively. This remains a limitation of this simulation study. As a result, we cannot yet draw definitive conclusions about the method’s efficacy when applied to real-world measurement data, or about its generalizability. The decision to use virtual sensor data was driven by the main objective of this study: to determine whether single position references can improve kinematic and dynamic analysis outcomes compared with IMU-based results. Additionally, we sought to assess the required accuracy level of spatial data to enhance IMU-based analyses. Our findings indicate that even the least accurate positional data (with errors up to ${3}\,\hbox {cm}$) substantially improve analysis results. Typical IMU errors, such as offset from calibration issues or joint angle drift, were consistently mitigated. This error compensation is notably evident in degrees of freedom with limited ranges of motion, such as hip adduction and hip rotation, as demonstrated in Fig. 2. Specifically, for hip adduction, an offset error correction is particularly pronounced. This error reduction leads to markedly improved dynamic results, as the discrepancies between the reference curve and the IMU-based curve are negligible in the multimodal results. Peak deviations are effectively compensated, as illustrated in Fig. 4.

Further, it is important to note that, despite improvements in dynamic results, residual forces and torques remain evident. In the multimodal approach, these residual forces and torques range between $\sim \pm {70}\,\hbox {N}$ and $\sim {20}\,\hbox {Nm}$, respectively. Although the residual forces fall below the recommended threshold of ${5}{\%}$ of the net external forces, the residual torques exceed the suggested limit of ${1}{\%}$ of the net external torques²⁰. Adjustments made to the spatial reference weighting of the base segment (the pelvis) did not result in a reduction of residual forces and torques. Subsequent analysis of the residual forces and torques per gait phase (stance and swing) demonstrate consistent values. As apparent from the coordinate components, the x-y-coordinate values are generally higher than the z-component values, with the exception of residual forces for the IMU-based approach in the swing phase. However, the primary aim of this study was not to introduce a novel method to minimize residuals, but rather to explore whether incorporating multimodal data in an inverse kinematics approach enhances biomechanical analysis outcomes. Both qualitative and quantitative analyses of the dynamic analysis results demonstrated a consistent improvement over IMU-based results due to the inclusion of spatial reference data. Residual forces and torques reflect various errors in the inverse analysis approach, such as model uncertainties or measurement inaccuracies. Minimizing the kinematic error as much as possible is advantageous for reducing overall residuals, as it allows other error sources to be addressed through alternative methods.

One key benefit of our approach is that it does not impose any restrictions on the movement to be analyzed or the dimensionality of the model used. Inverse kinematics methods have already been used to analyze a wide variety of motions using three-dimensional models: e.g. (deep) squat^38,39, upper body reaching motions³¹, side cut^40,41 or jump lunge⁴⁰. This distinguishes our inverse kinematics-based approach from trajectory optimization approaches used to process IMU-based data without drift, which have thus far been limited to analyzing two-dimensional gait or running motions. Although gait data was also analyzed in this study, three-dimensional motion data was used. In addition, given that inverse kinematics–based analyses have been successfully applied to a variety of captured movements, we anticipate that our method will generalize well to other motion types. Future work will focus on evaluating the performance of the proposed approach using real-world measurement data across different movement tasks.

Another key advantage of the proposed multimodal inverse kinematics approach is that it is largely independent of the measurement system used. While the method requires specific input modalities – namely, position and orientation data – it is not limited to a particular technology for acquiring these inputs. Recent advances in motion capture have introduced various techniques, including those based on RGB cameras^11,12,13, depth cameras^14,15, and radar^18,19. Our findings show that input data with relatively low spatial accuracy can still lead to significant improvements in biomechanical analysis outcomes. Consequently, the approach is compatible with a broad range of sensing technologies, highlighting its versatility and practical application.

Conclusion

In this simulation study using synthetic data, we demonstrated that incorporating spatial information into IMU-based motion data improves both kinematic and dynamic analysis. Future research will evaluate the effectiveness of the multimodal inverse kinematics approach when applied to real-world measurement data of various motions. We plan to use radar-based position data to enhance orientation-based motion capture data using the proposed multimodal approach. In addition, we intend to investigate whether combining a forward dynamic simulation approach with multimodal motion data can further enhance simulation accuracy and reduce residuals more effectively. Future research could also focus on integrating multimodal motion data with comprehensive model individualization methods to help bridge the gap between simulation results and reality.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Capin, J. J. et al. Gait Mechanics After ACL Reconstruction Differ According to Medial Meniscal Treatment. J. Bone Joint Surg. 100, 1209–1216. https://doi.org/10.2106/JBJS.17.01014 (2018).
Article PubMed Google Scholar
Wellsandt, E., Khandha, A., Capin, J., Buchanan, T. S. & Snyder-Mackler, L. Operative and nonoperative management of anterior cruciate ligament injury: Differences in gait biomechanics at 5 years. J. Orthop. Res. 38, 2675–2684. https://doi.org/10.1002/jor.24652 (2020).
Article PubMed PubMed Central Google Scholar
Van Der Have, A., Van Rossom, S. & Jonkers, I. Musculoskeletal-Modeling-Based, Full-Body Load-Assessment Tool for Ergonomists (MATE): Method Development and Proof of Concept Case Studies. Int. J. Environ. Res. Public Health 20, 1507. https://doi.org/10.3390/ijerph20021507 (2023).
Article PubMed PubMed Central Google Scholar
Rasmussen, J. et al. Performance optimization by musculoskeletal simulation. Movement Sport Sci. 73–83. https://doi.org/10.1051/sm/2011122 (2012).
Al Borno, M. et al. OpenSense: An open-source toolbox for inertial-measurement-unit-based measurement of lower extremity kinematics over long durations. J. Neuroeng. Rehabil. 19, 22. https://doi.org/10.1186/s12984-022-01001-x (2022).
Article PubMed PubMed Central Google Scholar
Roetenberg, D., Luinge, H. J. & Slycke, P. Xsens MVN: Full 6DOF human motion tracking using miniature inertial sensors. http://www.xsens.com (2013).
Sy, L. W., Lovell, N. H. & Redmond, S. J. Estimating Lower Limb Kinematics Using a Lie Group Constrained Extended Kalman Filter with a Reduced Wearable IMU Count and Distance Measurements. Sensors 20, 6829. https://doi.org/10.3390/s20236829 (2020).
Article ADS PubMed PubMed Central Google Scholar
Nazarahari, M. & Rouhani, H. A Full-State Robust Extended Kalman Filter for Orientation Tracking During Long-Duration Dynamic Tasks Using Magnetic and Inertial Measurement Units. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 1280–1289. https://doi.org/10.1109/TNSRE.2021.3093006 (2021).
Article PubMed Google Scholar
Riek, P. M., Best, A. N. & Wu, A. R. Validation of Inertial Sensors to Evaluate Gait Stability. Sensors 23, 1547. https://doi.org/10.3390/s23031547 (2023).
Article ADS PubMed PubMed Central Google Scholar
Parker, S. M., Crenshaw, J., Hunt, N. H., Burcal, C. & Knarr, B. A. Outdoor walking exhibits peak ankle and knee flexion differences compared to fixed and adaptive-speed treadmills in older adults. Biomed. Eng. Online 20, 104. https://doi.org/10.1186/s12938-021-00941-0 (2021).
Article PubMed PubMed Central Google Scholar
Uhlrich, S. D. et al. OpenCap: Human movement dynamics from smartphone videos. PLoS Comput. Biol. 19, e1011462. https://doi.org/10.1371/journal.pcbi.1011462 (2023).
Article CAS PubMed PubMed Central Google Scholar
Mehta, D. et al. XNect: Real-time multi-person 3D motion capture with a single RGB camera. ACM Trans. Graph. 39. https://doi.org/10.1145/3386569.3392410 (2020).
Regazzoni, D., De Vecchi, G. & Rizzi, C. RGB cams vs RGB-D sensors: Low cost motion capture technologies performances and limitations. J. Manuf. Syst. 33, 719–728. https://doi.org/10.1016/j.jmsy.2014.07.011 (2014).
Article Google Scholar
Chatzitofis, A., Zarpalas, D., Kollias, S. & Daras, P. DeepMoCap: Deep Optical Motion Capture Using Multiple Depth Sensors and Retro-Reflectors. Sensors 19, 282. https://doi.org/10.3390/s19020282 (2019).
Article ADS PubMed PubMed Central Google Scholar
Wang, K., Zhang, G., Yang, J. & Bao, H. Dynamic human body reconstruction and motion tracking with low-cost depth cameras. Vis. Comput. 37, 603–618. https://doi.org/10.1007/s00371-020-01826-4 (2021).
Article Google Scholar
Zhang, L., Sturm, J., Cremers, D. & Lee, D. Real-time human motion tracking using multiple depth cameras. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2389–2395, https://doi.org/10.1109/IROS.2012.6385968 (IEEE, Vilamoura-Algarve, Portugal, 2012).
Bylow, E., Olsson, C. & Kahl, F. Robust Camera Tracking by Combining Color and Depth Measurements. In 2014 22nd International Conference on Pattern Recognition, 4038–4043, https://doi.org/10.1109/ICPR.2014.692 (IEEE, Stockholm, Sweden, 2014).
Bräunig, J. et al. A Radar-Based Concept for Simultaneous High-Resolution Imaging and Pixel-Wise Velocity Analysis for Tracking Human Motion. IEEE J. Microw. 4, 639–652. https://doi.org/10.1109/JMW.2024.3453570 (2024).
Article Google Scholar
Hu, S. et al. mmWave Radar for Sit-to-Stand Analysis: A Comparative Study with Wearables and Kinect. IEEE Trans. Biomed. Eng. 1–12. https://doi.org/10.1109/TBME.2025.3548092 (2025).
Hicks, J. L., Uchida, T. K., Seth, A., Rajagopal, A. & Delp, S. L. Is My Model Good Enough? Best Practices for Verification and Validation of Musculoskeletal Models and Simulations of Movement. J. Biomech. Eng. 137, 020905. https://doi.org/10.1115/1.4029304 (2015).
Article PubMed Google Scholar
Hatze, H. The fundamental problem of myoskeletal inverse dynamics and its implications. J. Biomech. 35, 109–15. https://doi.org/10.1016/S0021-9290(01)00158-0 (2002).
Article PubMed Google Scholar
Wechsler, I. et al. Bridging the sim2real gap. Investigating deviations between experimental motion measurements and musculoskeletal simulation results–a systematic review. Front. Bioeng. Biotechnol. 12, 1386874. https://doi.org/10.3389/fbioe.2024.1386874 (2024).
Article PubMed PubMed Central Google Scholar
Pearl, O., Shin, S., Godura, A., Bergbreiter, S. & Halilaj, E. Fusion of video and inertial sensing data via dynamic optimization of a biomechanical model. J. Biomech. 155, 111617. https://doi.org/10.1016/j.jbiomech.2023.111617 (2023).
Article PubMed Google Scholar
Dorschky, E., Nitschke, M., Seifer, A.-K., van den Bogert, A. J. & Eskofier, B. M. Estimation of gait kinematics and kinetics from inertial sensor data using optimal control of musculoskeletal models. J. Biomech. 95, 109278. https://doi.org/10.1016/j.jbiomech.2019.07.022 (2019).
Article PubMed Google Scholar
Dorschky, E. et al. Comparing sparse inertial sensor setups for sagittal-plane walking and running reconstructions. Front. Bioeng. Biotechnol. 13, 1507162. https://doi.org/10.3389/fbioe.2025.1507162 (2025).
Article PubMed PubMed Central Google Scholar
Mallat, R. et al. Sparse Visual-Inertial Measurement Units Placement for Gait Kinematics Assessment. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 1300–1311. https://doi.org/10.1109/TNSRE.2021.3089873 (2021).
Article PubMed Google Scholar
Atrsaei, A., Salarieh, H. & Alasty, A. Human Arm Motion Tracking by Orientation-Based Fusion of Inertial Sensors and Kinect Using Unscented Kalman Filter. J. Biomech. Eng. 138, 091005. https://doi.org/10.1115/1.4034170 (2016).
Article Google Scholar
Joukov, V. et al. Human motion estimation on Lie groups using IMU measurements. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 1965–1972, https://doi.org/10.1109/IROS.2017.8206016 (IEEE, Vancouver, BC, 2017).
Potter, M. V. et al. Error-state Kalman filter for lower-limb kinematic estimation: Evaluation on a 3-body model. PLoS ONE 16, 0249577. https://doi.org/10.1371/journal.pone.0249577 (2021).
Article CAS Google Scholar
Xing, H., Hou, B., Lin, Z. & Guo, M. Modeling and Compensation of Random Drift of MEMS Gyroscopes Based on Least Squares Support Vector Machine Optimized by Chaotic Particle Swarm Optimization. Sensors 17, 2335. https://doi.org/10.3390/s17102335 (2017).
Article ADS PubMed PubMed Central Google Scholar
Wechsler, I. et al. Method for Using IMU-Based Experimental Motion Data in BVH Format for Musculoskeletal Simulations via OpenSim. Sensors 23, 5423. https://doi.org/10.3390/s23125423 (2023).
Article ADS PubMed PubMed Central Google Scholar
Seth, A. et al. OpenSim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement. PLoS Comput. Biol. 14, e1006223. https://doi.org/10.1371/journal.pcbi.1006223 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rajagopal, A. et al. Full-Body Musculoskeletal Model for Muscle-Driven Simulation of Human Gait. IEEE Trans. Biomed. Eng. 63, 2068–2079. https://doi.org/10.1109/TBME.2016.2586891 (2016).
Article PubMed PubMed Central Google Scholar
NaturalPoint, I. OptiTrack Motion Capture System.https://www.optitrack.com/ (2025).
Leardini, A. et al. A new anatomically based protocol for gait analysis in children. Gait Posture 26, 560–571. https://doi.org/10.1016/j.gaitpost.2006.12.018 (2007).
Article PubMed Google Scholar
Balbach, S. et al. A Miniaturized Flexible Surface Electromyography Sensor With an Integrated Localization Concept. IEEE Microwave Mag. 26, 47–59. https://doi.org/10.1109/MMM.2024.3494717 (2025).
Article Google Scholar
InvenSense, T. ICM-20689 Datasheet (2021).
Lu, Y. et al. A Comparative Study on Loadings of the Lower Extremity during Deep Squat in Asian and Caucasian Individuals via OpenSim Musculoskeletal Modelling. Biomed. Res. Int. 2020, 7531719. https://doi.org/10.1155/2020/7531719 (2020).
Article Google Scholar
Schellenberg, F. et al. Evaluation of the accuracy of musculoskeletal simulation during squats by means of instrumented knee prostheses. Med. Eng. Phys. 61, 95–99. https://doi.org/10.1016/j.medengphy.2018.09.004 (2018).
Article PubMed Google Scholar
Smale, K. B., Potvin, B. M., Shourijeh, M. S. & Benoit, D. L. Knee joint kinematics and kinetics during the hop and cut after soft tissue artifact suppression: Time to reconsider ACL injury mechanisms?. J. Biomech. 62, 132–139. https://doi.org/10.1016/j.jbiomech.2017.06.049 (2017).
Article PubMed Google Scholar
Smale, K. B. et al. Effect of implementing magnetic resonance imaging for patient-specific OpenSim models on lower-body kinematics and knee ligament lengths. J. Biomech. 83, 9–15. https://doi.org/10.1016/j.jbiomech.2018.11.016 (2019).
Article PubMed Google Scholar

Download references

Acknowledgements

This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – SFB 1483 – Project-ID 442419336, EmpkinS.

Funding

Open Access funding enabled and organized by Projekt DEAL. This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – SFB 1483 – Project-ID 442419336, EmpkinS.

Author information

Authors and Affiliations

Engineering Design, Department of Mechanical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
Iris Wechsler, Julian Shanbhag, Sandro Wartzack & Jörg Miehling
Institute of Microwaves and Photonics, Department of Electrical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
Niklas Schlechtweg & Martin Vossiek
Chair of Autonomous Systems and Mechatronics, Department of Electrical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91052, Erlangen, Germany
Anne D. Koelewijn

Authors

Iris Wechsler
View author publications
Search author on:PubMed Google Scholar
Julian Shanbhag
View author publications
Search author on:PubMed Google Scholar
Niklas Schlechtweg
View author publications
Search author on:PubMed Google Scholar
Martin Vossiek
View author publications
Search author on:PubMed Google Scholar
Anne D. Koelewijn
View author publications
Search author on:PubMed Google Scholar
Sandro Wartzack
View author publications
Search author on:PubMed Google Scholar
Jörg Miehling
View author publications
Search author on:PubMed Google Scholar

Contributions

I.W.: Conceptualization, Methodology, Software, Formal analysis, Investigation, Writing - Original Draft, Visualization; J.S.: Methodology, Formal analysis, Writing - Review & Editing; N.S.: Investigation, Writing - Review & Editing; M.V.: Supervision, Writing - Review & Editing; S.W.: Supervision, Writing - Review & Editing; A.K.: Supervision, Writing - Review & Editing; J.M.: Conceptualization, Methodology, Writing - Review & Editing, Supervision, Funding acquisition

Corresponding author

Correspondence to Iris Wechsler.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wechsler, I., Shanbhag, J., Schlechtweg, N. et al. Multimodal inverse kinematics significantly improves IMU-based biomechanical analyses. Sci Rep 15, 44420 (2025). https://doi.org/10.1038/s41598-025-33021-7

Download citation

Received: 08 July 2025
Accepted: 15 December 2025
Published: 23 December 2025
Version of record: 24 December 2025
DOI: https://doi.org/10.1038/s41598-025-33021-7

Subjects

Abstract

Similar content being viewed by others

Multibody kinematics optimization for motion reconstruction of the human upper extremity using potential field method

Multimodal video and IMU kinematic dataset on daily life activities using affordable devices

A frame orientation optimisation method for consistent interpretation of kinematic signals

Introduction

Methods

Model description

Experimental data collection and data pre-processing

Creation of reference data

Creation of synthetic data

Creation of IMU-based motion data

Multimodal inverse kinematics

Statistical analysis

Results

Comparison of kinematic data

Comparison of dynamic data

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links