Validation and application of computer vision algorithms for video-based tremor analysis

Friedrich, Maximilian U.; Roenn, Anna-Julia; Palmisano, Chiara; Alty, Jane; Paschen, Steffen; Deuschl, Guenther; Ip, Chi Wang; Volkmann, Jens; Muthuraman, Muthuraman; Peach, Robert; Reich, Martin M.

doi:10.1038/s41746-024-01153-1

Download PDF

Article
Open access
Published: 21 June 2024

Validation and application of computer vision algorithms for video-based tremor analysis

npj Digital Medicine volume 7, Article number: 165 (2024) Cite this article

6759 Accesses
21 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Tremor is one of the most common neurological symptoms. Its clinical and neurobiological complexity necessitates novel approaches for granular phenotyping. Instrumented neurophysiological analyses have proven useful, but are highly resource-intensive and lack broad accessibility. In contrast, bedside scores are simple to administer, but lack the granularity to capture subtle but relevant tremor features. We utilise the open-source computer vision pose tracking algorithm Mediapipe to track hands in clinical video recordings and use the resulting time series to compute canonical tremor features. This approach is compared to marker-based 3D motion capture, wrist-worn accelerometry, clinical scoring and a second, specifically trained tremor-specific algorithm in two independent clinical cohorts. These cohorts consisted of 66 patients diagnosed with essential tremor, assessed in different task conditions and states of deep brain stimulation therapy. We find that Mediapipe-derived tremor metrics exhibit high convergent clinical validity to scores (Spearman’s ρ = 0.55–0.86, p≤ .01) as well as an accuracy of up to 2.60 mm (95% CI [−3.13, 8.23]) and ≤0.21 Hz (95% CI [−0.05, 0.46]) for tremor amplitude and frequency measurements, matching gold-standard equipment. Mediapipe, but not the disease-specific algorithm, was capable of analysing videos involving complex configurational changes of the hands. Moreover, it enabled the extraction of tremor features with diagnostic and prognostic relevance, a dimension which conventional tremor scores were unable to provide. Collectively, this demonstrates that current computer vision algorithms can be transformed into an accurate and highly accessible tool for video-based tremor analysis, yielding comparable results to gold standard tremor recordings.

Validity of tremor analysis using smartphone compatible computer vision frameworks

Article Open access 18 April 2025

Essential tremor amplitude modulation by median nerve stimulation

Article Open access 06 September 2021

EMD-based data augmentation method applied to handwriting data for the diagnosis of Essential Tremor using LSTM networks

Article Open access 27 July 2022

Introduction

Tremor syndromes are among the most common neurological disorders. Of these, essential tremor affects up to 4.6% of the global population ≥65 years old¹. This disorder is characterised by a mixture of postural and kinetic tremors, which likely represent diverse facets of pathological oscillations in brain motor networks^2,3,4. Tremor is often accompanied by additional neurological signs such as dystonia or ataxia. As such, tremor is also a common symptom in a range of acquired and genetic neurological disorders, posing a significant diagnostic challenge in clinical neurology. This translates into high rates of misdiagnosed tremor disorders⁵, which has profound therapeutic implications in particular for deep brain stimulation (DBS), a potent neural circuit therapy for tremor disorders. DBS outcomes largely hinge on accurate patient selection, which itself is influenced by accurate tremor assessment⁶. The complexity of tremor syndromes has been a roadblock to pathogenetic and diagnostic research, which culminated in a call to redefine tremor classification through quantitative phenotyping⁴.

To this end, instrumented tremor analysis offers an unbiased and detailed assessment of key tremor features, such as frequency and amplitude, which are crucial for phenotyping^7,8, therapeutic monitoring^9,10, differential diagnosis^{11,12,13,14,15} and closed-loop neuromodulation¹⁶. 3D motion capture methods enable comprehensive characterisation of both tremor and associated movement abnormalities (reviewed in ref. ¹⁷). However, the reliance on these complex and resource-intensive methods restricts their practical use, especially in routine clinical settings.

In contemporary practice, the complex phenomenology of tremor syndromes is therefore condensed into low dimensional, ordinal rating scales. These scales represent tremor items in a non-linear, logarithmic manner^18,19 and, despite their simplicity, suffer from considerable clinimetric limitations. One of these limitations is interrater reliability, reported to be as low as 0.1 (Cohen’s kappa)^{18,20,21,22,23}.

While mobile technologies, such as smartphone accelerometers, have emerged as promising tools for tremor frequency assessment^{7,24,25,26,27}, they have critical limitations, such as their reliance on calibration, sensor weight and placement⁸. Additionally, they cannot readily measure associated neurological signs.

Novel computer vision (CV) methods for marker-less pose tracking have been developed for consumer applications but are increasingly adapted in movement sciences²⁴^,^{28,29,30,31,32,33,34}. Pilot studies have shown the feasibility of CV-based measurement of neurological motor symptoms^29,33,35 and specifically, tremor frequency^{23,31,36,37,38}. However, CV-based measurement of tremor amplitude remains unexplored, despite it being the key kinematic determinant of patient life quality³⁹.

A key challenge for pose tracking algorithms is a generalisation to clinical contexts, where medical equipment interferes with body landmark detection and disease-related alterations of movement and posture deviate from their training data^28,29,30^,40. Tools like DeepLabCut^23,29,41 enable supervised fine-tuning of pose-tracking algorithms with task-specific data, but the training can introduce biases that lead to overfitting. Finally, consumer CV algorithms are evaluated with static metrics (i.e., Euclidean distances in single frames) that are largely unrelated to the clinical quanta of interest (e.g. frequencies, amplitudes)^28,42. At present, there is a critical lack of rigorous validation of CV algorithms against clinical gold standard methods and application in larger neurological patient populations^29,31.

To address these challenges, we repurpose Mediapipe, an open-source pose tracking algorithm, for comprehensive tremor analysis. We evaluate its capability to track hands in clinical standard videos of postural and kinetic tremor assessments and use the resulting time series data to compute both fundamental and advanced tremor features. We benchmark this CV framework against gold standard methods in a cohort of patients diagnosed with essential tremor. Subsequently, we apply it to an independent, retrospective dataset of unstandardised, real-world videos from two clinical sites, examining its convergent clinical validity and capability to characterise therapeutic effects of deep brain stimulation on tremor. We assess the framework’s utility to inform diagnostic and prognostic challenges in two clinical use case scenarios. Finally, we explore how different CV architectures impact the performance of tremor analysis by comparing Mediapipe to a pose-tracking algorithm specifically fine-tuned for tremor analysis.

Results

Validation of the computer vision framework: tremor amplitudes

To assess the CV framework’s technical and clinical validity, we first applied it to video data from a prospectively recruited cohort of patients with a diagnosis of essential tremor and treated with thalamic DBS. Ground truth values of tremor amplitudes and frequencies were determined using laboratory gold standard technologies: marker-based 3D motion capture and simultaneous wrist-mounted accelerometery.

CV-derived peak postural tremor amplitudes showed a strong correlation with respective clinical scores, similarly to gold standard motion capture (MP: ρ > 0.86, MC: ρ = 0.90, p < 0.001, Fig. 1a, b). Excellent agreement of computer vision was found with motion capture (ρ = 0.89, p < 0.001, Fig. 1c). In comparison to motion capture, computer vision had a mean absolute error of 10 mm (95% CI [5.65, 14.4]). No systematic relationship between measurement and error magnitudes was observed (Fig. 1d). Computer vision-derived tremor amplitudes fell within equivalence boundaries of motion capture tracking (±10 mm, Supplementary Fig. 1a) and were comparably responsive to DBS effect (d > 0.94, all p < 0.001, Fig. 1e), overall suggestive of equivalent accuracy. Median precision, measured by the standard deviation of each amplitude measurement, was 1.29 mm for motion capture and 0.54 mm for Mediapipe. Precision values reached equivalence to motion capture within gold-standard derived boundaries of ±3.63 mm (Supplementary Fig. 1b). Reducing the 90% CI margins to ±1.5 and ±1.0 mm did not substantially change these results, indicating robustness beyond the defined boundaries.

Mediapipe’s peak kinetic tremor amplitude estimates were strongly correlated to the clinical scores, again comparable to motion capture derived values (ρ = 0.55, p < 0.01, Fig. 1f, g). Mediapipe reached substantial agreement with motion capture (ρ = 0.72, p < 0.001, Fig. 1h). Mean absolute error was −2.60 mm (95% CI [−3.13, 8.23], Fig. 1i). Mediapipe’s accuracy in kinetic tremor amplitude measurement was equivalent to motion capture (Supplementary Fig. 1a). Mediapipe and motion capture were again comparably responsive to DBS effects on kinetic tremor amplitude (d = 0.69 and 0.60, Fig. 1j). Median precision of kinetic tremor amplitude measurement was calculated to be 0.31 mm for motion capture and 0.49 mm for Mediapipe. Mediapipe’s precision fell within the equivalence boundaries of ±2.1 mm (Supplementary Fig. 1c). Repeating the equivalence tests with empirically reduced 90% CI margins of ±1.5 and ±1.0 mm did not substantially change these results. Notably, the aforementioned results were similar when using mean instead of peak amplitude measurements (Supplementary Figs. 2 and 3).

Validation of the computer vision framework: tremor frequencies

Computer vision-derived tremor frequency measurements were validated against wrist-worn accelerometery, a clinical and laboratory gold standard for tremor analysis. The correspondence of tremor frequencies from Mediapipe and motion capture to accelerometery was found to be similarly strong (r > 0.40, Fig. 2a). The mean dominant frequency of postural tremor was measured to be 5.7 ± 0.72 Hz with accelerometery, 6.04 ± 0.65 Hz with motion capture and 5.9 ± 0.58 Hz with Mediapipe, resulting in mean absolute errors of −0.34 Hz [95% CI −0.08, 0.60] for motion capture and −0.21 Hz [95% CI −0.05, 0.46] for Mediapipe (Fig. 2b).

Within the predefined margins of ±0.5 Hz, Mediapipe-derived frequency measurements achieved equivalent accuracy to accelerometery, while motion capture exceeded the equivalence bounds (Fig. 2c and Supplementary Fig. 1d). Median precision of tremor frequency measurements was 0.58 Hz for accelerometery, 1.15 Hz for motion capture and 1.12 Hz for Mediapipe. Precision values from motion capture and Mediapipe were equivalent to accelerometer within gold standard derived margins of ±2 Hz (Supplementary Fig. 1e). Again, reducing the 90% CI margins to ±1.5 and ±1.0 Hz did not substantially alter these results.

Both motion capture and Mediapipe-derived kinetic tremor frequencies demonstrated moderate agreement with respective accelerometric measurements (motion capture: ρ = 0.38, p = 0.034; Mediapipe: ρ = 0.37, p = 0.033, Fig. 2d). The mean dominant frequency of kinetic tremor was 5.25 ± 1.06 Hz using accelerometery, 5.48 ± 0.41 Hz using motion capture and 5.31 ± 0.34 Hz using Mediapipe, with mean absolute errors of 0.22 Hz (95% CI [−0.15, 0.59]) for motion capture and 0.06 Hz (95% CI [−0.30, 0.41]) for Mediapipe. Bland-Altman plots for motion capture and Mediapipe suggested a systematic relationship between error and measurement magnitudes (Fig. 2e).

Within the predefined boundaries of ±0.5 Hz, Mediapipe’s accuracy in frequency measurements was equivalent to accelerometery (Fig. 2f and Supplementary Fig. 1d). In contrast, motion capture’s accuracy was significantly lower than Mediapipe (T(31) = 2.98, 95% CI of difference [0.05, 0.28], p = 0.006). Median precision was 0.58 Hz for accelerometry, 0.14 Hz for motion capture and 0.1 Hz for Mediapipe. Both motion capture and Mediapipe precision values fell within equivalence boundaries derived from the minimal precision achieved by accelerometery, ±2 Hz (Supplementary Fig. 1f). Reducing the margins to ±1.5 and ±1.25 Hz in equivalence tests did not substantially alter these results.

Retrospective application: postural tremor

In order to clinically validate the CV framework in an independent sample, we applied it to clinical videos of 43 individuals undergoing clinical tremor assessment before and after thalamic DBS implantation. Peak postural tremor amplitudes derived from Mediapipe were strongly correlated with the corresponding tremor scores (Fig. 3a). Wilcoxon testing further revealed that the CV framework’s peak amplitude measurements were highly responsive to the effect of DBS, as were scores (Fig. 3b, c). Repeating the analyses using mean instead of peak tremor amplitudes yielded similar results with respect to score correlation (Supplementary Fig. 4). Mean dominant frequency of postural tremor was calculated to be 5.96 ± 0.76 Hz.

**Fig. 3: Application of computer vision tremor analysis in an independent, retrospective cohort.**

Retrospective application: kinetic tremor

In the 25 available individuals, a moderate correlation was found between the measured peak amplitudes and the corresponding tremor scores (Fig. 3d). Wilcoxon testing revealed that peak kinetic tremor amplitude measurements were highly sensitive to the DBS effect (Fig. 3e, f). Repeating the analyses using mean instead of peak amplitudes yielded similar results (Supplementary Fig. 3). The mean dominant frequency of kinetic tremor was calculated to be 5.75 ± 0.58 Hz.

Clinical use cases: diagnostic features

Instrumented tremor analysis can provide valuable differential diagnostic clues for tremor syndromes. Beyond the basic tremor characteristics like amplitude and frequency, advanced features such as harmonics or inter-limb tremor coherence have previously been established to support differential diagnosis of tremor syndromes¹⁰. To this end, we investigated whether the CV framework is capable of extracting advanced diagnostic tremor features, which usually require electromyography or other sensors.

Indeed, the Mediapipe-derived tremor signal displayed a harmonic peak which was located at twice the mean dominant frequency (Fig. 4a, b), a feature previously reported to differentiate essential from parkinsonian tremor⁴³. Moreover, no significant inter-limb tremor coherence was detected, a feature reported to discern essential tremor from orthostatic tremor^10,44 (Fig. 4c, d).

**Fig. 4: Using the CV framework to augment diagnostic insight.**

Clinical use cases: predictive modelling

Albeit efficacious in the majority of cases, thalamic DBS outcomes vary⁹. Lack of tremor improvement or even paradoxical increases in kinetic tremor amplitude signify a poor DBS outcome⁶. Patient-specific factors such as baseline clinical tremor scores have been shown to aid DBS outcome prognostication across tremor disorders⁴⁵, which facilitates patient counselling.

Therefore, we aimed to assess the utility of computer vision-derived metrics in predicting DBS outcomes from preoperative kinematics and clinical score information. First, we found that kinetic tremor was markedly less strongly modulated by DBS than postural tremor (Fig. 5a). Since persisting kinetic tremor is a key driver of functional disability in essential tremor and among the main reasons for failed DBS interventions^46,47, we binarized our patient cohort into good and poor responders based on post-operative tremor amplitudes. We chose a threshold of ≥2 cm residual tremor amplitude and ≤30% relative tremor reduction in DBS ON, so as to identify cases with clinically relevant disability^6,46,47. Applying this threshold, we found that kinetic tremor was significantly more frequently associated with a poor outcome (55% vs. 21% fraction of poor responders, p < 0.001, Fig. 5b).

**Fig. 5: Using the CV framework for DBS effect quantification and prognostication.**

To identify determinants of suboptimal DBS outcomes, which might assist in preoperative patient counselling, we conducted a logistic regression analysis. Using binarized response group as the outcome variable and preoperative limb kinematic features as covariates, we detected a strong and significant association of preoperative tremor measurements to DBS outcomes (χ² = 58.4, p < 0.001, McFadden R² = 0.65). Among all covariates, baseline kinetic tremor amplitude emerged as a significant and independent predictor of DBS response (p = 0.002, OR 0.89, 95% CI [0.82, 0.96]). Implementing a rigorous leave-one-out cross-validation to evaluate the model’s performance yielded an area under the receiver operator curve of 0.88 and a F1-score of 0.89 (Fig. 5c). Moreover, baseline kinetic tremor amplitude emerged as an independent predictor of DBS-associated improvement of kinetic tremor amplitude in a linear regression model (R² = 0.18, p < 0.001; baseline kinetic tremor: p = 0.021, Fig. 5d). Of note, preoperative tremor scores were neither a significant predictor of binary outcome nor tremor amplitude change. For additional exploration of clinical and demographic features across cohorts, please see supplementary results.

Assessment of a disease-specific convolutional network: DLC-RCNN

The performance of pose tracking algorithms is crucially influenced by task and visual context, especially in clinical settings^28,29,35. To gauge this effect’s relevance in the context of tremor, we additionally developed a tremor-specific residual convolutional neural network using DeepLabCut⁴¹ (DLC-RCNN). This network was trained with >120,000 frames of clinical video material. Final performance evaluation showed a median Euclidean distance of 3.56 mm and 10.74 mm between user-annotated and predicted keypoints, demonstrating acceptable generalisation and tracking accuracy related to fingertip size (occupying 10-20 pixels, corresponding to 10–20 mm on average^29,48). The model’s generalisation to an out-of-sample validation dataset (>15,000 frames) showed high confidence in predicting postural tremor keypoints (median likelihood of 0.99, 4884 predictions) but unacceptably low confidence for kinetic tremor keypoints (median likelihood of 0.22, 10,532 predictions, Supplementary Fig. 5). Therefore, DLC-RCNN could only be used for postural tremor analysis.

In the prospective cohort, DLC-RCNN-derived tremor amplitudes were strongly correlated to clinical scores (ρ = 0.92, p < 0.001) and gold standard motion capture (ρ = 0.88, p < 0.001, Fig. 6a, b). The mean absolute error was 2.55 mm (Fig. 6c). DLC-RCNN-derived mean dominant tremor frequencies were moderately correlated to accelerometer (ρ = 0.44, p < 0.05), with a mean absolute error of −0.69 Hz (Fig. 6d, e). In the retrospective cohort, DLC-RCNN-derived postural tremor amplitudes were moderately to strongly correlated with assigned clinical scores (ρ = 0.72, p = 0.001, Fig. 6f). DLC-RCNN’s accuracy and precision (0.66 mm) for amplitudes was equivalent to motion capture. The mean dominant frequency was calculated to be 6.38 ± 0.54 Hz, but the DLC-RCNN’s frequency accuracy was significantly lower than Mediapipe and motion capture, hence not equivalent.

**Fig. 6: Application of a disease-specific convolutional neural network for postural tremor analysis across cohorts.**

Discussion

Tremor disorders underscore the critical need for granular phenotyping in clinical management. While traditional instrumented methods provide valuable insights, their high resource demands significantly limit their widespread application in clinical settings. As a result, clinicians often rely on a more reductionist approach, employing semi-quantitative rating scales that bear considerable clinimetric limitations^18,20,21. Our study was aimed to address these challenges by comprehensively assessing the feasibility and robustness of computer vision methods enabling tremor analysis from standard clinical videos.

First, we found that the CV framework achieves comparable accuracy to specialised gold standard equipment in the measurement of both tremor amplitude and frequency. Second, we demonstrated its practical utility not only in characterising the effects of deep brain stimulation but also in providing valuable insights into diagnostic and prognostic challenges – aspects that conventional scores failed to capture. Finally, our study elucidated the impact of different algorithmic architectures on clinical pose tracking capability, providing a roadmap for future technical scalability.

The results of our prospective validation underline the framework’s accuracy, precision and clinical validity, which largely match gold standard equipment. While prior studies have tapped into the potential of computer vision for tremor detection^38,49 and frequency extraction²⁴, amplitude quantification remained largely unexplored. Yet, tremor amplitude is pivotal in assessing patient disability and therapeutic outcomes^6,9. Our findings indicate that smartphone videos, coupled with computer vision tracking tools, can gauge tremor amplitude with an accuracy of up to 2.6 mm, a value that falls on the low end of reported pose tracking accuracies^28,32 and that is almost an order of magnitude smaller than the lowest anchor value provided in the tremor rating scale (20 mm).

Compared to gold standard accelerometery, computer vision-derived tremor frequency measurements demonstrated a mean absolute error between −0.06 and −0.21 Hz, values falling well within, if not below modern vision-based frameworks²⁴. More generally, large scale studies investigating clinical pose tracking in other movement disorders^33,35 report moderate to high score correlation strengths in the range of 0.6–0.8, which corresponds closely to our reported values of 0.55–0.86. Overall, this is strongly indicative that the CV framework effectively captured the clinically relevant target information.

Notably, some correlation plots exhibit increasing residuals with higher scores, which is well in line with the notion of a logarithmic rather than linear relationship of tremor severity and ordinal scores¹⁸. Continuous digital biomarkers are not subject to such non-linearity, which often complicates both intra- and interindividual comparisons relevant for clinical studies and management.

Therefore, CV frameworks offer the potential to dramatically simplify tremor analysis by eliminating the need for multiple devices and sensors and even enabling the analysis of unstandardised legacy videos, underscoring their generalisability and versatility. The fully vision-based approach can be further scaled to additionally quantify tremor-associated neurological signs such as ataxia⁵⁰ or dystonia³⁵. This capability aligns with the central goals of future quantitative phenotyping efforts in tremor disorders⁴.

Next, we applied the CV framework in exemplary use cases that are directly inspired by clinical tremor management. Mediapipe was capable of extracting advanced diagnostic tremor features, which offer additional insights relevant for the differential diagnosis of tremor disorders^10,43,44,51. For example, a harmonic peak at twice the dominant tremor frequency or a lack of inter-limb tremor coherence can be diagnostic clues differentiating essential tremor from other tremor syndromes^10,43,44. While our study was not designed to facilitate comparisons across different tremor disorders, our results nonetheless demonstrate the feasibility of using the CV framework to derive diagnostically relevant tremor features, linking computer vision-derived biomarkers with sensor- or EMG-based findings reported in the neurophysiological literature^10,43,44.

Second, CV-derived features could aid in characterising thalamic neurostimulation outcomes. Our predictive model, focusing on kinetic tremor reduction as the key determinant of disability and life quality after DBS implantation^3,6,9,46,52, identified baseline kinetic tremor amplitude as a predictor of DBS outcome. Interestingly, conventional tremor scores lacked this predictive power, emphasising the advantages of sensitive and continuously encoded digital biomarkers in capturing such nuanced clinical relationships. This finding aligns with similar results for DBS outcome prediction based on scores in Parkinson’s disease⁴⁵ as well as emerging evidence for the added value of digital phenotyping in neurological disorders which reaches far beyond conventional scores^{29,33,35,53,54}.

While both CV architectures excelled at postural tremor tracking, their performance was reduced in kinetic tremor tracking. The tremor-specific DeepLabCut model entirely failed to track kinetic tremor, drastically reducing its versatility. We hypothesise that Mediapipe outperformed the disease-specific model due to its 3D pose tracking capability and high hand landmark coverage (21 landmarks), which is essential for tracking the complex configurational changes of the hands during the finger-to-nose test^28,55,56. Similar observations were reported for head tracking in the context of dystonia³⁵, while another study found that a disease-specific network trained with DeepLabCut outperformed Mediapipe in the tracking of abnormal eye movements²⁹. As the interactions of task, context and algorithm selection for clinical pose tracking are just beginning to be unravelled^28,55, future research is needed to explore the benefits of 3D tracking capabilities, task-specific algorithm customisation and model combination in different clinical scenarios.

Several limitations should be acknowledged. While effective across task conditions, Mediapipe hands was less robust during kinetic tremor assessments involving complex hand configurations. In addition, it does not track proximal arm landmarks, which could be of interest for future mechanistic investigations of DBS effects^57,58. In future clinical pose tracking studies, a combination of algorithms that synergise 3D and full-body tracking with task-specific customisation could offer a more comprehensive approach to tremor analysis across different body regions. Second, the CV framework was highly accurate, but our sequential recording strategy – adopted to minimise marker interference which could lead to overly optimistic tracking results – might introduce biological variance in tremor amplitude measurements⁵⁹. This approach is likely to underestimate rather than overestimate the framework’s accuracy, indicating that the technical agreement between the CV framework and gold standard might, in fact, be even higher. Lastly, while the CV framework was effective across both cohorts, the prospective cohort was limited in size and recruited from a single centre. In the future, larger multi-centric studies are needed to confirm the broader applicability and robustness of CV-based tremor analysis in varied clinical settings.

In conclusion, repurposing open-source pose tracking algorithms like Mediapipe enables tremor analysis from standard clinical video material with comparable accuracy to gold-standard methods and high convergent clinical validity. This approach enables the extraction of digital biomarkers for tremor diagnosis and prognosis and represents a rapidly scalable alternative to more resource-intensive and marker-based methods. Future work should focus on exploring hybrid approaches, combining different pose-tracking algorithms for a more comprehensive analysis of tremor across body regions and task conditions. We envision computer vision pose tracking as a pivotal tool to strengthen and democratise digital and precision medicine approaches in Neurology.

Methods

Ethics approval

This study was conducted in accordance with the Declaration of Helsinki and ethics approval was obtained from the Julius-Maximilians University Wuerzburg’s ethics committee (#283/14 and 163/14_MP). Patients provided written informed consent for all experimental procedures.

Study cohorts and design

The study consists of two independent phases with independent cohorts. This design was chosen to reflect best practices in machine learning, aiming to ensure validity, generalisability and reproducibility (Fig. 7). All patients had a diagnosis of essential tremor based on the Movement Disorder Society’s consensus criteria⁴ and active bilateral thalamic deep brain stimulation, programmed to individually optimal settings. All patients were refractory to anti-tremor medication (propranolol, primidone).

The retrospective cohort consisted of n = 58 patients (mean age at surgery 66.4 ± 9.84 years, 32 males, mean disease duration at surgery 30.4 ± 19.3 years). 14 patients underwent DBS surgery at Wuerzburg University Hospital between 2016 and 2017 and the remaining 44 patients at Kiel University Hospital between 2003 and 2015^13,60,61. The mean postoperative interval to videotaping was 25.8 ± 20.8 months. Participants were video recorded whilst seated in a clinic room with standard ambient lighting. All videos were collected using various consumer grade, handheld or tripod mounted cameras in the context of standard clinical care to document the severity of postural and kinetic tremor components. Spatiotemporal video resolution was at least 25 Hz and 320 ×238 pixels (px), respectively. The Fahn-Tolosa-Marin Tremor Rating Scale (FTM) was formally administered before DBS surgery and again during optimal DBS settings, by movement disorders specialists^13,60,61,62. Separate video segments showing the assessment of postural tremor (arms in front of chest, “wing-beating position”, fingertips facing each other but not touching) and kinetic tremor (minimum of three repetitions of finger-to-nose-test on both sides) were identified. Clinical score distributions are shown in Supplementary Fig. 4a.

In the retrospective cohort, video quality criteria were utilised to optimally balance standardisation, robustness and broad applicability. Videos were excluded for computer vision analysis if the hands left the frame and if excessive camera movements or zooming were present. These criteria reflect a practical consensus synthesised from previous work in computer vision for movement analysis^24,28,29,30 as well as exploratory pilot experiments preceding this study. Based on these criteria, n = 15 patients had to be excluded for postural and n = 33 patients for kinetic tremor tracking, leaving videos of n = 43 and n = 25 individuals for subsequent computer vision-based hand tracking in postural and kinetic conditions, respectively.

The prospective cohort consisted of n = 8 patients (mean age at surgery 66.6 ± 10.4 years, 4 males, mean disease duration at surgery 30.2 ± 19.5). All patients were recruited from the movement disorders clinics at the University Hospital Wuerzburg, department of Neurology, in 2021. The mean postoperative interval to the experiment was 53.6 ± 33.3 months.

Prospective experimental design

Experiments were conducted at the department of Neurology, University Hospital Wuerzburg. Participants underwent standardised assessment of postural (holding arms in front of chest, “wing-beating position”, fingertips facing each other but not touching, 3 blocks of 30 seconds) and kinetic tremor (15 repetitions of finger-to-nose pointing per side, each starting from a resting position of the laterally outstretched arm). Tremor assessments were recorded in a 2 × 2 block design with DBS (on/off) and method (video/motion capture and accelerometery) as intraindividual factors. Minimal DBS washout period after impulse generator deactivation was conservatively set to 45 minutes to exclude stimulation carry over effects^63,64. Experimental blocks were pseudorandomized to reduce systematic biases. Based on the video material, the corresponding items of the FTM tremor rating scale (postural and kinetic tremor amplitudes) were annotated by a clinician expert in movement disorders blinded to the experimental condition (MMR).

Motion capture and accelerometery setup

A six-camera optoelectronic motion capture system (SMART-DX, BTS, Italy) operating at a temporal resolution of 100 Hz was used to track retroreflective markers placed bilaterally on the upper limbs’ ulnar styloid, lateral epicondyle of the humerus and the acromion, as previously described¹⁴. Two additional markers were placed on the middle and index fingertips’ dorsal heads for the assessment of postural and kinetic tremor, respectively. The signals for postural and kinetic tremors were computed from the middle and index fingertips’ signals and exported for subsequent computation of tremor characteristics. During all recording sessions, two inertial measurement units (Opal, APDM, USA; dimensions: 48.5 × 36.5 × 13.5 mm; size: 22 g) were placed on the dorsum of both wrists. Tri-axial accelerometer data were used to measure tremor frequencies (sampling frequency: 128 Hz). To avoid potential interference of retroreflective markers with computer vision tracking, experiments were repeated without markers for sequential analysis, as previously described²⁹.

Video hand tracking setup

Participants were seated on a chair in front of a neutral background. Tremor assessment was videotaped using a standard smartphone camera (Samsung Galaxy S20, Samsung, Seoul, South Korea), operating at a spatiotemporal resolution of 1920×1080 px and 60 Hz. The camera was mounted on a standard tripod in landscape mode at a viewing distance of 3 metres to cover the full body of the participants centrally in the video frame throughout the recording time. To avoid obscuring anatomical landmarks, participants were asked to wear sleeveless tops exposing the shoulders and arms. Watches or other jewellery were removed or covered with tape to prevent any interference with the limb tracking, e.g., through aberrant reflections. For videos, pixel-to-metric conversion was derived using a “ChArUco” board (a checkerboard with additional geometric shapes of known metric dimensions for calibration), which was presented before each new video run, as previously described²⁹. Motion capture markers and accelerometers significantly change the visual appearance of hands, which impacts computer vision tracking performance and reduces external validity in non-instrumented settings. Hence, motion capture combined with accelerometery and computer vision recordings were taken separately.

Mediapipe

For video-based hand tracking, we utilised a powerful and widely used computer vision and pose tracking framework, Mediapipe^31,65 (MP). To this end, the Mediapipe PyPI package was executed in Python Version 3.9 and the respective hand landmark detection model applied to the video dataset which loaded using OpenCV⁶⁶. Based on Mediapipe’s internal computation of “world referenced landmarks”, no further calibration step was needed and the coordinate time series of the 21 landmarks per hand were exported for subsequent calculation of tremor characteristics.

Tremor-specific convolutional neural network

Additionally, a residual convolutional neural network was fine-tuned using DeepLabCut^41,48 to track 29 upper body landmarks from diverse clinical videos (henceforth DLC-RCNN). In an iterative process, frames were extracted from a total of 202 clinical videos from 58 retrospective patients as well as 10 videos from 10 healthy controls performing the finger-to-nose test. These videos were deliberately taken in diverse video settings (perspective, lighting, background) to model variance typical to medical videography^29,67. To further broaden the coverage of variability in regards to anthropomorphic, pose, lighting, background and other technical factors²⁹, a k-Means algorithm was utilised for frame extraction from videos. Extracted frames were subsequently labelled by a trained annotator (AJR) and validated by an expert annotator (MF). In order to minimise labelling errors interfering with training efficiency, the labelled frames were plotted and checked for accuracy and plausibility before the annotated frame sets were passed into neural network training using 95% of data, leaving the remainder as a test set for performance evaluation. In a total of 13 consecutive iterations, the CNN was initialised with ResNet-50 weights and trained using both default and imgaug augmentation approaches. To ensure sufficient convergence of the loss function, the maximum iterations were varied between 500,000 and 1,030,000.

In addition, a subset of 10 retrospective clinical videos was held back for an additional out-of-sample validation. Importantly, no videos of the prospective cohort were included in CNN training in order to maintain strict separation between training, test and validation datasets across the study arms. Model performance was evaluated in a multi-faceted approach as previously described^29,68 (Supplementary Fig. 5).

Calculation of tremor characteristics

As previously reported²⁹, an inverse relationship of the DLC-RCNN’s tracking performance and spatial resolution of the videos was observed. Therefore, videos were resampled to 1280×720 px. This value offered the optimal trade-off of spatial information for landmark tracking and favourable tracking performance.

Kinematic analysis of limb movements was implemented in Python 3.9 using standard scientific analysis packages (pandas, sklearn, numpy, scipy) and a custom analysis pipeline. Two-dimensional coordinate time series were conditioned by removing low likelihood marker data points (confidence/likelihood <0.5, default setting in Mediapipe). Mediapipe outputs three-dimensional marker coordinates, but for fair comparison to the two-dimensional DLC-RCNN, only the x and y coordinates were used. The missing points were then interpolated using a linear filter. Furthermore, a high pass filter was implemented to remove remaining low-frequency components associated with slow arm drifts unrelated to tremor frequency. Given its consistency at successful tracking across videos and its clinical relevance we tracked the middle finger’s distal phalanx for postural tremor analysis (Mediapipe marker “middle finger tip”) and tracked the index finger’s distal phalanx (Mediapipe marker “index finger tip”) for kinetic tremor quantification. A bandpass filter was implemented (postural: low cut = 1 Hz, high cut = 10 Hz), removing both high frequency noise (such as “prediction jitter” introduced by frame-to-frame tracking variability, or failure) and low-frequency large-scale movements (such as slow arm drifts unrelated to tremor frequency). To correct for occasional poor tracking, an additional spiking threshold was applied that identified markers that differed from the previous marker coordinates by over 100px and removed them prior to linear interpolation (this was only relevant in a minority of kinetic tremor videos).

For pixel-to-metric conversion, an individual scaling factor was calculated for each retrospective clinical video in which no systematic calibration information (i.e., checkerboard) was available. To this end, the interpupillary distance (IPD, pupil centre to pupil centre) was used, as previously described²⁹. The ground truth metric IPD was derived from each individual’s preoperative structural T1w-MRI scans (averaged over three measurements) using Suretune 3 (Medtronic Inc., Minneapolis, MN, USA). A patient-specific scaling factor was then calculated by the real IPD (in mm) divided by the video IPD (in px) measured with the open-source software GIMP 2.8.22 (GNU Image Manipulation Program). In the prospective videos, a ChArUco board was used for pixel-to-metric conversion. Using these scaling factors and the known temporal video resolution, the time series could be spatiotemporally transformed to facilitate meaningful comparisons with clinical tremor scores, which mainly rely on tremor amplitude estimates, as well as previous tremor research.

Tremor amplitude was calculated by first computing a spectrogram of the tremor signal which describes the power density of frequencies of a signal as it varies with time. The frequency bin with the maximum power is identified for each frame and the associated amplitude and frequency for that frame is stored. Finally, the resulting feature time series were collapsed per experimental condition (i.e., task, DBS status) into their aggregated mean and peak values. In accordance with the clinical scoring assessing maximal amplitude, peak values were primarily used for subsequent comparisons and correlations. Of note, all main results were reproducible using mean instead of peak amplitudes (Supplementary Figs. 3 and 4).

Computation of tremor frequency from wrist-worn accelerometer

Firstly, the axis with the highest range of variation in acceleration was identified for further analysis. The data underwent pre-processing, involving bandpass filtering between 1 and 10 Hz using a 5th order Butterworth filter. The trials were then segmented based on the type of tremor. Postural tremor trials were segmented by means of synchronised video recordings (VIXTA, BTS, Italy), by excluding any initial or final voluntary arm movements. Kinetic tremor trials were segmented by thresholding the moving average of the absolute value of the signal. For both types of tremor, power spectral density (PSD) of the accelerometric signal was calculated using the pwelch method with a rectangular window of 1 s duration and a 0.5 s overlap. Frequency peaks in the PSD were identified, and average values and standard deviations across trials were calculated for each patient and condition.

Computation of advanced diagnostic features from motion capture and computer vision

To further characterise the nature of the tremor, we analysed harmonics based on the spectrograms derived from each method. Beyond the dominant frequency, we identified additional peaks, indicative of potential harmonics. To determine the nature of these harmonics, we focused on the second most prominent peak in the spectrogram. The frequency of this second peak was divided by the dominant frequency to determine the harmonic relationship. A resulting value of 2 would indicate that the second peak is an even-numbered harmonic, which has previously been reported to be differential diagnostically relevant⁴³. Also, we analysed the inter-limb coherence between the tremor signals of both hands. Inter-limb coherence provides a measure of the synchrony or similarity between the tremor oscillations in the two hands. For essential tremor, a diagnostic feature is the presence of non-coherent tremors between the two hands, as previously described⁴⁴. To quantitatively determine the coherence, we calculated the coherence value between the tremor signals of the two hands, derived from both motion capture and computer vision. Based on the length of the time series data, a significance threshold of 0.15 was established. Coherence values below this threshold were considered as non-coherent, consistent with essential tremor, while values above this threshold suggested coherent tremor activity between the hands.

Statistical methods

For the definition of equivalence boundaries, the smallest effect size of interest (SESOI)^69,70 was chosen in accordance to the anchor intervals used in the FTM tremor scale. We reasoned that the minimal clinically relevant difference of tremor amplitudes most likely corresponds to the boundary between score 0, “no tremor” and score 1, “slight tremor”. However, no amplitude estimate is used as an anchor for this differentiation in the Tremor scale. Based on the 20 mm intervals used to differentiate the subsequent score levels 2–4, which align with our clinical experience in terms of tremor relevance to everyday life, as well as previous technical validations of sensor-based tremor amplitude measurements achieving accuracies ±10 mm⁷¹, a SESOI of 10 mm was empirically derived. Half of the Tremor score anchor interval, this value was intended to reflect a rather conservative estimate of the minimally clinical important difference. For tremor frequency, a SESOI of 0.5 Hz was chosen in accordance with similar previous work²⁴. For precision metrics, no a priori information was available. In line with previous work²⁹, the respective SESOI was anchored at the minimum precision calculated from gold standard, accelerometry. To exclude significant confounding by potential outliers, equivalence boundaries were empirically lowered to determine a hypothetical minimum, given the data. Equivalence was assessed with the two one-sample t-test (TOST) method as implemented in JAMOVI Version 2.2.5.0.

Leave-one-out cross-validation (LOOCV) was used to evaluate the performance of the logistic regression model for DBS outcome prediction. We used kinematic features and the binarized outcome variable based on a ≥ 30% kinetic tremor amplitude increase for analysis. The LOOCV procedure involved iteratively training the model on all samples except one, which was then used as the test set. This process was repeated for each sample, resulting in multiple rounds of training and testing. Performance metrics, such as accuracy, precision, recall, F1-score, AUROC, balanced accuracy and the confusion matrix, were calculated to assess the model’s predictive capability. Additionally, we performed a Wald test to determine the significance of each selected feature as an independent predictor of the outcome. The LOOCV and associated performance evaluation were executed using Python and various libraries, including scikit-learn, statsmodels and seaborn, to support the analysis and generate high-resolution plots for visualisation.

A post-hoc power analysis was conducted. The prospective cohort’s size was constrained by the technically complex experimental setup and patient burden resulting from the associated periods of DBS inactivation. Other studies using motion capture to analyse DBS effects have reported similar sample sizes between 5 and 11 patients in the target groups^{6,13,14,61,72}. We approximated the achieved power for our study’s main objective (correlation between gold standard and computer vision tremor amplitudes), post-hoc. Given the observed correlation strength of 0.72–0.89 and an α-error probability of .05, this study achieved a power (1-β) of 0.69 – 0.99 in our sample of 8 participants.

The normality of datasets was examined using the Shapiro-Wilk test and additional inspection of quartile (“Q-Q”) plots to inform the appropriate display of data distributions and the selection of subsequent contrast tests. In case of a significant deviation of the (log-)normality assumption, non-parametric tests, i.e., Wilcoxon rank-sum test and matched rank biserial correlation were used. Linear relationships were examined using Pearson or Spearman’s rank correlations. When appropriate, outliers were removed using the robust regression and outlier removal (ROUT) method with a balanced coefficient of Q = 1%⁷³. The significance level was set at p < 0.05.

Statistical computations were conducted using Python 3, JAMOVI Version 2.2.5⁷⁴, R Studio⁷⁵ and GraphPad Prism Version 9 (GraphPad Software. GraphPad Prism).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Non-identifiable patient data are available upon request to the corresponding author.

Code availability

Mediapipe hands model is openly available at https://developers.google.com/mediapipe. DeepLabCut code is openly available at https://github.com/DeepLabCut/DeepLabCut. The tremor analysis pipeline is available through github: https://github.com/peach-lucien/PoET. The tremor-specific DLC-RCNN model is available at https://doi.org/10.7910/DVN/CRJRJF.

References

Louis, E. D. & Ferreira, J. J. How common is the most common adult movement disorder? Update on the worldwide prevalence of essential tremor. Mov. Disord. 25, 534–541 (2010).
Article PubMed Google Scholar
Lenka, A. & Jankovic, J. Tremor syndromes: an updated review. Front. Neurol. 12, 684835 (2021).
Article PubMed PubMed Central Google Scholar
Deuschl, G., Raethjen, J., Lindemann, M. & Krack, P. The pathophysiology of tremor. Muscle Nerve 24, 716–735 (2001).
Article CAS PubMed Google Scholar
Bhatia, K. P. et al. Consensus Statement on the classification of tremors. from the task force on tremor of the International Parkinson and Movement Disorder Society. Mov. Disord. J. Mov. Disord. Soc. 33, 75–87 (2018).
Article Google Scholar
Jain, S., Lo, S. E. & Louis, E. D. Common misdiagnosis of a common neurological disorder: how are we misdiagnosing essential tremor? Arch. Neurol. 63, 1100–1104 (2006).
Article PubMed Google Scholar
Reich, M. M. et al. Progressive gait ataxia following deep brain stimulation for essential tremor: adverse effect or lack of efficacy?. Brain J. Neurol.139, 2948–2956 (2016).
Article Google Scholar
Balachandar, A. et al. Are smartphones and machine learning enough to diagnose tremor? J. Neurol. 269, 6104–6115 (2022).
Article PubMed Google Scholar
De, A., Bhatia, K. P., Volkmann, J., Peach, R. & Schreglmann, S. R. Machine learning in tremor analysis: critique and directions. Mov. Disord. https://doi.org/10.1002/mds.29376 (2023).
Article PubMed Google Scholar
Welton, T. et al. Essential tremor. Nat. Rev. Dis. Prim. 7, 83 (2021).
Article PubMed Google Scholar
Deuschl, G. et al. The clinical and electrophysiological investigation of tremor. Clin. Neurophysiol. 136, 93–129 (2022).
Article PubMed Google Scholar
Alusi, S. H., Macerollo, A., MacKinnon, C. D., Rothwell, J. C. & Bain, P. G. Tremor and dysmetria in multiple sclerosis: a neurophysiological study. Tremor Hyperkinetic Mov. 11, 30 (2021).
Casamento-Moran, A. et al. Quantitative separation of tremor and ataxia in essential tremor. Ann. Neurol. 88, 375 (2020).
Article PubMed PubMed Central Google Scholar
Herzog, J. et al. Kinematic analysis of thalamic versus subthalamic neurostimulation in postural and intention tremor. Brain 130, 1608–1625 (2007).
Article PubMed Google Scholar
Vissani, M. et al. Impaired reach-to-grasp kinematics in parkinsonian patients relates to dopamine-dependent, subthalamic beta bursts. Npj Park. Dis. 7, 1–10 (2021).
Google Scholar
Williams, S. R. et al. Quantitative motion analysis and clinical characteristics of Holmes tremor as compared to other tremor types (S32.008). Neurology 98, 1842 (2022).
Article Google Scholar
Schreglmann, S. R. et al. Non-invasive suppression of essential tremor via phase-locked disruption of its temporal coherence. Nat. Commun. 12, 363 (2021).
Article CAS PubMed PubMed Central Google Scholar
Movement Disorders Moment: Use of 3D Motion Capture for Kinematic Analysis in Movement Disorders. Practical Neurology https://practicalneurology.com/articles/2023-dec/movement-disorders-moment-use-of-3d-motion-capture-for-kinematic-analysis-in-movement-disorders (2023).
Elble, R. J. et al. Tremor amplitude is logarithmically related to 4- and 5-point tremor rating scales. Brain 129, 2660–2666 (2006).
Article PubMed Google Scholar
Kremer, N. I. et al. Supine MDS-UPDRS-III assessment: an explorative study. J. Clin. Med. 12, 3108 (2023).
Article PubMed PubMed Central Google Scholar
Stacy, M. A. et al. Assessment of interrater and intrarater reliability of the Fahn-Tolosa-Marin Tremor Rating Scale in essential tremor. Mov. Disord. 22, 833–838 (2007).
Article PubMed Google Scholar
Becktepe, J. et al. Exploring interrater disagreement on essential tremor using a standardized tremor elements assessment. Mov. Disord. Clin. Pract. 8, 371–376 (2021).
Article PubMed PubMed Central Google Scholar
Alusi, S. H., Worthington, J., Glickman, S., Findley, L. J. & Bain, P. G. Evaluation of three different ways of assessing tremor in multiple sclerosis. J. Neurol. Neurosurg. Psychiatry 68, 756–760 (2000).
Article CAS PubMed PubMed Central Google Scholar
Tien, R. N. et al. Deep learning based markerless motion tracking as a clinical tool for movement disorders: utility, feasibility and early experience. Front. Signal Proc. 2, 884384 (2022).
Article Google Scholar
Williams, S. et al. Accuracy of smartphone video for contactless measurement of hand tremor frequency. Mov. Disord. Clin. Pract. 8, 69–75 (2021).
Article PubMed Google Scholar
Barrantes, S. et al. Differential diagnosis between Parkinson’s disease and essential tremor using the smartphone’s accelerometer. PLoS ONE 12, e0183843 (2017).
Article PubMed PubMed Central Google Scholar
van Brummelen, E. M. J. et al. Quantification of tremor using consumer product accelerometry is feasible in patients with essential tremor and Parkinson’s disease: a comparative study. J. Clin. Mov. Disord. 7, 4 (2020).
Article PubMed PubMed Central Google Scholar
Elble, R. J. & McNames, J. Using portable transducers to measure tremor severity. Tremor Hyperkinet. Mov. 6, 375 (2016).
Article Google Scholar
Seethapathi, N., Wang, S., Saluja, R., Blohm, G. & Kording, K. P. Movement science needs different pose tracking algorithms. Preprint at https://doi.org/10.48550/arXiv.1907.10226 (2019).
Friedrich, M. U. et al. Smartphone video nystagmography using convolutional neural networks: ConVNG. J. Neurol. https://doi.org/10.1007/s00415-022-11493-1 (2022).
Article PubMed PubMed Central Google Scholar
Stenum, J., Rossi, C. & Roemmich, R. T. Two-dimensional video-based analysis of human gait using pose estimation. PLoS Comput. Biol. 17, e1008935 (2021).
Article CAS PubMed PubMed Central Google Scholar
Güney, G. et al. Video-based hand movement analysis of parkinson patients before and after medication using high-frame-rate videos and mediaPipe. Sensors 22, 7992 (2022).
Article PubMed PubMed Central Google Scholar
Stenum, J. et al. Applications of pose estimation in human health and performance across the lifespan. Sensors 21, 7315 (2021).
Article PubMed PubMed Central Google Scholar
Morinan, G. et al. Computer vision quantification of whole-body Parkinsonian bradykinesia using a large multi-site population. Npj Park. Dis. 9, 1–12 (2023).
Google Scholar
Esteva, A. et al. Deep learning-enabled medical computer vision. Npj Digit. Med. 4, 1–9 (2021).
Article Google Scholar
Peach, R. et al. Head movement dynamics in dystonia: a multi-centre retrospective study using visual perceptive deep learning. npj Digit. Med. 7, 160 (2024).
Article PubMed Google Scholar
Williams, S., Fang, H., Relton, S. D., Graham, C. D. & Alty, J. E. Seeing the unseen: could Eulerian video magnification aid clinician detection of subclinical Parkinson’s tremor? J. Clin. Neurosci. 81, 101–104 (2020).
Article PubMed Google Scholar
Williams, S. et al. The discerning eye of computer vision: Can it measure Parkinson’s finger tap bradykinesia? J. Neurol. Sci. 416, 117003 (2020).
Article PubMed Google Scholar
Wang, X., Garg, S., Tran, S. N., Bai, Q. & Alty, J. Hand tremor detection in videos with cluttered background using neural network based approaches. Health Inf. Sci. Syst. 9, 30 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hess, C. W. & Pullman, S. L. Tremor: clinical phenomenology and assessment techniques. Tremor Hyperkinet. Mov. 2, tre-02-65-365-1 (2012).
Google Scholar
Cimorelli, A., Patel, A., Karakostas, T. & Cotton, R. J. Validation of portable in-clinic video-based gait analysis for prosthesis users. Sci Rep 14, 3840 (2024).
Article CAS PubMed PubMed Central Google Scholar
Mathis, A. et al. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nat. Neurosci. 21, 1281–1289 (2018).
Article CAS PubMed Google Scholar
Amprimo, G. et al. Hand tracking for clinical applications: validation of the Google MediaPipe Hand (GMH) and the depth-enhanced GMH-D frameworks. Preprint at https://doi.org/10.48550/arXiv.2308.01088 (2023).
Muthuraman, M., Hossen, A., Heute, U., Deuschl, G. & Raethjen, J. A new diagnostic test to distinguish tremulous Parkinson’s disease from advanced essential tremor. Mov. Disord. 26, 1548–1552 (2011).
Article PubMed Google Scholar
Lauk, M. et al. Side-to-side correlation of muscle activity in physiological and pathological human tremors. Clin. Neurophysiol. 110, 1774–1783 (1999).
Article CAS PubMed Google Scholar
Sandoe, C. et al. Predictors of deep brain stimulation outcome in tremor patients. Brain Stimul. 11, 592–599 (2018).
Article PubMed Google Scholar
Favilla, C. G. et al. Worsening essential tremor following deep brain stimulation: disease progression versus tolerance. Brain 135, 1455–1462 (2012).
Article PubMed Google Scholar
Agarwal, S. & Biagioni, M. C. StatPearls (StatPearls Publishing, Treasure Island (FL), 2023).
Nath, T. et al. Using DeepLabCut for 3D markerless pose estimation across species and behaviors. Nat. Protoc. 14, 2152–2176 (2019).
Article CAS PubMed Google Scholar
Williams, S. et al. Computer vision of smartphone video has potential to detect functional tremor. J. Neurol. Sci. 401, 27–28 (2019).
Article PubMed Google Scholar
Nunes, A. S. et al. Automatic classification and severity estimation of ataxia from finger tapping videos. Front. Neurol. 12, 795258 (2022).
Article PubMed PubMed Central Google Scholar
Vidailhet, M., Roze, E. & Jinnah, H. A. A simple way to distinguish essential tremor from tremulous Parkinson’s disease. Brain 140, 1820–1822 (2017).
Article PubMed Google Scholar
Chiu, S. Y. et al. Ataxia and tolerance after thalamic deep brain stimulation for essential tremor. Parkinsonism Relat. Disord. 80, 47–53 (2020).
Article PubMed Google Scholar
Kadirvelu, B. et al. A wearable motion capture suit and machine learning predict disease progression in Friedreich’s ataxia. Nat. Med. 29, 86–94 (2023).
Article CAS PubMed PubMed Central Google Scholar
Ilg, W. et al. Digital gait biomarkers allow to capture 1-year longitudinal change in spinocerebellar ataxia type 3. Mov. Disord. 37, 2295–2301 (2022).
Article CAS PubMed Google Scholar
Needham, L. et al. The accuracy of several pose estimation methods for 3D joint centre localisation. Sci. Rep. 11, 20673 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chatzis, T., Stergioulas, A., Konstantinidis, D., Dimitropoulos, K. & Daras, P. A comprehensive study on deep learning-based 3D hand pose estimation methods. Appl. Sci. 10, 6850 (2020).
Article CAS Google Scholar
Nguyen, J. P. & Degos, J. D. Thalamic stimulation and proximal tremor. A specific target in the nucleus ventrointermedius thalami. Arch. Neurol. 50, 498–500 (1993).
Article CAS PubMed Google Scholar
Ramirez-Zamora, A. & Okun, M. S. Deep brain stimulation for the treatment of uncommon tremor syndromes. Expert Rev. Neurother. 16, 983–997 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cleeves, L. & Findley, L. J. Variability in amplitude of untreated essential tremor. J. Neurol. Neurosurg. Psychiatry 50, 704–708 (1987).
Article CAS PubMed PubMed Central Google Scholar
Fasano, A. et al. Gait ataxia in essential tremor is differentially modulated by thalamic stimulation. Brain 133, 3635–3648 (2010).
Article PubMed Google Scholar
Groppa, S. et al. Physiological and anatomical decomposition of subthalamic neurostimulation effects in essential tremor. Brain 137, 109–121 (2014).
Article PubMed Google Scholar
Fahn, S., Tolosa, E. & Marin, C. Clinical rating scale for tremor. In: Parkinson’s Disease and Movement Disorders (eds Jankovic J, Tolosa E.) 225–234 (Baltimore, MD and Munich, Germany: Urban & Schwarzenberg, 1988).
Perera, T. et al. Deep brain stimulation wash-in and wash-out times for tremor and speech. Brain Stimul. Basic Transl. Clin. Res. Neuromodulation 8, 359 (2015).
Google Scholar
Cooper, S. E., McIntyre, C. C., Fernandez, H. H. & Vitek, J. L. Association of deep brain stimulation washout effects with Parkinson disease duration. JAMA Neurol. 70, 95–99 (2013).
Article PubMed PubMed Central Google Scholar
GitHub - google/mediapipe: Cross-platform, customizable ML solutions for live and streaming media. https://github.com/google/mediapipe (Last accessed September 10th, 2023).
opencv/opencv. OpenCV. Open Source Computer Vision Library (2015).
Haglin, J. M., Jimenez, G. & Eltorai, A. E. M. Artificial neural networks in medicine. Health Technol. 9, 1–6 (2019).
Article Google Scholar
Knorr, S. et al. The evolution of dystonia-like movements in TOR1A rats after transient nerve injury is accompanied by dopaminergic dysregulation and abnormal oscillatory activity of a central motor network. Neurobiol. Dis. 154, 105337 (2021).
Article CAS PubMed Google Scholar
Anvari, F. & Lakens, D. Using anchor-based methods to determine the smallest effect size of interest. J. Exp. Soc. Psychol. 96, 104159 (2021).
Article Google Scholar
Lakens, D. Equivalence tests: a practical primer for t tests, correlations, and meta-analyses. Soc. Psychol. Personal. Sci. 8, 355–362 (2017).
Article PubMed PubMed Central Google Scholar
Mcgurrin P, Mcnames J, Wu T, Hallett M, Haubenberger D. Quantifying tremor in essential tremor using inertial sensors-validation of an algorithm. IEEE J. Transl. Eng. Health Med. 9, 2700110 (2020). Erratum in: IEEE J Transl Eng Health Med. 9, 9700101 (2020).
Fasano, A. et al. Lower limb joints kinematics in essential tremor and the effect of thalamic stimulation. Gait Posture 36, 187–193 (2012).
Article PubMed Google Scholar
Motulsky, H. J. & Brown, R. E. Detecting outliers when fitting data with nonlinear regression – a new method based on robust nonlinear regression and the false discovery rate. BMC Bioinform. 7, 123 (2006).
Article Google Scholar
The jamovi project (2023). jamovi (Version 2.3) [Computer Software]. Retrieved from https://www.jamovi.org.
RStudio Team. RStudio: Integrated Development for R. RStudio, PBC, Boston, MA http://www.rstudio.com/ (2020).

Download references

Acknowledgements

M. U. F. was supported by the Interdisciplinary Center for Clinical Research (IZKF) Z2-CSP13 at the University Hospital Wuerzburg. J.V., M.M.R. and C.P. are supported by the German Research Foundation (DFG, Project-ID 424778381, TRR 295). C.P. was supported by the Fondazione Grigioni per il Morbo di Parkinson and the Fondazione Europea di Ricerca Biomedica (FERB Onlus). The authors thank Prof. David Hogg, Dr. Samuel Relton and Dr. David Wong for helpful remarks.

Funding

Open access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Maximilian U. Friedrich, Anna-Julia Roenn.
These authors jointly supervised this work: Robert Peach, Martin M. Reich.

Authors and Affiliations

Center for Brain Circuit Therapeutics, Brigham and Women’s Hospital, Boston, MA, USA
Maximilian U. Friedrich
Harvard Medical School, Boston, MA, USA
Maximilian U. Friedrich
Department of Neurology, University Hospital Wurzburg, Wuerzburg, Germany
Maximilian U. Friedrich, Anna-Julia Roenn, Chiara Palmisano, Chi Wang Ip, Jens Volkmann, Muthuraman Muthuraman, Robert Peach & Martin M. Reich
Wicking Dementia Research and Education Centre, College of Health and Medicine, University of Tasmania, Hobart, Tasmania, Australia
Jane Alty
Department of Neurology, University Kiel, Kiel, Germany
Steffen Paschen & Guenther Deuschl
Department of Brain Sciences, Imperial College, London, UK
Robert Peach

Authors

Maximilian U. Friedrich
View author publications
Search author on:PubMed Google Scholar
Anna-Julia Roenn
View author publications
Search author on:PubMed Google Scholar
Chiara Palmisano
View author publications
Search author on:PubMed Google Scholar
Jane Alty
View author publications
Search author on:PubMed Google Scholar
Steffen Paschen
View author publications
Search author on:PubMed Google Scholar
Guenther Deuschl
View author publications
Search author on:PubMed Google Scholar
Chi Wang Ip
View author publications
Search author on:PubMed Google Scholar
Jens Volkmann
View author publications
Search author on:PubMed Google Scholar
Muthuraman Muthuraman
View author publications
Search author on:PubMed Google Scholar
Robert Peach
View author publications
Search author on:PubMed Google Scholar
Martin M. Reich
View author publications
Search author on:PubMed Google Scholar

Contributions

M.U.F.: conceptualisation, data analysis, data collection, model development, data interpretation, writing and revision of the manuscript and project supervision. A.J.R.: data collection, model development, data analysis and interpretation and writing and revision of the manuscript. C.P.: data collection, data analysis and revision of the manuscript. J.A.: data interpretation and revision of manuscript. S.P. and G.D.: data collection, data analysis and revision of the manuscript. C.W.I.: computational resources and project supervision. J.V.: data collection, data interpretation and revision of manuscript. M.M.: data analysis and revision of manuscript. R.P.: data analysis, writing and revision of the manuscript and project supervision. M.M.R.: conceptualisation, data collection, data interpretation, revision of the manuscript and project supervision. M.U.F. and A.J.R. are co-first authors and R.P. and M.M.R. are co-last authors.

Corresponding authors

Correspondence to Maximilian U. Friedrich or Martin M. Reich.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Reporting Summary

Patient consent to disclose main figure

Patient consent to disclose supplementary figure

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Friedrich, M.U., Roenn, AJ., Palmisano, C. et al. Validation and application of computer vision algorithms for video-based tremor analysis. npj Digit. Med. 7, 165 (2024). https://doi.org/10.1038/s41746-024-01153-1

Download citation

Received: 01 December 2023
Accepted: 29 May 2024
Published: 21 June 2024
Version of record: 21 June 2024
DOI: https://doi.org/10.1038/s41746-024-01153-1

This article is cited by

Validity of tremor analysis using smartphone compatible computer vision frameworks
- Robin Wolke
- Julius Welzel
- Jos Becktepe
Scientific Reports (2025)
Video-based quantification of hand postural tremor without external references. Integrating postural tremor quantification into visionMD
- Diego L. Guarín
npj Parkinson's Disease (2025)