A quantum inspired machine learning approach for multimodal Parkinson’s disease screening

Vatsavai, Diya; Iyer, Anya; Nair, Ashwin A.

doi:10.1038/s41598-025-95315-0

Download PDF

Article
Open access
Published: 04 April 2025

A quantum inspired machine learning approach for multimodal Parkinson’s disease screening

Diya Vatsavai¹,
Anya Iyer² &
Ashwin A. Nair³

Scientific Reports volume 15, Article number: 11660 (2025) Cite this article

5858 Accesses
10 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Parkinson’s disease, currently the fastest-growing neurodegenerative disorder globally, has seen a 50% increase in cases within just two years. As disease progression impairs speech, memory, and motor functions over time, early diagnosis is crucial for preserving patients’ quality of life. Although machine-learning-based detection has shown promise for detecting Parkinson’s disease, most studies rely on a single feature for classification and can be error-prone due to the variability of symptoms between patients. To address this limitation we utilized the mPower dataset, which includes 150,000 samples across four key biomarkers: voice, gait, tapping, and demographic data. From these measurements, we extracted 64 features and trained a baseline Random Forest model to select the features above the 80th percentile. For classification, we designed a simulatable quantum support vector machine (qSVM) that detects high-dimensional patterns, leveraging recent advancements in quantum machine learning. With this novel and simulatable architecture that can be run on standard hardware rather than resource-intensive quantum computers, our model achieves an accuracy of 90%, F-1 score of 0.90, and an AUC of 0.98—surpassing benchmark models. Utilizing an innovative classification framework built on a diverse set of features, our model offers a pathway for accessible global Parkinson’s screening.

Detection of Parkinson disease using multiclass machine learning approach

Article Open access 15 June 2024

Voice biomarkers as prognostic indicators for Parkinson’s disease using machine learning techniques

Article Open access 09 April 2025

Multi-modality machine learning predicting Parkinson’s disease

Article Open access 01 April 2022

Introduction

Parkinson’s disease (PD) is a progressive neurodegenerative disorder that gradually compromises neuronal function, resulting in motor and nonmotor impairments¹. Central to PD pathology is the degeneration of dopaminergic neurons within the substantia nigra, a critical region for dopamine synthesis. An essential neurotransmitter deficiency arises as these neurons lose their dopamine-producing capacity, leading to hallmark symptoms such as bradykinesia, hypokinetic dysarthria, resting tremor, and muscular rigidity². Over the past 25 years, PD prevalence has doubled, and related deaths have increased by more than 100% since 2000³. These trends highlight an urgent need for effective disease detection and intervention.

Current detection methods, like DAT and PET scans, are not PD-specific, often leading to imprecise results⁴. These diagnostic approaches also require expensive, specialized equipment and medical expertise, costing the U.S. around $14 billion annually⁵. As the fastest-growing neurological condition globally, PD urgently calls for a low-cost, high-accuracy screening solution⁶.

Despite recent improvements in clinical diagnostic criteria, accurately identifying PD remains difficult due to the overlap of symptoms with other neurodegenerative conditions and the normal aging process. Diagnoses based on a single or limited set of clinical features often lead to inconclusive results. Research on specific biomarkers has helped improve PD detection. In this study, we focus on vocal biomarkers, gait indicators, demographic data, and tapping metrics. Studies show that voice serves as a useful PD indicator. Notably, voice has proven particularly informative: Ma et al.⁷ found that 70–90% of individuals with PD exhibit varying degrees of vocal impairment, and up to 78% of those in early disease stages show noticeable changes⁸. Common vocal impairments associated with PD include pitch variations, decreased volume, unclear articulation, and an unstable voice. Recognizing the diagnostic potential of these speech impairments, we focus on analyzing specific vocal features associated with these changes, including volume, pitch, jitter, shimmer, and breathiness.

Gait serves as another characteristic marker of PD. Bradykinesia disrupts gait, causing both episodic and continuous disturbances⁹. Episodic disturbances include start hesitation and freezing of gait, while continuous disturbances reflect inconsistent walking patterns. These gait impairments significantly impact PD patients, with studies showing that 45-68% experience falls annually, and 50-86% fall recurrently¹⁰. Although gait disturbances aid in diagnosis, they also represent one of PD’s most physically deteriorating symptoms, leading to severe injuries and heightening patients’ fears of daily activities.

In addition to clinical markers such as voice and gait impairments, demographic factors—particularly age—play a critical role in detecting and understanding Parkinson’s disease (PD). Data from the Parkinson’s Foundation reveal a pronounced increase in PD incidence among individuals aged 65 and older, underscoring the importance of age-related vulnerability as an early diagnostic cue. This demographic insight not only aids in identifying higher-risk populations but also encourages more timely interventions aimed at slowing disease progression, and thus, it is included in our analysis.

Finger-tapping measurements, which serve as a proxy for bradykinesia, have proven useful yet remain controversial for PD diagnosis. Bradykinesia, a primary motor symptom of PD, manifests as slowness and halting in movement¹¹. However, clinical evaluations of bradykinesia often rely on visual judgment¹², which varies widely between clinicians. Moreover, the limited number of published studies on finger-tapping also lack robust interrater reliability estimates, emphasizing the lack of standardization. Nonetheless, some studies indicate that finger-to-thumb tapping tests correlate with lower overall UPDRS (Unified Parkinson’s Disease Rating Scale) scores in PD patients¹³. Although the UPDRS offers a standardized method for assessing PD motor severity, it cannot fully address the issues of subjectivity and interrater variability.

Multimodal data integration presents a significant challenge in PD diagnosis due to the complex interrelationships between vocal, motor, and demographic features. Traditional machine learning models often struggle when fusing disparate data types efficiently, leading to suboptimal classification performance. Each data modality exhibits distinct characteristics—vocal biomarkers involve temporal and spectral features, gait data requires spatiotemporal analysis, and demographic data adds categorical information—making it difficult for conventional models to construct a holistic understanding of PD-related patterns. Additionally, feature imbalances across these modalities can lead to model bias, reducing reliability and generalizability. As a result, a robust diagnostic system must effectively capture nonlinear relationships while mitigating the impact of missing or noisy data, ensuring comprehensive and precise predictions.

Machine learning methods have seen extensive adoption in healthcare, influencing areas that range from advanced image classification to broader public health policy^14,15,16. For instance, a systematic review of AI applications in urology cancer¹⁵ found that convolutional neural networks (CNNs) achieved the highest detection accuracy (77–95%) for prostate, bladder, and kidney cancers. These findings highlight how AI-driven medical diagnostics, particularly those leveraging deep learning and multi-feature integration, can improve accuracy and clinical decision-making. However, just as with PD research, the review emphasized the need for more extensive, high-quality datasets to enhance the real-world clinical performance of AI models. Recent studies have explored the potential of machine learning algorithms to predict the occurrence of PD. However, most existing approaches rely on a single feature for the basis of prediction as rather than incorporating multiple data modalities, resulting in performance metrics ranging from 60 to 85%, which are generally unsuitable for clinical applications^17,18,19,20. When it comes to diagnosis based on audio, convolutional neural networks (CNNs) have been applied to PD voice recordings, which are converted into spectrogram images for model classification. Many of these single-feature studies are built on private, homogenous datasets with minimal data points and unbalanced samples²¹. This can severely affect the reproducibility of the study as well as model generalizability due to lack of data. In addition, a substantial portion of studies relies on voice data obtained from speakers of a single language, which can introduce bias and further limit the generalizability of classification findings²².

To overcome the discussed limitations, our study’s objective is to provide a multimodal diagnostic framework based on vocal data, gait tracking, tapping, and demographic information to generate a comprehensive prediction of PD. Additionally, using automated detection removes the human subjectivity that arises with visual analysis of finger-tapping. Leveraging a large, publicly available dataset of over 150,000 samples, our approach ensures robustness and mitigates geographic bias. Importantly, we utilize the universal syllable “ahh” for 10 s to eliminate linguistic or accent-based confounding factors in vocal data. We utilize a custom quantum-assisted Support Vector Machine (qSVM) classifier exhibiting high performance even in classical simulations, removing the dependence on computationally expensive quantum hardware. qSVMs leverage the high-dimensional spaces of quantum Hilbert space to capture complex, nonlinear relationships that conventional methods might overlook. This enriched representation often translates into higher classification accuracy, better generalization, and potentially more efficient computations. As a result, qSVMs can outperform traditional methods, particularly in challenging tasks that involve large, heterogeneous datasets like mPower. Moreover, the quantum kernel can be simulated on standard hardware rather than relying on resource-intensive quantum computers, enabling broader access and practical implementation for large-scale clinical or research settings. Using this model, we outperform standard machine learning techniques, state-of-the-art deep learning approaches, and commonly offered qSVM architectures with an accuracy of 90% and an ROC/AUC score of 0.98. Moreover, these results derive from a diverse dataset representing participants with varied gender, age, racial, and educational backgrounds, underscoring its potential for clinical applications globally.

Results

Data description

We utilized the mPower public research portal, which contains measurements from over 6,000 participants—both healthy and those affected by Parkinson’s²³. The dataset is available under protected access to certified researchers. The data includes common Parkinson’s disease biomarkers: demographic information, such as age, gender, and smoking history, as well as voice recordings, tapping measurements, and gait tracking, all recorded through a smartphone app. We restricted our analysis to participants who completed all the different tests (voice, tapping, gait) measured in the mPower dataset. Since many participants completed multiple iterations of the same test, we randomly selected a single trial per activity per participant to mitigate potential biases favoring those with repeated trials. We do so because, including multiple trials from the same participant could inadvertently skew model performance by overrepresenting that individual’s characteristics in the dataset. This risk of overrepresentation would make the model overly tailored to participants with more trials, diminishing its generalizability to broader patient populations²⁴. For model training and testing, we focused on 194 participants who completed the voice, gait, and tapping tests. This subset stands out for its diversity, including male and female participants who identified as Caucasian, African American, Hispanic, East Asian, South Asian, and mixed race. The subset al.so represents a range of educational backgrounds, with 35% of participants not holding a four-year college degree. We divided these 194 samples into 164 for training and 30 for testing, ensuring a balanced representation of both Parkinson’s-affected and healthy individuals.

Feature selection

Using this dataset, we extracted 64 voice, gait, tapping, and demographic features for each of the 194 participants, balanced between healthy individuals and those with Parkinson’s. Each data modality (voice, gait, tapping, demographics) was preprocessed individually to ensure modality-specific feature extraction and noise reduction. For gait and tapping data, we extracted both time-domain metrics (e.g., root mean square, standard deviation, tapping counts) and frequency-domain features (e.g., spectral centroid, spectral spread) to capture Parkinson’s-related tremors. Vocal data consisted of 10-second “ahh” recordings, from which we derived pitch, volume, breathiness, and reduced-dimensional MFCCs, while demographic information included age, smoking history, and gender. Feature correlation was managed through random forest–based importance weighting. Additional details on feature extraction appear in the methods section. We normalized the dataframe using Scikit Learn’s StandardScaler to ensure a consistent magnitude for each feature. Next, we trained a baseline Random Forest model to identify the top-performing features for the final qSVM model, selecting features with importance values above the 80th percentile²⁵ (Fig. 1).

Among the demographic features, age proved to be the most significant, consistent with extensive research illustrating a heightened risk of neurological disorders in older populations. In the voice analysis, the spectral centroid mean was the best predictor of Parkinson’s disease. This feature refers to the “center of mass” of a voice signal and often corresponds to how sharp or muffled the sound is, corresponding to the vocal changes observed in PD. Regarding the gait features, the root mean square and standard deviation of acceleration in the z direction had the highest feature importance. The root mean square, corresponding to the magnitude of acceleration, shows how forcefully the participants moved up and down when walking, and the standard deviation shows the variability of vertical acceleration, corresponding to “shaky motion”, a hallmark of Parkinson’s disease.

For the raw tapping information, the number of left taps, right taps, and total taps quantify how many times the participants tapped their screen in 20 s, providing a measure of their dexterity. Meanwhile, tapping consistency quantified by the standard deviation of the time between taps, shows whether they kept a consistent pace throughout the test. In addition to the raw tapping information, the tapping acceleration measurements from the participants’ smartphones were also significant. Among these features, the root mean square, or magnitude of acceleration, as well as the standard deviations of accelerations in each direction, captures abrupt movements characteristic of PD. Then, in the frequency domain, the average frequency and spectral centroid reflect the smoothness and consistency of tapping acceleration. Finally, the spectral spread of tapping acceleration serves as an additional indicator of the erraticness of the tapping motion, detecting tremors.

For input into the proposed qSVM model, which is highly sensitive to feature ordering and magnitude²⁶, we multiplied each feature by its importance. We sorted them accordingly to emphasize the significance of higher-performing features. We then scaled all features by a factor of 10 to ensure that all features had a magnitude close to 1, enhancing the model’s ability to process the data effectively.

Model architecture

We used Quantum Support Vector Machines (qSVMs) due to their capacity for accurately classifying high-dimensional datasets, capturing subtle patterns that might otherwise go unnoticed, similar to those in the mPower data. Quantum SVM (qSVM) models can access high-dimensional quantum Hilbert spaces, allowing them to encode complex relationships more effectively than standard classification models. This enhanced representation often translates into higher classification accuracy, improved generalization, and more efficient computations. As a result, qSVMs frequently outperform conventional SVM methods, especially for intricate classification tasks²⁷. For the mPower data, which includes diverse and complex biomarkers like voice, gait, tapping, and demographic features, qSVMs can leverage quantum feature mapping to capture subtle, non-linear interactions between these heterogeneous variables. Researchers have increasingly applied qSVMs in clinical diagnosis²⁸. However, quantum computing in the current noisy-intermediate scale quantum (NISQ) era remains costly, time-intensive, and error-prone²⁹. To address these challenges, our study introduces a quantum-inspired kernel architecture that we simulate on classical hardware, while still outperforming traditional models. However, unlike many current qSVM kernels, our model does not rely on entanglement, which is challenging to simulate classically; instead, it uses dynamic angle embedding³⁰ to capture complex data patterns without the overhead of full quantum computation.

Evaluation and comparative analysis

Once we constructed the custom qSVM architecture, we trained the model on our dataframe of 194 samples. Afterward, we compared the accuracy, ROC/AUC score, recall/sensitivity, specificity, and precision to current state-of-the-art models in the field to demonstrate the viability of our approach. These benchmarks collectively provide a comprehensive evaluation of the model’s diagnostic capabilities: accuracy measures overall correctness, ROC/AUC balances true- and false-positive rates, recall (sensitivity) quantifies the ability to detect actual PD cases, specificity ensures minimal false alarms among healthy individuals, and precision assesses correctness among predicted positives. Benchmarking against established state-of-the-art methods, we provide evidence of the proposed model’s viability and robustness across multiple clinical performance criteria. These included architectures that have been explored for medical applications in the past, such as neural networks, SVM and qSVM models, and random forests. The comparative results are displayed in Table 1.

Table 1 Performance of various models across accuracy, ROC/AUC, precision, and recall.

Full size table

Table 2 This confusion matrix summarizes the classification performance of the proposed model on the 30 test subjects.

Full size table

Due to the extensive feature set and the high data complexity, models that incorporated strong overfitting protections generally performed better (Table 1). The corresponding confusion matrix for the proposed model is presented in Table 2. Among the classical algorithms, logistic regression and the linear SVM demonstrated the best performance. By contrast, complex neural networks tended to overfit, reducing their accuracy. Among alternative qSVMs, the entanglement-heavy ZZ feature map performed poorly, likely because classical simulators cannot emulate entanglement accurately. The Z feature map without entanglement performed better but lacked the complexity necessary to capture the full dimensionality of the data, as reflected in its lower metrics. The proposed kernel, by using quantum rotation gates to encode features into a complex quantum state without requiring entanglement, achieved the highest performance across accuracy, ROC/AUC score and F1 score (Fig. 2), emphasizing its potential as a baseline for future clinical applications.

We incorporated statistical tests including McNemar’s tests on classification outcomes, which yielded p-values ranging from 0.00049 to 0.0625, indicating that our qSVM model significantly outperforms classical and deep learning benchmarks. The low p-values (mostly < 0.05) confirm that the performance differences are unlikely due to chance, reinforcing the robustness of our model’s 90% accuracy. This demonstrates that qSVM effectively captures complex, nonlinear relationships in the data, improving classification performance over traditional methods. The results validate our approach, showing the model’s superior predictive power.

Feature importance in classification

Figure 3 displays the Shapley value plot, illustrating the relative importance of each feature in the model. As anticipated, age emerges as the most influential factor, exerting a significant effect across the predictive spectrum. Several tapping-related features—such as tap acceleration standard deviation (x, y, z), spectral centroid, and spectral spread—play significant roles, indicating that variability and frequency components of tapping movements contribute meaningfully to classification. Additionally, measures of tap consistency, total taps, and left/right taps show notable importance, reinforcing the relevance of motor coordination. Gait acceleration metrics, including root mean square (z) and standard deviation (z), further underscore the significance of movement-related biomarkers. Overall, these findings highlight the importance of motor-focused data, notably tapping dynamics and gait accelerations, as critical indicators for predicting Parkinson’s disease.

Discussion

In this study, we employ a multimodal framework for prediction, using voice recordings with a universal vowel sound “ahh”, gait indications from phone-detected acceleration data points, phone tapping count and acceleration, as well as demographic information. By introducing a novel, simulatable quantum Support Vector Machine (qSVM) equipped with a custom kernel based on quantum rotation gates, we achieve high performance without requiring a fault-tolerant quantum device. This approach not only produces superior classification metrics but also demonstrates efficient and practical testing procedures, paving the way for more accessible clinical applications.

Limitations

Concerns could arise regarding the model’s ability to generalize to new data. With only a train and test dataset, there is a possibility of the model overfitting to the validation set. Overfitting occurs when a model becomes overly complex, relying on the noise within the data rather than solely focusing on underlying patterns, causing the model to generalize poorly to real-world data³⁴. Future studies could address this issue through rigorous clinical trials that expose the model to diverse patient populations, clinical environments, and real-world conditions, thereby offering a practical cross-validation of performance. Additionally, replicating the data collection procedures from the mPower study would help ensure consistency and robustness in subsequent applications.

Ethics

All data used was made available by the mPower public researcher portal, and our academic usage complies with the guidelines outlined in the research license. In a real-world application, we acknowledge that false positives, though rare, could have the potential to cause psychological distress. Therefore, we assert that this model is not a standalone diagnostic but rather a screening tool, to be considered in the larger context of overall health by medical professionals. Furthermore, our work is intended to serve as a baseline for further research and testing. To enhance reliability and minimize errors, we propose a structured testing protocol similar to the measurements used in the mPower dataset.

Additionally, future studies should focus on evaluating the model across diverse populations and clinical settings, ensuring fairness and generalizability. Conducting controlled clinical trials will be essential in verifying the model’s effectiveness under real-world conditions, reducing biases, and refining its practical applications.

Broader impact and future work

We aim to advance research in Parkinson’s disease (PD) prediction, with a particular focus on developing multimodal early screening tools that combine multiple biomarkers to enhance classification accuracy. Such noninvasive approaches boost the accessibility of initial classification while lowering costs compared to conventional methods. Moreover, incorporating a broad range of features not only improves model performance metrics but also facilitates a more individualized analysis, potentially enabling more tailored therapeutic strategies in future clinical settings. For example, future work could focus on broadening the spectrum of biomarkers by incorporating additional physiological and biochemical indicators, such as neuroimaging, genomic data, and other sensor-derived metrics.

We additionally encourage investigations into simulatable qSVM kernels for diverse real-world applications, as these may pave the way for new breakthroughs in various domains. The multimodal framework, which integrates various biomarkers and leverages advanced modeling techniques like qSVMs, can be readily extended to other neurological disorders characterized by complex symptom profiles. By adapting the feature extraction process to different physiological signals and clinical datasets, the approach holds promise for detecting a range of conditions beyond Parkinson’s disease, ultimately offering a flexible, powerful tool for broader healthcare applications.

For clinical deployment, such a model could be seamlessly integrated into existing healthcare infrastructures (e.g., EHRs) via digital platforms, allowing for remote and continuous patient monitoring. By embedding it into smartphone applications or wearable devices, physicians can more effectively track early symptoms and disease progression. Additionally, the model’s capacity to evaluate symptoms across multiple data modalities provides a holistic view of patient health, complementing conventional clinical evaluations. Healthcare providers can then leverage these consolidated insights directly within EHR platforms, enabling more personalized treatment strategies and streamlined patient monitoring.

In summary, we present a multimodal classifier for PD, trained on a diverse, globally representative dataset. This framework employs a quantum Support Vector Machine (qSVM) to integrate data from tapping measurements, gait tracking, voice recordings, and demographic information. To optimize resource utilization, the qSVM kernel relies on rotation gates rather than entanglement operations, permitting efficient classical simulations while preserving the strengths of quantum algorithmic principles. As a result, this design enables cost-effective and scalable PD prediction for a worldwide population.

Looking ahead, future work may prioritize conducting real-world clinical trials to gauge clinical utility, refining the model for improved interpretability, and fostering collaboration with healthcare providers for seamless integration into clinical practice.

Methods

Gait data processing

Although the mPower study included gait tracking measurements from a variety of different devices, including pedometers and accelerometers, we decided to focus on the smartphone measurements to preserve the accessibility and broad applicability of our model. The smartphone measurement file was composed of time series data tracking the x, y, and z acceleration of the device at regular intervals while the participant was walking. In order to interpret the data, we extracted the root mean square of the acceleration in the x, y, and z directions as well as that of the total signal, a metric commonly used to capture magnitude when describing time-varying quantities³⁵. In addition, we included the standard deviation of acceleration in each direction to capture the variability over time. Next, we converted the total magnitude of acceleration into the frequency domain using a fast Fourier transform. In the frequency domain, we extracted the dominant frequency, mean frequency, spectral centroid, and spectral spread, corresponding to signal features that have the potential to capture the characteristic erratic movements of individuals affected by Parkinson’s disease³⁶.

Tapping feature extraction

In the tapping test, mPower participants used a smartphone app to alternate tapping a left button and a right button as many times as they could in 20 s. From this, two types of measurements were recorded. First, each left tap and right tap were recorded, along with their corresponding timestamps. From this information, we extracted the number of taps on the left button, right button, and the total number of taps in 20 s, which shows the dexterity of the participants. Next, we extracted the number of repeated taps on the same button, which shows the participant’s ability to alternate between buttons effectively. Finally, we extracted the consistency of their taps, which we measured by computing the standard deviation of the time between taps, corresponding to how consistent their tapping speed was.

The second type of measurement recorded was the acceleration of the smartphone in the x, y, and z directions throughout this 20 s interval. Similarly to gait, we extracted the root mean square and standard deviation of acceleration in the x, y and z directions, as well as for the total signal. In the frequency domain, we extracted the dominant and mean frequencies, in addition to the spectral centroid and spectral spread. Through all of the acceleration features, we hoped to represent the characteristic shakiness and tremors associated with PD³⁷.

Vocal feature extraction

While gait and demographic, and tapping data were expressed numerically, the researcher portal provided vocal data in the form of 10 s recordings of the vowel sound “ahhh”. To quantify the differences in voice, specific vocal characteristics were considered. Variations in volume and pitch have been known to be associated with PD³⁸. As a result, their means and standard deviations were added to the dataframe. Furthermore, signal characteristics also capture variations in voice, so the means and standard deviations of the zero-crossing rate, root mean square, spectral centroid, spectral bandwidth, and spectral rolloff were included as well. Research has described Parkinson’s affected voices as having an airy or breathy quality³⁹, and direct quantification of this feature was performed using the Acoustic Breathiness Index proposed by⁴⁰. This index was added to the feature list, along with its inputs of harmonics to noise ratio, cepstral peak prominence, power spectral density, harmonic difference, glottal to noise excitation ratio, high-frequency noise occurrence at 6000 Hz, and shimmer in decibels. Finally, Mel Frequency Cepstral Coefficients were extracted for each recording, with Principal Component Analysis being applied to reduce the number of MFCC features to ten for computational efficiency.

Demographic features

In addition to professional diagnosis, various information about the participants was provided. Excluding the hardware specifications, participant data mainly consisted of medical history. However, most of these measurements, such as past surgery, were not available for the majority of participants, hindering their efficacy. Furthermore, we excluded race to mitigate bias in detection. As such, age and smoking history were included due to a strong correlation with PD prevalence^41,42, as well as gender because of its effect on vocal characteristics⁴³. Table 3 provides further details on the demographic characteristics of the participants.

Table 3 Demographic breakdown of training and testing sets with significance for generalizability.

Full size table

SVM classifiers

Support vector machine (SVM) models are widely used in binary classification problems due to their flexibility across dataset sizes and resistance to overfitting⁴⁴. By plotting each datapoint in high-dimensional space, the kernel function of an SVM seeks to define a hyperplane boundary between each class. In this way, the model is able to classify new data by plotting it on this same plane and seeing which side of the boundary it falls on. These boundaries, or kernels, can be linear, polynomial, or a radial basis function (RBF).

qSVM model architecture

Quantum SVMs, or qSVMs, are being researched for their ability to improve classification by accessing high-dimensional Hilbert space. This works by considering the overlap or fidelity between quantum states, which has the potential to capture more complex relationships than a standard dot product. However, with current limitations on quantum hardware, quantum-inspired kernels that can be simulated on classical computers are more promising for real-world applications.

Currently, Qiskit offers the Fidelity Kernel constructor to construct quantum SVM models. This kernel mainly runs on the ZZ Feature Map⁴⁵. This feature map has been known to achieve excellent performance on resource-intensive quantum hardware. However, due to its inherent complexity and reliance on entanglement, which can be difficult to simulate classically, this kernel performs poorly in classical state vector simulations, as supported by Simoes et al.⁴⁶.

Past papers, such as Kariya et al. and Suzuki et al., have explored the use of specific rotational gates to encode data characteristics into a complex quantum state in conjunction with or as an alternative to entanglement^47,48. This study proposes using Pennylane’s provided Angle Embedding function³⁰, which encodes numerical data into rotation angles. This approach enhances the simplicity of the kernel construction by mitigating resource-intensive matrix operations or entanglement. The method encodes classical data as rotational angles on qubits, effectively transforming each input feature into a quantum state. By mapping numeric values to these rotations, Angle Embedding provides a straightforward yet powerful tool for simulating quantum-based machine-learning models on classical hardware. Furthermore, these rotations, represented as matrix operations applied to multidimensional quantum state vectors, have the potential to capture highly complex, nonlinear relationships.

qSVM model framework

After each input is preprocessed and combined, the input data is mapped to a quantum feature space using our custom kernel. SVM kernels work by computing the similarity between two data points x₁ and x₂. To do this through quantum operations, each qubit represents one feature of the data. For each feature, the proposed model first performs a Y-axis rotation of the x₁ value, followed by a Y-axis rotation of -x₂. If x₁ and x₂ are similar, the resulting measurement would yield a value close to the qubit’s initial state of 0 since the rotations would nearly cancel out. In this way, by measuring the magnitude of the final qubit state, the kernel computes the overlap between x₁ and x₂.

For ease of computation, the kernel replaces two rotations of R_Y(x₁) and R_Y(-x₂) with a single rotation of R_Y(x₁ − x₂). As shown in Fig. 4, Qiskit breaks down R_Y operations into R_Z and R_X components, using quantum mechanical identities. Since R_X($\:\frac{\pi\:}{2}$) is equivalent to $\:\sqrt{x}$, the kernel can be programmed as R_Z($\:-\frac{\pi\:}{2}$), $\:\sqrt{x}$, R_Z(x₁ − x₂), $\:\sqrt{x}$ and R_Z($\:-\frac{\pi\:}{2}$), as shown. Since the rotation angles are entirely determined by the input data, no additional hyperparameters need to be tuned for this model.

After each qubit is measured, the values are aggregated through a weighted sum of each measurement multiplied by the random forest feature importance of the corresponding feature, transformed using a softmax function to emphasize the contribution of features with high predictive power. This computation is repeated on every pair of datapoints in the training set to construct the kernel matrix, which is then passed into a classical SVM for evaluation as shown in the schematic diagram for the quantum-inspired framework and data flow (Fig. 5).

The proposed model mitigates overfitting by first selecting only the most predictive features using a Random Forest (above the 80th percentile) and scaling them by importance, and by employing a simplified kernel based on quantum rotation gates. Additionally, using a single trial per participant prevents individual data from dominating the training set, further enhancing generalizability.

Benchmark models

All benchmark models were chosen from either standard machine learning options or models used by prior researchers in this field^22,28,29,31. For most models, the same training and testing sets as the proposed model were used to ensure a fair comparison. However, for the alternative qSVM kernels of the Z and ZZ feature map, the full dataset was too resource-intensive to run. So, we chose to extract metrics based on a subset of the dataset including the first 30 train and 15 test samples.

Data availability

The datasets generated and/or analysed during the current study are available in the mPower Public Research Portal²³ repository, https://www.synapse.org/Synapse:syn4993293/wiki/376006. This dataset is publicly available at the link noted.

References

DeMaagd, G. & Philip, A. Parkinson’s disease and its management: part 1: disease entity, risk factors, pathophysiology, clinical presentation, and diagnosis. P T: Peer-Reviewed J. Formulary Manage. 40, 504–532 (2015). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517533/
Google Scholar
Triarhou, L. & National Center for Biotechnology Information (NCBI). Parkinson’s Disease Overview. GeneReviews®. https://www.ncbi.nlm.nih.gov/books/NBK6271/ (2004).
World Health Organization. Parkinson disease. WHO Fact Sheets. https://www.who.int/news-room/fact-sheets/detail/parkinson-disease (2019).
Palermo, G., Giannoni, S., Bellini, G., Siciliano, G. & Ceravolo, R. Dopamine transporter imaging, current status of a potential biomarker: A comprehensive review. Int. J. Mol. Sci. 22(20), 11234. https://doi.org/10.3390/ijms222011234 (2021).
Article CAS PubMed PubMed Central Google Scholar
U.S. Department of Health and Human Services. Parkinson’s disease: Challenges, progress, and promise. National Institute of Neurological Disorders and Stroke. https://www.ninds.nih.gov/current-research/focus-disorders/parkinsons-disease-research/parkinsons-disease-challenges-progress-and-promise (n.d.).
Bhidayasiri, R. et al. The rise of Parkinson’s disease is a global challenge, but efforts to tackle this must begin at a National level: A protocol for National digital screening and eat, move, sleep lifestyle interventions to prevent or slow the rise of non-communicable diseases in Thailand. Front. Neurol. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129688/ (2024).
Ma, A., Lau, K. K. & Thyagarajan, D. Voice changes in Parkinson’s disease: what are they telling Us?? J. Clin. Neurosci. 72(1), 1–7. https://doi.org/10.1016/j.jocn.2019.12.029 (2020).
Article PubMed MATH Google Scholar
Ireland, S., Carroll, V., Blanchard, D. & Rossiter, R. Communication and swallowing difficulties in parkinsons disease. Aust. J. Gen. Pract. https://www1.racgp.org.au/ajgp/2022/april/communication-and-swallowing-difficulties-in-parki (2022).
Ataullah, A. H. M. Gait Disturbances (StatPearls, 2024). https://www.ncbi.nlm.nih.gov/books/NBK560610/
Pelicioni, P. H. S., Menant, J. C., Latt, M. D. & Lord, S. R. Falls in Parkinson’s disease subtypes: risk factors, locations and circumstances. Int. J. Environ. Res. Public Health. https://pmc.ncbi.nlm.nih.gov/articles/PMC6616496/ (2019)
Adams, W. R. Bradykinesia in Parkinson’s disease. In Diagnosis and Management in Parkinson’s Disease (Elsevier, 2020). https://doi.org/10.1016/B978-0-12-818042-0.00010-9.
Hopkins Medicine. Assessment of bradykinesia in Parkinson’s disease using video-based pose estimation. Johns Hopkins Clinical Connection; https://clinicalconnection.hopkinsmedicine.org/videos/assessment-of-bradykinesia-in-parkinson-s-disease-using-video-based-pose-estimation (n.d.).
Yu, T., Park, K. W., McKeown, M. J. & Wang, Z. J. Clinically informed automated assessment of finger tapping videos in Parkinson’s disease. Sensors. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674854/ (2023).
El-Sayed, M. & Abualigah, M. M. Laith. Machine learning in public health forecasting and monitoring the Zika virus. Metaheuristic Optim. Rev.. https://doi.org/10.54216/MOR.010201 (2024).
Lubbad, M. et al. Machine learning applications in detection and diagnosis of urology cancers: a systematic literature review. Neural Comput. Applic. 36, 6355–6379. https://doi.org/10.1007/s00521-023-09375-2 (2024).
Article MATH Google Scholar
Lubbad, M. A. H. et al. A comparative analysis of deep Learning-Based approaches for classifying dental implants decision support system. J. Digit. Imaging Inf. Med. 37, 2559–2580. https://doi.org/10.1007/s10278-024-01086-x (2024).
Article MATH Google Scholar
Ahn, S. et al. Neurologic dysfunction assessment in Parkinson disease based on fundus photographs using deep learning. JAMA Ophthalmol.. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9912166/ (2023).
Ferreira, M. I. A. S. N., Barbieri, F. A., Moreno, V. C., Penedo, T. & Tavares, J. M. R. S. Machine learning models for parkinson’s disease detection and stage classification based on spatial-temporal gait parameters. Gait Posture. https://pubmed.ncbi.nlm.nih.gov/36049418/ (2022).
Ranjan, N. M., Mate, G. & Bembde, M. Detection of parkinson’s disease using machine learning algorithms and handwriting analysis. https://doi.org/10.46610/JoDMM.2023.v08i01.004 (2023).
Tougui, I., Jilbab, A. & Mhamdi, J. E. Machine learning smart system for parkinson disease classification using the voice as a biomarker. Healthc. Inform. Res. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9388925/ (2022).
Barukab, O., Ahmad, A., Khan, T. & Thayyil Kunhumuhammed, M. R. Analysis of Parkinson’s disease using an imbalanced-speech dataset by employing decision tree ensemble methods. Diagnostics. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776735/ (2022).
Iyer, A. et al. A machine learning method to process voice samples for identification of Parkinson’s disease. Nat. News. https://www.nature.com/articles/s41598-023-47568-w (2023).
Synapse mPower Public Researcher Portal. mPower Mobile Parkinson Disease Study. https://www.synapse.org/Synapse:syn4993293/wiki/376006 (n. d.).
Schober, P. & Vetter, T. R. Repeated measures designs and analysis of longitudinal data: if at first you do not Succeed-Try, try again. Anesth. Analgesia 127(2). https://doi.org/10.1213/ANE.0000000000003511 (2018).
Park, S., Park, D. K. & Rhee, J. K. K. Variational quantum approximate support vector machine with inference transfer. Nat. News. https://www.nature.com/articles/s41598-023-29495-y (2023).
Solenov, D., Brieler, J. & Scherrer, J. F. The potential of quantum computing and machine learning to advance clinical research and change the practice of medicine. Missouri Med. 115(5), 463–467 (2018).
Chen, B. S. & Chern, J. L. Generating Quantum Feature Maps for SVM Classifier. https://arxiv.org/abs/2207.11449 (2022).
Adebiyi, M. O., Fatinikun-Olaniyan, D., Osang, F. & Adebiyi, A. A. Quantum theory approach to performance enhancement in machine learning. IEE Xplore 1–7. https://doi.org/10.1109/seb-sdg57117.2023.10124582 (2023).
Preskill, J. Quantum Computing in the NISQ era and beyond. https://arxiv.org/pdf/1801.00862 (2018).
PennyLane qml.AngleEmbedding—PennyLane 0.39.0 documentation. PennyLane. https://docs.pennylane.ai/en/stable/code/api/pennylane.AngleEmbedding.html (n.d.).
Srinivasan, S. et al. Detection of Parkinson disease using multiclass machine learning approach. Nat. News. https://www.nature.com/articles/s41598-024-64004-9 (2024).
Wang, W., Lee, J., Harrou, F. & Sun, Y. Early detection of Parkinson’s disease using deep learning and machine learning. IEEE J. Mag.. https://ieeexplore.ieee.org/abstract/document/9165732 (2020).
Ali, L. et al. Parkinson’s disease detection based on features refinement through L1 regularized SVM and deep neural network. Nat. News. https://www.nature.com/articles/s41598-024-51600 (2024).
Takahashi, Y. et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Nat. News. https://www.nature.com/articles/s41398-020-00957-5 (2020).
Zhang, Z., Reinikainen, J., Adeleke, K. A., Pieterse, M. E. & Groothuis-Oudshoorn, C. G. M. Time-varying covariates and coefficients in cox regression models. Ann. Transl. Med. https://pmc.ncbi.nlm.nih.gov/articles/PMC6015946/ (2018).
Elbatanouny, H. et al. Insights into Parkinson’s disease-related freezing of gait detection and prediction approaches: A meta analysis. Sensors. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11207947/ (2024).
Blakemore, R. L., MacAskill, M. R., Myall, D. J. & Anderson, T. J. Volitional suppression of parkinsonian resting tremor. Mov. Disord. Clin. Pract. https://pmc.ncbi.nlm.nih.gov/articles/PMC6660237/ (2019).
Skodda, S., Grönheit, W., Mancinelli, N. & Schlegel, U. Progression of voice and speech impairment in the course of Parkinson’s disease: A longitudinal study. Parkinson’s Dis.. https://doi.org/10.1155/2013/389195 (2013).
Cernak, M. et al. Characterisation of voice quality of parkinson’s disease using differential phonological posterior features. Comput. Speech Lang. https://www.sciencedirect.com/science/article/abs/pii/S0885230817300724 (2017).
Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P. & Corthals, P. The acoustic breathiness index (ABI): A multivariate acoustic model for estimating breathiness severity in dysphonia. J. Voice 24, 315.e11–315.e27 (1997).
Pagano, G., Ferrara, N., Brooks, D. J. & Pavese, N. Age at onset and Parkinson disease phenotype. Neurology 86, 1400–1407. https://doi.org/10.1212/WNL.0000000000002461 (2016). https://www.neurology.org/doi/
Article CAS PubMed PubMed Central MATH Google Scholar
Allam, M. F., Campbell, M. J. & Castillo, D. & Fernández-Crehuet, N. R. Parkinson’s disease protects against smoking?. Behav. Neurol. 15, 65–71 (2004). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5488608/
PubMed Google Scholar
Latinus, M. & Taylor, M. J. Discriminating male and female voices: Differentiating pitch and gender - brain topography. https://link.springer.com/article/10.1007/s10548-011-0207-9 (2011).
Huang, S. et al. Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genom. Proteomics. https://pmc.ncbi.nlm.nih.gov/articles/PMC5822181/ (2018).
Otten, M., Goumiri, I. R., Priest, B. W., Chapline, G. F. & Schneider, M. D. Quantum machine learning using Gaussian processes with performant quantum kernels. https://arxiv.org/abs/2004.11280 (2020).
Simoes, R. et al. Experimental evaluation of quantum machine learning algorithms. IEEE Trans. Neural Networks Learn. Syst. 34, 1–14 (2023). https://ieeexplore.ieee.org/document/10015720
MATH Google Scholar
Kariya, A. & Behera, B. Quantum Computing in the NISQ era and beyond. https://arxiv.org/abs/2112.06912 (2021).
Suzuki, T., Hasebe, T. & Miyazaki, T. Quantum support vector machines for classification and regression on a trapped-ion quantum computer. Quantum Mach. Intell. 6(1). https://doi.org/10.1007/s42484-024-00165-0 (2024).

Download references

Author information

Authors and Affiliations

Valley Christian High School, San Jose, CA, USA
Diya Vatsavai
Dougherty Valley High School, San Ramon, CA, USA
Anya Iyer
UC Davis Graduate School of Management, Davis, CA, USA
Ashwin A. Nair

Authors

Diya Vatsavai
View author publications
Search author on:PubMed Google Scholar
Anya Iyer
View author publications
Search author on:PubMed Google Scholar
Ashwin A. Nair
View author publications
Search author on:PubMed Google Scholar

Contributions

All authors developed the research concept. Anya Iyer performed the data acquisition. Diya Vatsavai coded the data preprocessing and model training. All authors wrote the manuscript and edited the manuscript. All authors have reviewed and approved the final manuscript.

Corresponding author

Correspondence to Ashwin A. Nair.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vatsavai, D., Iyer, A. & Nair, A.A. A quantum inspired machine learning approach for multimodal Parkinson’s disease screening. Sci Rep 15, 11660 (2025). https://doi.org/10.1038/s41598-025-95315-0

Download citation

Received: 14 December 2024
Accepted: 20 March 2025
Published: 04 April 2025
Version of record: 04 April 2025
DOI: https://doi.org/10.1038/s41598-025-95315-0

Subjects

Abstract

Similar content being viewed by others

Detection of Parkinson disease using multiclass machine learning approach

Voice biomarkers as prognostic indicators for Parkinson’s disease using machine learning techniques

Multi-modality machine learning predicting Parkinson’s disease

Introduction

Results

Data description

Feature selection

Model architecture

Evaluation and comparative analysis

Feature importance in classification

Discussion

Limitations

Ethics

Broader impact and future work

Methods

Gait data processing

Tapping feature extraction

Vocal feature extraction

Demographic features

SVM classifiers

qSVM model architecture

qSVM model framework

Benchmark models

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links