Data-driven clinical decision support tool for diagnosing mild cognitive impairment in Parkinson’s disease

Martínez Tirado, Gabriel; Martins Conde, Patricia; Sapienza, Stefano; Fröhlich, Holger; Pauly, Claire; Schröder, Valerie E.; Jónsdóttir, Sonja; Tsurkalenko, Olena; Krüger, Rejko; Klucken, Jochen

doi:10.1038/s41531-025-01222-6

Download PDF

Article
Open access
Published: 12 January 2026

Data-driven clinical decision support tool for diagnosing mild cognitive impairment in Parkinson’s disease

Gabriel Martínez Tirado¹,
Patricia Martins Conde¹,
Stefano Sapienza¹,
Holger Fröhlich^2,3,
Claire Pauly^4,5,
Valerie E. Schröder^1,4,5,
Sonja Jónsdóttir⁵,
Olena Tsurkalenko^4,5,
Rejko Krüger^1,4,5 &
Jochen Klucken^1,4
On behalf of the NCER-PD consortium

npj Parkinson's Disease volume 12, Article number: 15 (2026) Cite this article

2053 Accesses
25 Altmetric
Metrics details

Subjects

Abstract

Parkinson’s disease (PD) is a neurodegenerative condition that may affect both motor and cognitive function. Mild cognitive impairment (MCI) is a known risk factor for the progression to dementia in the later stages of the disease. Lengthy and time-consuming neuropsychological assessments, by trained experts, often make MCI diagnosis impractical in routine care. In this context, machine learning (ML) may offer promising support for MCI diagnosis. Thus, we analysed longitudinal data from 115 people with Parkinson’s disease (PwPD) and 226 healthy control participants from the Luxembourg Parkinson’s Study, combining ML with clinical data to support MCI diagnosis in PwPD. The data-driven model showed a non-inferior performance to the clinical diagnostic reference test (MDS PD-MCI Level II) and identified a subgroup of MCI individuals that was not captured by the clinical test. This finding suggests that ML models can complement clinical assessments, by facilitating the detection of MCI and complementing the diagnostic characterisation of PwPD.

Multimodal neuroimaging-based prediction of Parkinson’s disease with mild cognitive impairment using machine learning technique

Article Open access 11 November 2024

Construction of a mild cognitive impairment prediction model for Parkinson’s disease patients on the basis of multimodal data

Article Open access 18 November 2025

Subitem-level multi-scale assessment and machine learning for three-class cognitive status classification in Parkinson’s disease

Article Open access 04 December 2025

Introduction

Parkinson’s disease (PD) is the second most common neurodegenerative condition, affecting over 1% of the population aged 60 years and older¹, with a worldwide prevalence that has been steadily increasing in recent decades². PD is associated with both motor and non-motor symptoms, including cognitive impairment, depression, anxiety, and sleep disorders³. Among these, cognitive impairment is considered one of the most significant non-motor aspects of PD, as it significantly reduces patients’ quality of life by affecting attention, executive function, visuospatial abilities, and contributing to neuropsychiatric disturbances, ultimately leading to functional impairment^4,5,6. This worsening in the cognitive functions also has a substantial impact on caregiver well-being.

Mild Cignitive Impairment (MCI) can be considered a transitional stage between normal cognition (NC) and dementia⁷. It is a clinically important syndrome due to both its high prevalence and its role as a significant risk factor for further cognitive decline. In PD, cognitive changes follow a similar continuum, often progressing from subjective cognitive decline (SCD) to Parkinson’s disease–specific MCI (PD-MCI) and, in some cases, to dementia. Therefore, MCI represents a clinically relevant condition for diagnosis and early detection of cognitive impairment in PD, enabling early intervention and monitoring of potential progression to dementia. As a result, the establishment of diagnostic criteria has long been a priority in the field.

Diagnostic definitions for MCI in the elderly have evolved over time. Early definitions of MCI focused primarily on memory impairment with preserved functional abilities⁸. However, current consensus highlights the need for in-depth neuropsychological evaluation spanning the full spectrum of cognitive domains, including attention, executive function, language, and visuospatial skills⁹.

A wide range of diagnostic tools exists for diagnosing MCI in PwPD, from brief screenings, testing global cognition to comprehensive neuropsychological assessments specifically tailored for PwPD¹⁰. Among brief assessments, the Montreal Cognitive Assessment (MoCA)¹¹ and the Mini–Mental State Examination (MMSE)¹² are the most widely used. The MoCA total score has shown greater sensitivity when compared to the MMSE for screening MCI and monitoring cognitive decline in PwPD^13,14, although its limited granularity and suboptimal sensitivity remain a concern¹⁵. The MoCA total score, for instance, offers only a general overview of the cognitive function and lacks the sensitivity to detect subtle or domain-specific impairments¹⁵. This led to the development of clinical diagnostic criteria for MCI in PD by the “International Parkinson and Movement Disorder Society (MDS PD-MCI)” task force in 2012¹⁶. This criterion is divided into two levels of comprehensiveness: (a) level I (abbreviated assessment) and (b) level II (comprehensive assessment). The MDS PD-MCI Level II criteria require a comprehensive neuropsychological assessment of cognitive function across different domains to identify MCI and are widely recognised as the gold standard^{17,18,19,20,21}. Although MDS PD-MCI Level II criteria are accepted in clinical research, its implementation into everyday healthcare procedures has certain drawbacks. It relies on cognitive assessment through neuropsychological tests, which are time-consuming, require expert interpretation, and are based on predefined rigid cut-off values^22,23. Furthermore, there is currently no consensus on the optimal cutoff thresholds for identifying MCI in PwPD. Cutoff ranges from 1 to 2 standard deviations (SD) below the normative mean, are applied across one or multiple domains. This variation leads to significant differences in the proportion of PD-MCI cases identified in different studies^16,22, often leading to diagnostic heterogeneity. This variability not only complicates clinical decision-making but also limits comparability across studies and settings. These limitations highlight the urgent need for alternative, more flexible approaches to improve MCI identification in PD.

Artificial Intelligence (AI) and its resulting machine learning (ML) applications are emerging as an essential tool in clinical diagnostics, particularly in assessing cognitive function and supporting the diagnosis of neurodegenerative diseases. By analysing and integrating clinical data from different sources, ML models have demonstrated the ability to detect subtle patterns in cognitive assessments, offering new approaches to improve early detection of cognitive impairment in neurodegenerative diseases^24,25. This revolutionises traditional diagnostic approaches, making them more efficient, personalised, and scalable.

In Parkinson’s disease (PD), mild cognitive impairment (MCI) is an important stage for timely intervention, yet current diagnostic approaches can be time-consuming and may not fully capture patient variability. ML methods offer a way to complement neuropsychological evaluations by modelling complex relationships in clinical data that may relate to cognitive decline. Recent advances in ML, including ensemble learning methods and explainable AI, have shown potential to improve prediction accuracy and enhance interpretability, offering valuable insights for clinical decision-making^26,27,28.

Building on these advancements and capabilities, this study aims to improve the diagnostic process of MCI by exploring the identification of MCI subgroups overlooked by the clinical diagnostic reference test. To this end, we generated a data-driven diagnostic model using cross-sectional retrospective clinical data from both healthy controls and PwPD with different levels of cognitive decline. Rather than replacing current clinical criteria, this approach aims to complement existing diagnostic practices and provide insights that could refine early detection strategies in PD.

Results

Descriptive statistics of the study data

A total of 115 PwPD and 226 healthy controls, from the Luxembourg Parkinson’s Study (LuxPark) cohort data, met the eligibility criteria for analysis. Both groups present differences in socio-demographic and clinical characteristics. Regarding socio-demographic characteristics, the PD group consisted of older individuals with a similar level of education compared to the healthy control group, and a higher proportion of males (71.55% vs 53.98%). Clinically, the PD group exhibited significantly greater impairments than healthy controls across the cognitive and motor function MDS-Unified Parkinson’s Disease Rating Scale (MDS-UPDRS III), activities of daily living (ADL), subjective cognitive complaints and neuropsychiatric symptoms (depression, apathy). When examining domain-specific cognitive performance, a heterogeneous pattern emerged: the PD group showed significantly worse performance in attention, executive function, and memory, whereas no significant differences were observed in language and visuospatial abilities (Table 1).

Table 1 Main clinical and socio-demographic characteristics of Parkinson’s disease (PD) and healthy control participants at the baseline visit in the LuxPark cohort

Full size table

Data-driven model

To develop clinical decision support tools for diagnosing MCI, multiple data-driven models based on distinct clustering techniques (Gaussian Mixture Models (GMM), K-Means and Spectral Clustering (SC)) were built. An overview of the hyperparameters that were optimised and the best hyperparameter combination resulting from a grid search optimisation for each model is given in the Supplementary Table 1. Subsequently, the best models from the three algorithms were compared based on their prediction overlap with the clinical diagnostic reference test (MDS PD-MCI Level II). SC emerged as the most suitable method for diagnosing MCI, outperforming both methods, K-Means and GMM, in terms of sensitivity (recall) and area under the curve (AUC), while maintaining a comparable precision. A higher AUC indicates greater discriminatory power in distinguishing between NC and MCI, with SC achieving an AUC of 0.81, compared to 0.74 for GMM and 0.77 for K-Means. Additionally, the higher overall sensitivity of SC reflects its superior ability to correctly identify true positive MCI cases. More specifically, the sensitivity for MCI reached 0.97 — substantially higher than GMM (0.55) and K-Means (0.65). A comprehensive summary of the performance metrics is provided in Supplementary Table 2.

Having established SC as the most effective method for MCI diagnosis, we further examined the feature importance to elucidate which clinical and demographic factors most strongly influenced cluster separation for clinical interpretability, and to validate the clinical relevance of the identified subgroups. Domain-specific cognitive assessments were the primary drivers of cluster differentiation. Additionally, disease-related factors (e.g., age at PD diagnosis, disease duration), comorbidities (e.g. cardiovascular disease, diabetes) or other clinical motor features (e.g., MDS-UPDRS III, Hoehn and Yahr stage) also played a significant role in defining the resulting clusters. Among the domain-specific cognitive assessments, executive, memory and attention functions emerged as the most relevant, whereas language and visuospatial abilities had lower contributions.

Diagnostic prediction strength comparison between the clinical diagnostic reference test and the optimal data-driven model

The optimal data-driven model and the clinical diagnostic reference test were applied to the PD study population, and their diagnostic predictive strength was compared. The predictive performance of both diagnostic tools was evaluated based on their ability to maximise distinctions between PD-NC and PD-MCI phenotypes using a set of predefined measures of cognitive performance: an objective cognitive assessment measured by global cognition (MoCA total score), a physician-rated score of cognitive impairment (MDS-UPDRS 1.1) and a patient-reported outcome measure (PROM) measuring subjective cognitive complaints (Parkinson's Disease Questionnarie-39 (PDQ-39) subitems 30–33) hypothesised to differ between these groups. Overall, the effect sizes obtained by both diagnostic tools were comparable (Fig. 1). The data-driven model demonstrated higher effect sizes in global cognition (MoCA total score) and subjective cognitive complaints (PDQ-39 subitems 30–33) compared to the clinical diagnostic reference test, both of which were statistically significant (Fig. 1). Conversely, the clinical diagnostic reference test showed higher effect sizes in cognitive impairment rated by a physician (Fig. 1). A bootstrap approach was employed to assess the statistical significance of the differences in effect sizes across the set of clinical characteristics mentioned before in this subsection; however, no significant differences were observed. These results suggest that the data-driven model is non-inferior to the clinical diagnostic reference test, proving its ability for MCI diagnosis.

**Fig. 1: Diagnostic prediction strength. Clinical diagnostic reference test vs data-driven model.**

Identification of cognitively distinct PD subgroups

After confirming the non-inferiority of the data-driven model compared to the clinical diagnostic reference test, a detailed analysis was conducted to explore the ability of the data-driven approach to identify MCI subgroups potentially overlooked by the clinical diagnostic reference test.

PwPD were categorised into four subgroups by comparing the overlap of the PD-NC and PD-MCI groups between the data-driven model and the clinical diagnostic reference test approach. This procedure resulted in the following groups: (1) NC group, participants with PD classified as NC by both diagnostic tools (n = 49); (2) Misclassified, participants with PD classified as NC by the data-driven model and as MCI by the clinical diagnostic reference test (n = 1); (3) Misclassified, participants with PD classified as MCI by the data-driven model and as NC by the clinical diagnostic reference test (n = 26); (4) MCI group, participants with PD classified as MCI by both diagnostic tools (n = 39). The group consisting of a single patient was excluded from the follow-up analysis due to its insufficient sample size for meaningful statistical comparisons.

External variables measuring cognitive function (objective cognitive assessment measuring global cognition (MoCA total score), and a physician-rated score of cognitive impairment (MDS-UPDRS 1.1)) were selected to investigate and determine the clinical phenotype of the data-driven early-MCI group (n = 26). Distributional analyses were conducted to quantify the differences between the groups at the baseline visit. The data-driven early-MCI group demonstrated statistically significant differences from the NC group in MoCA total scores, indicating poorer global cognitive function (Fig. 2A). These results, beyond showing a specific profile of cognitive impairment, constitute a clinical validation of the existence of the data-driven early-MCI group as a separate clinical entity from NC. However, no significant differences were observed for the MDS-UPDRS 1.1 due to the lack of sensitivity of the score (Fig. 2B).

**Fig. 2: Validation of the data-driven model for MCI subgroup identification.**

Moreover, Cox proportional hazards models were applied to investigate whether the subgroups displayed different longitudinal trajectories in converting to moderate cognitive impairment. The trajectory of the data-driven early-MCI group was distinct and differed statistically from both the MCI and NC groups in progressing towards moderate impairment, as measured by the MoCA total score (global cognition) and the MDS-UPDRS 1.1 (physician-rated cognitive impairment), with global log-rank test p-values of p < 0.001 and 0.01, respectively (Fig. 3). Post-hoc analyses were conducted to study the differences between the NC and the data-driven early-MCI group, and significant differences were found for MoCA total score (0.03) and MDS-UPDRS 1.1 (0.04).

**Fig. 3: Validation of the data-driven model for MCI subgroup identification.**

Cognitive characterisation of the identified groups

After validating the existence of the data-driven early-MCI group as a distinct clinical entity, a detailed characterisation of the subgroups’ phenotype, at the baseline visit, was conducted (Table 2, Fig. 4 and Supplementary Table 3).

Fig. 4: Cognitive impairment profile.

Full size image

A Bar plot showing the proportion of individuals within each group (NC: green, data-driven early-MCI: yellow, MCI: red) without any impairments or with an impairment by cognitive domain (attention, executive, memory, visuospatial, and language). B Bar plot showing the proportion of individuals without impairment (green), single domain impairment (yellow), and multiple domain impairment (red) within the NC, data-driven early-MCI, and MCI groups. Percentages, in both graphs, were calculated relative to the total number of participants in each group (NC: n = 49; data-driven early-MCI: n = 26; MCI: n = 39). C Box plot showing the differences in depression and subjective cognitive decline across the identified groups. Each subgroup is represented by a colour: NC (green), data-driven early-MCI group (yellow), and MCI (red). Statistical comparisons were performed using two-tailed Mann–Whitney U tests (due to the non-normal distribution of the features), with p-values adjusted for multiple comparisons using the Benjamini–Hochberg procedure. The alpha level was set at α = 0.05. Outliers are represented as individual diamonds, and the interquartile ranges are shown within each box.

Table 2 Clinical and socio-demographic characterisation of the groups

Full size table

The NC group (n = 49) presents low impairment, with no amnestic nor multidomain impairment, where over 75% of individuals (n = 37) do not show any impairment (Fig. 4). Within the data-driven early-MCI group (n = 26), 23% of the individuals (n = 6) did not present any cognitive impairment and the remaining ones (n = 20) present a single domain impairment profile where non-amnestic impairment dominates, with memory being impaired in 15.4% of the individuals (n = 4). The most impaired domain was executive function (n = 9, 34.6%) followed by memory (n = 4, 15.4%), visuospatial (n = 3, 11.5%) and attention (n = 3, 11.5%). On the other hand, the MCI group presents a completely distinct cognitive impairment profile, with all individuals presenting an impairment in at least one domain and the prevalence of multi-domain impairment being higher than 90% (Fig. 4). The most impaired domains were executive function and visuospatial abilities, affecting over 72% (n = 28) and 64% (n = 25) of individuals, respectively (Fig. 4). Amnestic impairment (memory) affected 41% of the MCI individuals (n = 16). A more detailed characterisation of the individual cognitive impairment profiles is shown in Supplementary Fig. 2.

Looking at the different neuropsychological assessments between the data-driven early-MCI and the NC group, a heterogeneous pattern of cognitive impairment was found in the data-driven early-MCI group. Statistically significant differences were observed in attention Trail Making Test Part A (TMT-A), executive function Trail Making Test Part B minus Part A (TMTB-A) and Frontal Assessment Battery (FAB) and verbal memory (Word list learning total score, Word list Delayed Recall), where the misclassified group showed a higher cognitive impairment in these domains when compared to the NC group. No differences were found in visuospatial abilities (judgement of line orientation, MoCA clock) or language (phonemic fluency S, MoCA naming) (Table 2). Moreover, additional statistically significant differences on other clinical characteristics were found, such as disease stage (Hoehn and Yahr scale), severity of neuropsychiatric symptoms (depression and apathy) and subjective cognitive complaints (PDQ-39 subitems 30–33), indicating a worsening of the condition of the data-driven early-MCI group compared to the NC group. However, no differences were observed on motor function (MDS-UPDRS III), ADL or disease-related factors such as disease duration or age at PD onset (Table 2 and Fig. 4). Apart from initially detected differences in age, which were no longer significant after adjustment, no major differences were found in socio-demographic variables (Table 2).

These results suggest that the data-driven early-MCI group has a higher cognitive impairment profile when compared to the NC group in attention, memory and executive functions and may be a distinct clinical entity.

Discussion

In this study we developed and tested a new data-driven diagnostic tool for the early detection of cognitive impairment by targeting MCI in PwPD. By leveraging different sources of clinical data covering a comprehensive neuropsychological test battery, motor and staging characteristics (e.g., MDS-UPDRS III, Hoehn and Yahr stage), disease-related factors, comorbidities and clustering methodologies combined with different domain weighting, the developed model demonstrated non-inferiority to the clinical diagnostic reference test (MDS PD-MCI Level II) in its ability to produce group classifications that showed comparable effect size differences.

An intriguing finding of our diagnostic approach was the ability of the data-driven model to identify subgroups of MCI with a mild impairment pattern, which was validated by a longitudinal analysis. Interestingly, this subgroup cannot be captured by the clinical gold standard (MDS PD-MCI Level II). Here, the data-driven model classified patients with NC as MCI patients. This data-driven early-MCI group differed from the NC group in terms of global cognition, executive memory and attention domains but showed milder levels of cognitive impairment than the MCI group. Importantly, analyses on the follow-up visit data revealed that this data-driven early-MCI group displays an intermediate cognitive progression trajectory in terms of global cognition (MoCA total score) and physician-rated cognitive impairment (MDS-UPDRS 1.1). Interestingly, no significant differences were observed in physician-rated cognitive impairment (MDS-UPDRS 1.1) at the baseline visit, where both true NC and data-driven detected MCI patients were similarly rated by these physicians. These findings suggests that the group identified by the data-driven model is not a misclassified group, but a clinically distinct profile detected by the data-driven model, that extends beyond cognitive decline related to ageing (NC), the clinical staging interpretation remains ambiguous as the mild profile observed in these patients could reflect either an early or subtle mild cognitive impairment phenotype. In the comparison of multi-domain versus single-domain impairment, the MCI group (n = 39) showed a high prevalence of multi-domain impairment, affecting 90% of individuals (n = 35). In contrast, in the data-driven early-MCI group (n = 27), single-domain impairment predominated, observed in 77% of individuals (n = 21). For the NC group (n = 49), the majority of participants – 75% (n = 37) – showed no cognitive impairment. Although the proportion of multi-domain impairment on the MCI group might be a bit higher than expected, several studies reported similar or comparable results when applying MDS PD-MCI Level II criteria with a 1.5 SD threshold, such as Marras et al. and Goldman et al., reporting 93% of multiple-domain impairment within the MCI group and Broeders et al. reporting lower percentages (65%) but still comparable to our findings^21,29,30.

The developed data-driven diagnostic tool offers a novel approach to diagnosing MCI by emphasising the most relevant factors in the clustering process. Specifically, domain-specific cognitive assessments emerged as the primary drivers in the data-driven model. Additionally, disease-related factors (e.g., age at diagnosis, disease duration), comorbidities (e.g., cardiovascular disease, diabetes), and other clinical characteristics (e.g., MDS-UPDRS III, Hoehn and Yahr stage) played a significant role in defining the resulting clusters. These findings underscore the relevance of non-cognitive factors, including comorbidities, disease-related factors, and clinical characteristics, in MCI diagnosis and specifically in the identification of subgroups overlooked by the clinical diagnostic test. The model’s strength lies in its heightened sensitivity to specific cognitive domains, such as memory, executive function, and attention. In contrast, the clinical diagnostic reference test assigns equal weight to all cognitive domains¹⁶. The absence of predefined cutoffs in the data-driven model, combined with its domain-specific weighting, allowed a different perspective, making it a comprehensive tool in terms of MCI diagnostic. Within the domain-specific cognitive assessments, the most important features for the data-driven model were Word list Delayed Recall (memory function), TMT B-A (executive function), Word list learning total score (memory function), and FAB (executive function).

Diving into the clinical staging more thoroughly, we analysed follow-up data from both the MoCA total score and the MDS-UPDRS 1.1. These analyses revealed distinct but stable cognitive trajectories across the three groups. The data-driven group that was detected as having early or subtle mild cognitive impairment, exhibited consistently a milder impairment profile compared to the MCI group, but higher than the NC one. The progression trajectory did not overlap with the one of the NC group, nor did it converge with the more pronounced decline seen in the MCI group. This pattern suggests that the data-driven model may have detected a group that corresponds to an early or subtle mild cognitive impairment group.

The analysis conducted at the baseline visit provided additional insights regarding the clinical staging interpretation further supporting the early-MCI stage rather than the subtle mild cognitive impairment one. The predominance of executive impairment, alongside deficits in working memory and attention, commonly referred to as a frontal cognitive impairment phenotype³¹, is consistent with cognitive profiles observed in early-stage MCI individuals in the literature^32,33. Notably, deficits in mental flexibility (executive function) and working memory (attention) are among the earliest cognitive domains affected^32,33. In contrast, visuospatial deficits were less pronounced in this group, which aligns with the expected clinical picture in the earliest phases of MCI, as they are usually presented in more advanced stages of MCI. Meanwhile, individuals in the MCI group showed more substantial visuospatial impairments, as anticipated from the literature^32,33. Moreover, individuals in the data-driven MCI group showed significantly higher levels of self-perceived cognitive deficits (subjective cognitive complaints), as well as more severe depressive and apathy symptoms (Fig. 4 and Table 2). Both depressive and apathy symptoms are well-established risk factors for the subsequent development of MCI and dementia, which can be indicative of an early MCI profile^34,35,36. Notably, depressive symptoms are known to influence both subjective cognitive complaints and frontal-executive cognitive impairment profiles^35,36,37. While this group shows subjective cognitive complaints and mild objective impairments, its clinical interpretation remains challenging. On one hand, the presence of clear subjective complaints could suggest a link with subjective cognitive decline (SCD), a well-established pre-MCI risk state in Alzheimer’s disease. However, the lack of consensus and formal guidelines for defining SCD in Parkinson’s disease leads to high heterogeneity across people with PD-SCD, making characterisation in this population challenging^38,39. SCD has been conceptualised largely in the Alzheimer’s field, if the SCD definition proposed by the subjective cognitive decline initiative (SCD-I) in Alzheimer’s disease would be used as a proxy, most individuals in this group would not meet the criteria as they already present measurable deficits in executive function, attention, and working memory⁴⁰, which in combination with elevated depressive and apathy symptoms may place them beyond a purely preclinical stage of SCD and closer to an early or mild cognitive impairment phenotype.

This raises interesting conceptual and practical questions: does this group reflect individuals on the cusp of MCI, where subjective concerns and psychiatric symptoms act as early markers of future progression, or does it constitute a mild but stable MCI subtype, particularly sensitive to executive dysfunction. The longitudinal analyses suggest that these patients do not follow the same trajectory as cognitively normal individuals, indicating that this is not merely a subjective or psychiatric phenomenon, but instead a subtle, quantifiable decline with clinical relevance. Nevertheless, more extended follow-up is required to determine whether this profile represents a stable, mild endophenotype of MCI or a dynamic pre-MCI stage progressing toward more typical MCI presentations.

Altogether, the identification of a group overlooked by the clinical diagnostic reference test underscores the potential of data-driven approaches to enhance and complement clinical decision-making by identifying subgroups and subtle cognitive changes. The mechanisms by which the data-driven model identified this novel group provide valuable insights and a foundation for future studies aimed at improving MCI diagnosis, with a focus on developing abbreviated data-driven models prioritising the study of memory and executive functions. Such models could facilitate the creation of MCI diagnostic tools that are more generalisable and applicable in routine clinical care²⁸.

This study has some methodological limitations that should be acknowledged. A key limitation is the lack of clinical validation of the cognitive status of the individuals from the standard of care physician, which complicates the benchmarking and validation of the data-driven model in terms of accuracy and precision. Also, it should be noted that MDS-UPDRS 1.1 and PDQ-39 subitems used through all the analyses are short and not comprehensive cognitive scales. In particular, the PDQ-39, when used to assess ADL impairment in PwPD, has limitations due to its nature as a self-reported outcome, as patients with dementia may not provide fully reliable information. However, in our cohort, the PDQ-39 was the most widely available ADL assessment. While other informant-based tools, such as the Functional Activities Questionnaire (FAQ), were introduced later in the study, their data coverage was more limited.

In addition, the lack of normative values specific to the Luxembourgish population posed a challenge. To address this, statistical procedures based on regression analyses were applied to estimate normative data from the healthy control group. Although this approach has been proposed and used in the literature, it is less precise than established normative values, which are typically derived from larger and more representative samples. As a result, this limitation, combined with the reduced number of available healthy controls, may introduce variability in the calculation of z-scores and the identification of cognitive impairments. An additional source of variability in the z-score calculations arises from the fact that some individuals performed the tests in a non-native language due to the linguistic diversity of Luxembourg’s population. Another important consideration is the lack of universal consensus on how to group neuropsychological tests into cognitive domains. Different grouping strategies may yield varying results, potentially influencing the interpretation of domain-specific impairments. Additionally, the selection of participants for the extensive neuropsychological assessment represents a potential source of bias, as it was performed on a voluntary basis rather than systematically across the cohort.

In applying the MDS PD-MCI Level II criteria, the choice of the 1.5 SD threshold to define cognitive impairment is a critical factor that may introduce heterogeneity in the diagnosis of MCI. Alternative thresholds, such as 1 SD or 2 SD, could yield slightly different outcomes. However, consistent with the recommendations of Dalrymple-Alford et al.²², we considered that 1.5 SD provides the most suitable framework for MCI detection. Another limitation is that, while we followed the cognitive testing–based inclusion criteria for PD-MCI diagnosis, we were unable to fully apply the broader PD-MCI criteria proposed by Litvan et al.¹⁶, which require documentation of gradual cognitive decline and preserved ADL, due to the lack of consistent longitudinal data and detailed informant-based ADL measures across all participants.

Lastly, the effects of levodopa and other dopaminergic drugs on cognitive performance were not accounted for in this study. While these medications play a crucial role in alleviating and controlling motor symptoms in PwPD, their impact on cognition remains unclear and mixed. In some individuals, they may enhance executive function, whereas in others, they can contribute to impulse control disorders, learning impairments, or even psychosis^41,42,43. Other aspects, such as genetic factors, were not considered.

Besides clinical or patient-specific characteristics, some methodological limitations should be acknowledged, such as the lack of external validation for the data-driven model. Although the results have clinical validation, successful replication in independent cohorts would further ensure the absence of overfitting and strengthen methodological confidence in the findings. Moreover, the relatively small sample size (n = 116) compared to the number of input features (n = 21) may increase the risk of overfitting and introduce potential bias into the model. Further research is needed to validate the data-driven model in independent cohorts, as well as to conduct feasibility studies assessing the potential applicability of the screening model in routine clinical practice by evaluating factors such as data availability, usability, and resources required for implementation. Finally, further research is also warranted to directly compare the data-driven and cognitive testing–based models with the full gold-standard clinical assessment in a prospectively designed study including detailed longitudinal and informant-based ADL data.

We demonstrate how a data-driven model can provide with a different perspective in the diagnosis of MCI allowing to identify early patterns of impairment. Through the comparison with the standard clinical assessment test (MDS PD-MCI Level II), a unique clinically distinct subgroup has been identified, whose subtle cognitive changes may be overlooked by the traditional criteria. This allows for early identification of patients with a higher risk, thereby facilitating early interventions that can slow down the progression to dementia.

Methods

Study population

The Luxembourg Parkinson’s study (LuxPark) is a prospective longitudinal cohort of individuals with Parkinsonism, including more than 850 idiopathic and atypical individuals, and around 900 healthy control participants^44,45. All the subjects have signed a written informed consent, and the collection has been approved by the National Ethics Board (CNER Ref: 201407/13) and Data Protection Committee (CNPD Ref: 446/2017). PwPD were followed annually for up to 8 years while healthy controls had one follow-up visit after 4 years. For the current analysis, only participants with idiopathic PD, meeting the inclusion criteria proposed by the United Kingdom Parkinson’s Disease Society Brain Bank Clinical Diagnostic Criteria⁴⁶ (n = 736) and with complete in-depth neuropsychological (NPSY) assessments at the baseline visit (n = 196) were selected. The primary reason for this study population reduction is the availability of neuropsychological assessment data. Within the LuxPark study, two levels of comprehensiveness were used for the assessments, with the more extensive evaluation being optional and therefore conducted in fewer patients. Since the application of MDS PD-MCI Level II criteria requires the highest level of assessment, the number of eligible PD patients and healthy controls was considerably reduced. To ensure that cognitive function was not influenced by neurological or psychiatric conditions unrelated to PD, participants with a score higher or equal than 30 out of 63 in the Beck Depression Inventory-I (BDI-I)⁴⁷, diagnosed with bipolar disorder or schizophrenia were excluded. Additionally, participants with self-reported seizures, strokes, traumatic brain injuries (TBI) or brain tumours were excluded. Furthermore, healthy control participants at risk of developing PD, defined by a family history of PD or the presence of risk factors such as Rapid Eye Movement (REM) sleep behaviour disorder (RBD), defined by a score ≥ 6 in the REM Sleep Behaviour Disorder Screening Questionnaire (RBDSQ)⁴⁸, diabetes, alcohol consumption, smoking, hypertension and cardiovascular disease, were excluded from the analysis. Healthy control participants presenting any psychiatric or neurological condition mentioned above were also removed. Lastly, participants with Parkinson’s disease dementia (PDD) were excluded. This was determined using the criteria proposed by Dubois et al.⁴⁹, which require that Parkinson’s disease precedes the onset of dementia, that there is global cognitive impairment defined as an MMSE score below 26 (corresponding to a MoCA score below 22)⁵⁰, and that dementia has a significant impact on ADL, operationalized as a score greater than 14 on the ADL subscore of the PDQ-39. In addition, participants were required to have no evidence of major depression (BDI-I score ≥30)⁴⁷ and absence of delirium. The application of the inclusion and exclusion criteria resulted in a dataset of 226 healthy controls and 115 PwPD constituting the study population for the analyses presented in this paper. Follow-up visits from this study population were also available for conducting further meta-analyses.

Overview of available data

Data collected from participants included socio-demographic characteristics, clinical history, details of medication use, results of clinical examinations, PROMs and clinician-reported outcome measures (ClinRO), as well as other relevant disease-related factors. Socio-demographic variables comprised age, sex, years of education. The clinical history section covered past diagnoses, such as diabetes and hypertension, as well as lifestyle factors that are potential risk factors, such as smoking and alcohol consumption, and other disease-related factors, such as age at PD onset and disease duration. PROMs were collected through psychometric scales and quality of life surveys. Depressive symptoms were assessed using the BDI-I⁴⁷, and apathy using the Starkstein Apathy Scale (SAS)⁵¹. ADL were measured using the sum of PDQ-39 subitems 11 to 16⁵², and subjective cognitive complaints using the sum of PDQ-39 subitems 30 to 33. ClinRO measured cognitive impairment in patients. Examination data included assessments of motor function using the MDS-UPDRS-III⁵³, as well as assessments of cognition focusing on both global cognition and specific cognitive domains. Global cognitive functioning was measured using the MoCA¹¹. A comprehensive neuropsychological test battery was also administered to evaluate performance across the five main cognitive domains: attention, executive function, memory, visuospatial abilities, and language, further details on the test selection is provided in the 'Neuropsychological test selection' subsection. Additional information on the implemented tests can be found in Hipp et al. (2018)⁴⁴.

Defining PD-MCI and cognitive domain impairment

In the present study, PD-MCI was defined according to the MDS PD-MCI criteria¹⁶. As mentioned in the introduction, this definition is divided into two levels of comprehensiveness: (a) Level I (abbreviated assessment), where only one test per cognitive domain is used and (b) Level II (comprehensive assessment), which involves the usage of two tests per domain. Consequently, the definition of PD-MCI varies across levels. While both levels require impairment in at least two tests, the interpretation of these impairments differs. In Level I, a patient must show impairment in at least two cognitive domains, whereas in Level II, impairment in only one domain (assessed by 2 independent tests) is sufficient to diagnose PD-MCI. Another distinction is that in Level I, individuals must perform within the normative threshold in 80% of the tests (4 out of 5) to avoid classification as MCI, whereas in Level II, this percentage increases to 90% (9 out of 10)¹⁶. In our study, a PwPD was classified as having PD-MCI if impairment was observed in at least two tests, regardless of whether the NPSY tests assessed the same domain or different domains following the MDS PD-MCI level II criteria²⁰. Reduced performance on an NPSY test was defined as a score of 1.5 SD or more below the age-, sex-, and years of education adjusted normative mean further details on how the z-scores are defined are given in the 'Normative data calculation' subsection. This threshold was chosen as it provides a suitable balanced criteria, avoiding an excess of false positives associated with less conservative thresholds (e.g., 1 SD) while also minimising false negatives linked to more restrictive thresholds (e.g., 2 SD)^16,22.

Neuropsychological test selection

In cases where multiple tests were available for a given domain, the selection was based on three main factors: (1) to maximise the number of participants included in the study by giving priority to tests that have been carried out on a maximum number of subjects, (2) using current knowledge on which specific tests provide better accuracy for assessing MCI in PwPD²¹, and (3) the expertise of neuropsychologists in allocating specific features to each domain, since most tests cover several domains.

For the current analysis, two tests were selected for each of the five major cognitive domains. Attention was assessed using the TMT-A⁵⁴ and the Block-Tapping test (Forward)⁵⁵. Executive function was evaluated using the FAB⁵⁶ and the TMT B-A⁵⁴. Verbal episodic memory performance was measured using the Word list learning total score and Word list Delayed Recall from the word list memory subset from the Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) (RRID:SCR_003016) battery⁵⁷. Visuospatial abilities were assessed with the Judgement of Line Orientation test⁵⁸ and the Clock Drawing Test, part of the MoCA assessment¹¹. Finally, language was evaluated using the Verbal Fluency Test (phonemic) and the Naming sub-item within the MoCA¹¹.

Normative data calculation

The definition of impairment is based on z-scores, which represent how many SDs an individual’s test performance deviates from the population mean after adjusting for relevant confounders. Therefore, these z-scores are critical to the implementation of the MDS PD-MCI Level II criteria, whose calculation relies on the availability of normative data for the given population.

As population-specific normative data are not available for Luxembourg's population, normative values (expected scores for non-PD individuals) were derived from the healthy control subgroup included in this study, following an approach previously described in the literature⁵⁹. In this approach, the normative values are the expected values in an NPSY test for a non-PD patient adjusted to their socio-demographic characteristics (age, sex, and years of education). These socio-demographic features are commonly considered confounding variables in NPSY assessments because they represent the most relevant non-pathological factors that can significantly influence cognitive performance. The approach outlined in Shirk et al.⁵⁹, is claimed to effectively account for the effect of confounders on cognitive ability by using multivariate linear regression and has been applied by other studies^60,61,62. In this regression, the outcome variable (Y_obs) is the observed cognitive test score, and the covariates or regressors are the confounders (sex, age, years of education). This method yields coefficients that quantify the amount of change in the outcome variable per unit change in the covariate, while controlling for all other confounders. These coefficients, together with the intercept, are used to calculate the normative values according to Equation 1. The intercept is modified by adding or subtracting the effect of the confounders, calculated as the product between the coefficient and the individual’s confounding variable, resulting in the expected value (Y_exp) adjusted specifically to the age, sex, and education of the individual. Once the normative value is obtained, Equation 2 is applied to calculate the z-scores. An overview of the z-score distributions is presented in Supplementary Fig. 1.

Equation 1

$${\rm{B}})$$

Where:

Y: Intercept of the linear regression

${\boldsymbol{f}}{\boldsymbol{age}}$: Coefficient for the age variable

Age: Age of the individual

${\boldsymbol{f}}{\boldsymbol{ed}}$: Coefficient for the education level variable

ed: Years of education of the individual

${\boldsymbol{f}}{\boldsymbol{sex}}$: Sex coefficient for the sex variable

Sex: Sex of the individual

The calculation of z-scores is based on Equation 2.

Equation 2

$${\boldsymbol{Z}}=\frac{{{\boldsymbol{Y}}}_{{obs}}-{{\boldsymbol{Y}}}_{\exp }}{{\boldsymbol{SD}}}$$

Where:

Z: Z-score of a cognitive test of an individual

Y_obs: Individual's observed score on a given cognitive test

Y_exp: Expected (predicted) population mean score (normative value)

SD: Standard deviation from the normative data

Data-driven model

Unsupervised learning approaches, specifically clustering algorithms, were employed. Three clustering algorithms were evaluated: SC using symmetric normalised Laplacian, due to its highest robustness and Euclidean distance for constructing the affinity matrix; K-Means, and GMM.

The input data for the MCI diagnostic model consisted of the selected NPSY assessments encompassing all five major cognitive domains at baseline visit, described in the 'Neuropsychological test selection' subsection of the methodology. Disease-related factors, such as age at diagnosis and disease duration, were also considered. Additionally, well-established self-reported risk factors, such as comorbidities including cardiovascular disease, hypertension, and diabetes, and lifestyle factors such as smoking and alcohol consumption⁶³, were included in the analysis. Finally, other variables describing the progression of PD were also included: motor symptoms (MDS-UPDRS III), disease stage (Hoehn and Yahr), and RBDSQ⁴⁸. For this analysis a cross-sectional dataset covering the baseline visit was used.

Before training, input features underwent tailored preprocessing to enhance model performance. Categorical variables, such as risk factors, were encoded numerically using a one-hot encoder, while numerical variables were normalised using robust scaling. NPSY test scores, already standardised as Z-scores, required no additional encoding. The TMT B-A and TMT-A scores were inverted so that lower scores corresponded to worse performance, aligning with the interpretation of other neuropsychological measures where lower scores indicate poorer performance.

To optimise the clustering performance, a grid search was used to find the optimal set of hyperparameters for the distinct clustering algorithms. Hyperparameter optimisation was guided by both internal and external criteria. Internal validation focused on cohesion and separation metrics, which assess within-cluster similarity and between-cluster distinctiveness⁶⁴. To conduct the internal validation, the silhouette score⁶⁵, a widely adopted internal clustering metric, was used to get the best hyperparameters for each of the clustering trains. This metric provides normalized scores and helps mitigate cluster size-related bias⁶⁶.

Given the objective of distinguishing between PD-NC and PD-MCI, clinical assumptions regarding the phenotypes of these groups were incorporated (external criteria). These rules assumed that individuals with PD-MCI would be older, have a longer disease duration, and advanced Hoehn and Yahr stages, and would have lower MoCA total scores⁶⁷. Clustering results inconsistent with these clinical expectations were excluded from further analysis.

Aligning with the purpose of distinguishing between PD-NC and PD-MCI, the number of clusters was set to two. Three or four clusters were also tested, but two clusters remained the most optimal solution (Supplementary Table 1). For enhancing interpretability, feature importance was assessed by examining the contribution of each input variable to the clustering by using the mutual information score.

A global comparison between the best models from the different algorithms was conducted by evaluating reported metrics, such as accuracy, sensitivity, and AUC in relation to the clinical diagnostic reference test (MDS PD-MCI Level II).

Comparison of the diagnostic strength between the clinical diagnostic reference test and the best performing data-driven model

The diagnostic strength of the clinical diagnostic reference test (MDS PD-MCI Level II), and the best performing data-driven model were assessed by evaluating their ability to maximise distinctions between PD-NC and PD-MCI phenotypes based on a set of clinical characteristics hypothesised to differ between these groups⁶⁷: an objective cognitive assessment, MoCA total score, which evaluates global cognition; a physician-rated score, MDS-UPDRS 1.1, which evaluates cognitive impairment; and a PROM, defined as the sum of PDQ-39 subitems 30 to 33, which evaluates subjective cognitive complaints.

The effect sizes between procedures were compared using a bootstrap resampling approach with 10,000 iterations. In each iteration, data for both groups (NC and MCI) and both procedures were resampled with replacement. Cohen’s d was computed for each procedure, and the difference between the effect sizes was stored. The 95% confidence interval of the bootstrapped differences was estimated from the 2.5^th and 97.5^th percentiles. A confidence interval including zero indicated a non-significant difference between effect sizes.

Identification and characterisation of cognitively distinct subgroups

Cognitively distinct subgroups were identified by comparing how the PD-NC and PD-MCI groups overlap between the data-driven model and the clinical diagnostic reference test, and by running sub-analyses on the matched and misclassified groups resulting from this comparison.

Different analyses were conducted, both at the baseline visit (cross-sectional data) and follow-up visits (longitudinal data), using the MoCA total score (global cognition) and the MDS-UPDRS 1.1 (physician-rated cognitive impairment), to investigate the cognitive profile of the subgroups and validate the existence of the identified subgroups as distinct clinical entities.

Distributional analyses at the baseline visit (cross-sectional data) were conducted by selecting the most appropriate statistical method based on the feature distribution, while the statistical significance and effect sizes were reported as indicated in the 'Statistical analyses' subsection.

A longitudinal data analysis using Cox proportional-hazards models, which includes up to 8 visits of the whole study population, was conducted to assess significant differences in cognitive decline trajectories among the subgroups. For the Cox proportional hazards model, it was necessary to define a specific event or threshold in each outcome variable. In each case, the event was chosen to reflect the progression of individuals toward a level of cognitive impairment that significantly affects activities of daily living, beyond mild deficits. For MoCA (global cognition), a score of ≤21 was used^49,50, indicating decreased global cognitive efficiency, while for MDS-UPDRS item 1.1 (physician-rated cognitive impairment), a score of ≥3 was applied, based on the established clinical interpretation of the subitem. We adopted a more conservative threshold, following Dubois et al.⁴⁹, who defined decreased global cognitive efficiency as a score of <26. As MMSE data were not collected in this study, MoCA–MMSE conversion studies were used to guide the selection of the cut-off of MoCA ≤21, as a MMSE score of 26 corresponds to a MoCA score of 22. Individuals that already reached the endpoint at the first visit were excluded from this analysis due to the impossibility of determining at what time point the threshold was reached. A total of four PwPD died before experiencing the event of interest. Given the small number, and to preserve statistical power, these cases were treated as right-censored at the time of death, rather than modelled as competing events. Hazard ratios (HR) and 95% confidence intervals (CI) were calculated, and comparisons across groups were conducted using the global log-rank test. Each Cox model was adjusted for age, baseline score of the cognitive variable, and years of education. Sex was excluded from the analysis due to its violation of the proportional hazard assumption.

A detailed analysis of the group characteristics was conducted at the baseline visit to identify key differences across the socio-demographic and clinical profile (e.g. cognitive and motor function (MDS-UPDRS III), and other disease-related factors (age at diagnosis, disease duration)), see 'Statistical analyses' subsection for additional details on the statistical methods employed.

Statistical analyses

Distributional analyses were performed using the two-tailed t-test for numerical variables (normally distributed), two-tailed Mann–Whitney U test was applied to numerical non-normally distributed features⁶⁸, while categorical and ordinal data were analysed using the chi-squared test⁶⁹. Statistical significance was reported using p-values, selecting an alpha level of α = 0.05, and effect sizes were calculated to quantify the magnitude of the observed differences. Cohen’s d was used for t-test; Standardised Point biserial correlation coefficient was used for Mann–Whitney U test, and Phi and Cramer’s V were used for Pearson’s chi-squared, Phi being specific for 2 × 2 contingency tables⁷⁰.

Ordinal variables included the Hoehn and Yahr stage and the physician-rated cognitive impairment score (MDS-UPDRS 1.1). Categorical variables consisted of sex. Numerical variables not normally distributed included measures such as the TMT-A, Block Span, TMT B-A, FAB, Word list learning total score, Word list delayed recall, Judgement of Line Orientation, MoCA clock drawing, Phonemic fluency and naming (MoCA subitem), MoCA total score, disease duration, and activities of daily living (PDQ-39 subitems 11–16). Lastly, normally distributed numerical variables included years of education, MDS-UPDRS III, age, and age at PD onset. Normality of the features was checked using Shapiro–Wilk test.

All statistical analyses and machine learning models were performed using Python (version 3.10.7). All clustering methods were implemented using Scikit-learn version 1.2.1.

Data availability

Patient data used in the preparation of this manuscript were obtained from the National Centre of Excellence in Research on Parkinson’s Disease (NCER-PD). NCER-PD datasets are not publicly available, as they are linked to the Luxembourg Parkinson’s Study and its internal regulations. The NCER-PD Consortium is willing to share its available data. Its access policy was devised based on the study ethics documents, including the informed consent form, as approved by National Ethics Board (CNER Ref: 201407/13) and Data Protection Committee (CNPD Ref: 446/2017). Requests to access datasets should be directed to the Data and Sample Access Committee via email: [request.ncer-pd@uni.lu](mailto:request.ncer-pd@uni.lu).

Code availability

The underlying code for this study is available on Gabriel Martinez / Data driven model for MCI screening · GitLab.

References

Lau, L. M. de & Breteler, M. M. Epidemiology of Parkinson’s disease. Lancet Neurol. 5, 525–535 (2006).
Article PubMed Google Scholar
Wolff, A. et al. Parkinson’s disease therapy: what lies ahead? J. Neural Transm. 130, 793–820 (2023).
Article PubMed Google Scholar
Duncan, G. W. et al. Health-related quality of life in early Parkinson’s disease: the impact of nonmotor symptoms. Mov. Disord. 29, 195–202 (2014).
Article PubMed Google Scholar
Mantovani, E., Bressan, M. M., Tinazzi, M. & Tamburin, S. Towards multimodal cognition-based treatment for cognitive impairment in Parkinson’s disease: drugs, exercise, non-invasive brain stimulation and technologies. Curr. Opin. Neurol. 37, 629–637 (2024).
Article PubMed Google Scholar
Watson, G. S. & Leverenz, J. B. Profile of Cognitive Impairment in Parkinson’s disease. Brain Pathol. 20, 640–645 (2010).
Article PubMed PubMed Central Google Scholar
Yan, Y. et al. The effect of multi-component exercise intervention in older people with Parkinson’s disease and mild cognitive impairment: a randomized controlled study. Geriatr. Nurs. 60, 137–145 (2024).
Article PubMed Google Scholar
Caviness, J. N. et al. Defining mild cognitive impairment in Parkinson’s disease. Mov. Disord. 22, 1272–1277 (2007).
Article PubMed Google Scholar
Petersen, R. C. et al. Mild Cognitive Impairment: clinical characterization and outcome. Arch. Neurol. 56, 303–308 (1999).
Article PubMed CAS Google Scholar
Winblad, B. et al. Mild cognitive impairment – beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive Impairment. J. Intern. Med. 256, 240–246 (2004).
Article PubMed CAS Google Scholar
Díaz-Orueta, U., Blanco-Campal, A. & Burke, T. Rapid review of cognitive screening instruments in MCI: proposal for a process-based approach modification of overlapping tasks in select widely used instruments. Int. Psychogeriatr. 30, 663–672 (2018).
Article PubMed Google Scholar
Nasreddine, Z. S. et al. The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 53, 695–699 (2005).
Article PubMed Google Scholar
Folstein, M. F., Folstein, S. E. & McHugh, P. R. “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198 (1975).
Ciesielska, N. et al. Is the Montreal Cognitive Assessment (MoCA) test better suited than the Mini-Mental State Examination (MMSE) in mild cognitive impairment (MCI) detection among people aged over 60? Meta-analysis. Psychiatr. Pol. 50, 1039–1052 (2016).
Article PubMed Google Scholar
Hoops, S. et al. Validity of the MoCA and MMSE in the detection of MCI and dementia in Parkinson disease. Neurology 73, 1738–1745 (2009).
Article PubMed PubMed Central CAS Google Scholar
Rosenblum, S. et al. The Montreal Cognitive Assessment: is it suitable for identifying mild cognitive impairment in Parkinson’s disease?. Mov. Disord. Clin. Pract. 7, 648–655 (2020).
Article PubMed PubMed Central Google Scholar
Litvan, I. et al. Diagnostic criteria for mild cognitive impairment in Parkinson’s disease: Movement Disorder Society Task Force guidelines. Mov. Disord. 27, 349–356 (2012).
Article PubMed PubMed Central Google Scholar
Bezdicek, O. et al. The diagnostic accuracy of Parkinson’s Disease mild cognitive impairment battery using the Movement Disorder Society Task Force criteria. Mov. Disord. Clin. Pract. 4, 237–244 (2017).
Article PubMed Google Scholar
Goldman, J. G. et al. Defining optimal cutoff scores for cognitive impairment using Movement Disorder Society Task Force criteria for mild cognitive impairment in Parkinson’s disease. Mov. Disord. 28, 1972–1979 (2013).
Article PubMed PubMed Central Google Scholar
Stefanova, E. et al. Mild cognitive impairment in early Parkinson’s disease using the Movement Disorder Society Task Force criteria: cross-sectional study in Hoehn and Yahr Stage 1. Dement. Geriatr. Cogn. Disord. 40, 199–209 (2015).
Article PubMed Google Scholar
Boel, J. A. et al. Level I PD-MCI using global cognitive tests and the risk for Parkinson’s disease dementia. Mov. Disord. Clin. Pract. 9, 479–483 (2022).
Article PubMed PubMed Central Google Scholar
Goldman, J. G. et al. Diagnosing PD-MCI by MDS task force criteria: how many and which neuropsychological tests? Mov. Disord. J. Mov. Disord. Soc. 30, 402–406 (2015).
Article Google Scholar
Dalrymple-Alford, J. C. et al. Characterizing mild cognitive impairment in Parkinson’s disease. Mov. Disord. 26, 629–636 (2011).
Article PubMed Google Scholar
Geurtsen, G. J. et al. Parkinson’s disease mild cognitive impairment: application and validation of the criteria. J. Park. Dis. 4, 131–137 (2014).
Google Scholar
Grueso, S. & Viejo-Sobera, R. Machine learning methods for predicting progression from mild cognitive impairment to Alzheimer’s disease dementia: a systematic review. Alzheimers Res. Ther. 13, 162 (2021).
Article PubMed PubMed Central Google Scholar
Yousefi, M. et al. Machine learning based algorithms for virtual early detection and screening of neurodegenerative and neurocognitive disorders: a systematic review. Front. Neurol. 15, 1413071 (2024).
Article PubMed PubMed Central Google Scholar
Basta, M. et al. Personalized screening and risk profiles for Mild Cognitive Impairment via a machine learning framework: implications for general practice. Int. J. Med. Inf. 170, 104966 (2023).
Article Google Scholar
Lundberg, S. M. & Lee, S. I. A unified approach to interpreting model predictions. Adv. Neural Infor. Proc. Syst. 30, 4768–4777 (2017).
Veneziani, I. et al. Applications of Artificial Intelligence in the neuropsychological assessment of dementia: a systematic review. J. Pers. Med. 14, 113 (2024).
Article PubMed PubMed Central Google Scholar
Broeders, M. et al. Evolution of mild cognitive impairment in Parkinson's disease. Neurology 81, 346–352 (2013).
Article PubMed CAS Google Scholar
Marras, C. et al. Measuring mild cognitive impairment in patients with Parkinson’s disease. Mov. Disord. 28, 626–633 (2013).
Article PubMed PubMed Central Google Scholar
Miller, I. N., Neargarder, S., Risi, M. M. & Cronin-Golomb, A. Frontal and posterior subtypes of neuropsychological deficit in Parkinson’s disease. Behav. Neurosci. 127, 175–183 (2013).
Article PubMed PubMed Central Google Scholar
Johnson, D. K., Langford, Z., Garnier-Villarreal, M., Morris, J. C. & Galvin, J. E. Onset of Mild Cognitive Impairment in Parkinson's Disease. Alzheimer Dis. Assoc. Disord. 30, 127 (2016).
Article PubMed PubMed Central Google Scholar
Karr, J. E., Graham, R. B., Hofer, S. M. & Muniz-Terrera, G. When does cognitive decline begin? A systematic review of change point studies on accelerated decline in cognitive and neurological outcomes preceding mild cognitive impairment, dementia, and death. Psychol. Aging 33, 195–218 (2018).
Article PubMed PubMed Central Google Scholar
Erro, R. et al. Do subjective memory complaints herald the onset of mild cognitive impairment in Parkinson's disease? J. Geriatr. Psychiatry Neurol. 27, 276–281 (2014).
Article PubMed Google Scholar
Siciliano, M., Tessitore, A., Morgante, F., Goldman, J. G. & Ricciardi, L. Subjective cognitive complaints in Parkinson’s disease: a systematic review and meta-analysis. Mov. Disord. 39, 17–28 (2024).
Article PubMed CAS Google Scholar
Weintraub, D. et al. The neuropsychiatry of Parkinson’s disease: advances and challenges. Lancet Neurol. 21, 89–102 (2022).
Article PubMed PubMed Central Google Scholar
Schmid, M. & Hammar, Å. First-episode patients report cognitive difficulties in executive functioning 1 year after initial episode of major depressive disorder. Front. Psychiatry 12, 667238 (2021).
Huang, J. et al. Subjective cognitive decline in patients with Parkinson’s disease: an updated review. Front. Aging Neurosci. 15, 1117068 (2023).
Oedekoven, C., Egeri, L., Jessen, F., Wagner, M. & Dodel, R. Subjective cognitive decline in idiopathic Parkinson’s disease: a systematic review. Ageing Res. Rev. 74, 101508 (2022).
Article PubMed CAS Google Scholar
Jessen, F. et al. A conceptual framework for research on subjective cognitive decline in preclinical Alzheimer’s disease. Alzheimers Dement. 10, 844–852 (2014).
Article PubMed Google Scholar
Cools, R. & D’Esposito, M. Inverted-U–shaped dopamine actions on human working memory and cognitive control. Biol. Psychiatry 69, e113–e125 (2011).
Article PubMed PubMed Central CAS Google Scholar
Grall-Bronnec, M. et al. Dopamine agonists and impulse control disorders: a complex association. Drug Saf. 41, 19–75 (2018).
Article PubMed CAS Google Scholar
Martins, D., Faria, R., Pinho, M. & Rodrigues, S. Impulse control disorders and dopamine agonists. Eur. Psychiatry 64, S475 (2021).
Article PubMed Central Google Scholar
Hipp, G. et al. The Luxembourg Parkinson’s study: a comprehensive approach for stratification and early diagnosis. Front. Aging Neurosci. 10, 326 (2018).
Article PubMed PubMed Central Google Scholar
Pavelka, L. et al. Luxembourg Parkinson’s study -comprehensive baseline analysis of Parkinson’s disease and atypical Parkinsonism. Front. Neurol. 14, 1330321 (2023).
Hughes, A. J., Daniel, S. E., Kilford, L. & Lees, A. J. Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry 55, 181–184 (1992).
Article PubMed PubMed Central CAS Google Scholar
Beck, A. T., Ward, C. H., Mendelson, M., Mock, J. & Erbaugh, J. An inventory for measuring depression. Arch. Gen. Psychiatry 4, 561–571 (1961).
Article PubMed CAS Google Scholar
Stiasny-Kolster, K. et al. The REM sleep behavior disorder screening questionnaire—A new diagnostic instrument. Mov. Disord. 22, 2386–2393 (2007).
Article PubMed Google Scholar
Dubois, B. et al. Diagnostic procedures for Parkinson’s disease dementia: recommendations from the movement disorder society task force. Mov. Disord. 22, 2314–2324 (2007).
Article PubMed Google Scholar
van Steenoven, I. et al. Conversion between mini-mental state examination, Montreal Cognitive Assessment, and dementia rating scale-2 scores in Parkinson’s disease. Mov. Disord. J. Mov. Disord. Soc. 29, 1809–1815 (2014).
Article Google Scholar
Starkstein, S. E., Robinson, R. G. & Mayberg, H. S. Reliability, validity, and clinical correlates of apathy in Parkinson’s disease. J. Neuropsychiatry Clin. Neurosci. 4, 134–139 (1992).
Article PubMed CAS Google Scholar
Peto, V., Jenkinson, C. & Fitzpatrick, R. PDQ-39: a review of the development, validation and application of a Parkinson’s disease quality of life questionnaire and its associated measures. J. Neurol. 245, S10–S14 (1998).
Article PubMed Google Scholar
Goetz, C. G. et al. Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23, 2129–2170 (2008).
Article PubMed Google Scholar
Reitan, R. M. Validity of the trail making test as an indicator of organic brain damage. Percept. Mot. Skills 8, 271–276 (1958).
Article Google Scholar
Corsi, P. M. Corsi: Human memory and the medial temporal region… - Google Académico. https://scholar.google.com/scholar_lookup?title=Human%20memory%20and%20the%20medial%20temporal%20region%20of%20the%20brain&publication_year=1973&author=P.M.%20Corsi (1972).
Dubois, B., Slachevsky, A., Litvan, I. & Pillon, B. The FAB: a frontal assessment battery at bedside. Neurology 55, 1621–1626 (2000).
Article PubMed CAS Google Scholar
Moms, J. C. et al. The Consortium to establish a registry for Alzheimer’s disease (CERAD). Part I. Clinical and neuropsychological assesment of Alzheimer’s disease. Neurology 39, 1159–1159 (1989).
Article Google Scholar
Benton, A. L., Varney, N. R. & Hamsher, K. deS. Judgment of Line Orientation. https://doi.org/10.1037/t11036-000 (1975).
Shirk, S. D. et al. A web-based normative calculator for the uniform data set (UDS) neuropsychological test battery. Alzheimers Res. Ther. 3, 32 (2011).
Article PubMed PubMed Central Google Scholar
Borland, E. et al. The Montreal Cognitive Assessment: normative data from a large swedish population-based cohort. J. Alzheimers Dis. 59, 893–901 (2017).
Article PubMed PubMed Central Google Scholar
Fellows, R. P. & Schmitter-Edgecombe, M. Symbol digit modalities test: regression-based normative data and clinical utility. Arch. Clin. Neuropsychol. 35, 105–115 (2020).
Article Google Scholar
Kiselica, A. M. et al. Development and validity of norms for cognitive dispersion on the uniform data set 3.0 neuropsychological battery. Arch. Clin. Neuropsychol. 39, 732–746 (2024).
Article PubMed PubMed Central Google Scholar
Livingston, G. et al. Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. Lancet 396, 413–446 (2020).
Article PubMed PubMed Central Google Scholar
Palacio-Niño, J.-O. & Berzal, F. Evaluation metrics for unsupervised learning algorithms. Preprint at https://doi.org/10.48550/ARXIV.1905.05667 (2019).
Rousseeuw, P. J. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987).
Article Google Scholar
Williams, M. C. et al. Unsupervised learning to characterize patients with known coronary artery disease undergoing myocardial perfusion imaging. Eur. J. Nucl. Med. Mol. Imaging 50, 2656–2668 (2023).
Article PubMed PubMed Central Google Scholar
Wen, M.-C., Chan, L. L., Tan, L. C. S. & Tan, E. K. Mild cognitive impairment in Parkinson’s disease: a distinct clinical entity? Transl. Neurodegener. 6, 24 (2017).
Article PubMed PubMed Central Google Scholar
Mann, H. B. & Whitney, D. R. On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 18, 50–60 (1947).
Article Google Scholar
Ugoni, A. & Walker, B. F. The Chi square test: an introduction. COMSIG Rev. 4, 61–64 (1995).
PubMed PubMed Central CAS Google Scholar
Tomczak, M. & Tomczak, E. The need to report effect size estimates revisited. An overview of some recommended measures of effect size. TRENDS Sport Sci 21, 19–25 (2014).
Google Scholar

Download references

Acknowledgements

This project was supported by the Luxembourg National Research Fund (FNR) through the FNR/PEARL/dHealthPD/14146272 and FNR/PREVENE/14781425. The National Centre of Excellence in Research on Parkinson’s Disease (NCER-PD) was funded by the Luxembourg National Research Fund (FNR) (FNR/NCER13/BM/11264123). We would like to thank all participants of the Luxembourg Parkinson’s Study for their important support to our research. Furthermore, we acknowledge the joint effort of the partner institutions within the National Centre of Excellence in Research on Parkinson’s Disease (NCER-PD): Luxembourg Centre for Systems Biomedicine, Luxembourg Institute of Health, Centre Hospitalier de Luxembourg, and Laboratoire National de Santé generally contributing to the Luxembourg Parkinson’s Study (see Supplementary Material for list of Consortium members).

Author information

Authors and Affiliations

Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
Gabriel Martínez Tirado, Patricia Martins Conde, Stefano Sapienza, Valerie E. Schröder, Rejko Krüger, Jochen Klucken, Anne Grünewald, Armin Rauschenberger, Clarissa P. C. Gomes, Dheeraj Reddy Bobbili, Ekaterina Soboleva, Elisa Gómez De Lope, Enrico Glaab, Evi Wollscheid-Lengeling, Francoise Meisch, Giuseppe Arena, Ibrahim Boussaad, Jens Schwamborn, Kirsten Roomp, Maria Fernanda NIÑO Uribe, Michael T. Heneka, Michele Bassis, Muhammad Ali, Jade Jaber, Patrick May, Paul Wilmes, Piotr Gawron, Rebecca Ting Jiin Loo, Reinhard Schneider, Ruxandra Soare, Sabine Schmitz, Sarah Nickels, Sascha Herzinger, Sinthuja Pachchek, Soumyabrata Ghosh, Valentin Groues, Venkata Satagopam, Iñigo Yoldi Bergua & Michel Mittelbronn
Bonn-Aachen International Center for Information Technology (B-IT), Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
Holger Fröhlich
Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
Holger Fröhlich
Centre Hospitalier de Luxembourg, Strassen, Luxembourg
Claire Pauly, Valerie E. Schröder, Olena Tsurkalenko, Rejko Krüger, Jochen Klucken, Ekaterina Soboleva, Maria Fernanda NIÑO Uribe, Jade Jaber, Elodie Thiry, Michel Mittelbronn, Gelani Zelimkhanov, Guy Berchem, Liliana Vilas Boas, Linda Hansen, Martine Goergen, Nancy De Bremaeker, Nico Diederich, Romain Nati, Roxane Batutu, Sylvia Herbrink, Lukas Pavelka & Marijus Giraitis
Luxembourg Institute of Health (LIH), Strassen, Luxembourg
Claire Pauly, Valerie E. Schröder, Sonja Jónsdóttir, Olena Tsurkalenko, Rejko Krüger, Armin Rauschenberger, Lukas Pavelka, Marijus Giraitis, Laure Pauly, Achilleas Pexaras, Alexander Hundt, Alexia Mendibide, Ana Festas Lopes, Angelo Ferrari, Brian Dewitt, Carlos Gamio, Estelle Henry, Gaël Hammot, Geeta Acharya, Hermann Thien, Ilsé Richard, Johanna Trouet, Kate Sokolowska, Katy Beaumont, Laura Georges, Lorieza Castillo, Lucie Remark, Maeva Munsch, Margaux Henry, Maud Theresine, Olga Kofanova, Olivia Roland, Pauline Lambert, Saïda Mtimet, Wim Ammerlann, Jochen Ohnmacht, Anne-Marie Hanff, Carlos Vega, Chouaib Mediouni, Deborah Mcintyre, Eduardo Rosales, Fozia Noor, Gessica Contesotto, Gloria Aguayo, Guilherme Marques, Jérôme Graas, Joëlle Fritz, Magali Perquin, Manon Gantenbein, Maura Minelli, Michel Vaillant, Myriam Alexandre, Myriam Menster, Raquel Severino, Sibylle Béchet, Tainá M. Marques, Ulf Nehrbass, Victoria lorentz & Zied Landoulsi
Laboratoire National de Santé, Dudelange, Luxembourg
Michel Mittelbronn, David Bouvier & Katrin Frauenknecht
Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Michel Mittelbronn, Laure Pauly & Anne-Marie Hanff
Luxembourg Center of Neuropathology, Dudelange, Luxembourg
Michel Mittelbronn
Department of Life Sciences and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Michel Mittelbronn
Department of Epidemiology, CAPHRI School for Public Health and Primary Care, Maastricht University Medical Centre+, Maastricht, the Netherlands
Anne-Marie Hanff
Centre Hospitalier Emile Mayrisch, Esch-sur-Alzette, Luxembourg
Alexandre Bisdorff & Rene Dondelinger
Parkinson Luxembourg Association, Leudelange, Luxembourg
Roseline Lentz
Association of Physiotherapists in Parkinsons Disease Europe, Esch-sur-Alzette, Luxembourg
Mariella Graziano
Private practice, Ettelbruck, Luxembourg
Nadine Jacoby
Private practice, Luxembourg, Luxembourg
Jean-Paul Nicolay

Authors

Gabriel Martínez Tirado
View author publications
Search author on:PubMed Google Scholar
Patricia Martins Conde
View author publications
Search author on:PubMed Google Scholar
Stefano Sapienza
View author publications
Search author on:PubMed Google Scholar
Holger Fröhlich
View author publications
Search author on:PubMed Google Scholar
Claire Pauly
View author publications
Search author on:PubMed Google Scholar
Valerie E. Schröder
View author publications
Search author on:PubMed Google Scholar
Sonja Jónsdóttir
View author publications
Search author on:PubMed Google Scholar
Olena Tsurkalenko
View author publications
Search author on:PubMed Google Scholar
Rejko Krüger
View author publications
Search author on:PubMed Google Scholar
Jochen Klucken
View author publications
Search author on:PubMed Google Scholar

Consortia

On behalf of the NCER-PD consortium

Gabriel Martínez Tirado
, Patricia Martins Conde
, Stefano Sapienza
, Claire Pauly
, Sonja Jónsdóttir
, Olena Tsurkalenko
, Rejko Krüger
, Jochen Klucken
, Anne Grünewald
, Armin Rauschenberger
, Clarissa P. C. Gomes
, Dheeraj Reddy Bobbili
, Ekaterina Soboleva
, Elisa Gómez De Lope
, Enrico Glaab
, Evi Wollscheid-Lengeling
, Francoise Meisch
, Giuseppe Arena
, Ibrahim Boussaad
, Jens Schwamborn
, Kirsten Roomp
, Maria Fernanda NIÑO Uribe
, Michael T. Heneka
, Michele Bassis
, Muhammad Ali
, Jade Jaber
, Patrick May
, Paul Wilmes
, Piotr Gawron
, Rebecca Ting Jiin Loo
, Reinhard Schneider
, Ruxandra Soare
, Sabine Schmitz
, Sarah Nickels
, Sascha Herzinger
, Sinthuja Pachchek
, Soumyabrata Ghosh
, Valentin Groues
, Venkata Satagopam
, Iñigo Yoldi Bergua
, Elodie Thiry
, Michel Mittelbronn
, Gelani Zelimkhanov
, Guy Berchem
, Liliana Vilas Boas
, Linda Hansen
, Martine Goergen
, Nancy De Bremaeker
, Nico Diederich
, Romain Nati
, Roxane Batutu
, Sylvia Herbrink
, Lukas Pavelka
, Marijus Giraitis
, Laure Pauly
, Achilleas Pexaras
, Alexander Hundt
, Alexia Mendibide
, Ana Festas Lopes
, Angelo Ferrari
, Brian Dewitt
, Carlos Gamio
, Estelle Henry
, Gaël Hammot
, Geeta Acharya
, Hermann Thien
, Ilsé Richard
, Johanna Trouet
, Kate Sokolowska
, Katy Beaumont
, Laura Georges
, Lorieza Castillo
, Lucie Remark
, Maeva Munsch
, Margaux Henry
, Maud Theresine
, Olga Kofanova
, Olivia Roland
, Pauline Lambert
, Saïda Mtimet
, Wim Ammerlann
, Jochen Ohnmacht
, Anne-Marie Hanff
, Carlos Vega
, Chouaib Mediouni
, Deborah Mcintyre
, Eduardo Rosales
, Fozia Noor
, Gessica Contesotto
, Gloria Aguayo
, Guilherme Marques
, Jérôme Graas
, Joëlle Fritz
, Magali Perquin
, Manon Gantenbein
, Maura Minelli
, Michel Vaillant
, Myriam Alexandre
, Myriam Menster
, Raquel Severino
, Sibylle Béchet
, Tainá M. Marques
, Ulf Nehrbass
, Victoria lorentz
, Zied Landoulsi
, David Bouvier
, Katrin Frauenknecht
, Alexandre Bisdorff
, Rene Dondelinger
, Roseline Lentz
, Mariella Graziano
, Nadine Jacoby
& Jean-Paul Nicolay

Contributions

All authors read, approved the final manuscript and gave consent for publication. All authors participated in reviewing and editing the manuscript. G.M.T.—Data analysis and draft manuscript writing. G.M.T., P.M.C., S.S., C.P., V.E.S., S.J., R.K. and J.K.—Results interpretation. C.P., V.E.S., S.J., O.T., R.K. and J.K.—Clinical expertise. H.F.—Machine learning and data analysis expertise. P.M.C., S.S. and J.K.—Conception and coordination of this project.

Corresponding author

Correspondence to Gabriel Martínez Tirado.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Martínez Tirado, G., Martins Conde, P., Sapienza, S. et al. Data-driven clinical decision support tool for diagnosing mild cognitive impairment in Parkinson’s disease. npj Parkinsons Dis. 12, 15 (2026). https://doi.org/10.1038/s41531-025-01222-6

Download citation

Received: 09 July 2025
Accepted: 18 November 2025
Published: 12 January 2026
Version of record: 13 January 2026
DOI: https://doi.org/10.1038/s41531-025-01222-6

Subjects

Abstract

Similar content being viewed by others

Multimodal neuroimaging-based prediction of Parkinson’s disease with mild cognitive impairment using machine learning technique

Construction of a mild cognitive impairment prediction model for Parkinson’s disease patients on the basis of multimodal data

Subitem-level multi-scale assessment and machine learning for three-class cognitive status classification in Parkinson’s disease

Introduction

Results

Descriptive statistics of the study data

Data-driven model

Diagnostic prediction strength comparison between the clinical diagnostic reference test and the optimal data-driven model

Identification of cognitively distinct PD subgroups

Cognitive characterisation of the identified groups

Discussion

Methods

Study population

Overview of available data

Defining PD-MCI and cognitive domain impairment

Neuropsychological test selection

Normative data calculation

Data-driven model

Comparison of the diagnostic strength between the clinical diagnostic reference test and the best performing data-driven model

Identification and characterisation of cognitively distinct subgroups

Statistical analyses

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

On behalf of the NCER-PD consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information (download DOCX )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links