Abstract
Cardiovascular disease (CVD) risk prediction models for the general population may not provide accurate predictions in individuals with bipolar disorder (BD) who have elevated risks of cardiometabolic conditions and premature mortality. Therefore, we aimed to: 1) develop a five-year CVD risk prediction model in this population by using nationwide register data from Sweden, 2) investigate whether the performance improved when we considered additional risk factors, including psychiatric comorbidity, psychotropic medication, and socio-demographic variables, compared to using established CVD risk factors only, and 3) whether machine learning approach provided improvements compared to standard logistic regression models. We followed 33,933 persons with BD aged 30–82 years old, without previous CVD, from the date of BD diagnosis registered between 2007–2014, for up to five years. The logistic regression model containing only established risk factors yielded an area under the receiver operating characteristic curve (AUC) of 0.76 (95% confidence interval 0.74–0.78) in the test dataset, while the logistic regression model and the best performing machine learning model including additional predictors yielded similar results (AUC was 0.77 (0.75, 0.79) in both models). The performance of logistic regression models slightly improved with additional predictors when continuous risk scores were used. In conclusion, standard logistic regression and established CVD risk factors may be sufficient to predict CVD in individuals with BD when using population register-based data from Sweden. External validation across diverse healthcare settings and rigorous assessment of clinical impact will be crucial next steps before implementing these models in clinical practice.
Similar content being viewed by others
Introduction
Bipolar disorder (BD) is a chronic and severe mental disorder, characterized by alternating episodes of depression and either mania (bipolar I disorder), or hypomania (bipolar II disorder), with a global lifetime prevalence between 1–3% [1,2,3]. Individuals with BD have an increased risk not only of comorbid psychiatric conditions but also of a wide range of medical conditions [4], including metabolic disorders like type 2 diabetes and cardiovascular diseases (CVDs) [5,6,7]. Consequently, BD carries a substantial burden of disease and has been linked to premature mortality, and shorter life expectancy [7, 8]. Increased suicidality rates can partially explain the excess mortality in BD. However, about two-thirds of deaths are linked to medical conditions, with CVDs being the leading cause of premature mortality [9, 10]. It is therefore important to create risk stratification tools to identify individuals with BD who are at high risk of developing CVDs.
CVD risk stratification tools are well established for clinical use in the general population (e.g., SCORE 2, Framingham Risk Score, QRISK, PREDICT) [11,12,13,14]. These include established risk factors such as blood pressure, lipid profile, diabetes, smoking, and body mass index. Yet, these prediction models might underestimate the cardiovascular risk in the population with severe mental disorders [14,15,16], hence the models should be either updated within this population, or they should consider severe mental disorders as one of the predictors. To rectify this underestimation, some models have included severe mental disorders in cardiovascular risk prediction. For instance, the QRISK3, a prediction model of a ten-year cardiovascular risk in the general population aged 25 to 84 includes atypical antipsychotics and severe mental disorders along with established predictors (variables from the QRISK2) and several other additional cardiovascular risk factors [12]. Furthermore, in the PRIMROSE study, ten-year risk prediction models of the first cardiovascular event were developed specifically for people with severe mental disorders between 30 to 79 years of age, using primary care data from the UK [15, 17]. The authors found that the PRIMROSE models containing additional predictors (i.e., area-based deprivation, severe mental disorder diagnosis, prescriptions for antidepressants and/or antipsychotics, and heavy alcohol use) performed better compared to the models that included only established CVD risk factors. However, such models have not been evaluated in Sweden using available registry data nor in individuals with BD specifically, in whom CVD risk may differ from the risk in people with other severe mental disorders (e.g., schizophrenia) [8, 18].
Machine learning (ML) has emerged as a new approach in developing CVD risk prediction tools. ML may provide advantages over standard regression models by being more flexible and making fewer assumptions, e.g., using nonparametric and semiparametric methods [19]. ML algorithms can optimize the use of large-scale data and of many predictors by considering complex non-linear associations and interactions between predictors [20,21,22]. Several applications of ML in CVD risk prediction have shown significant improvements compared to standard regression approaches when developed in the general population [20, 22, 23]. However, such prediction models are not available for the population with BD.
In the current study, we aimed: (i) to develop and internally evaluate a five-year risk prediction model of incident CVD in people diagnosed with BD by using large-scale data from Swedish population-based registers; (ii) to investigate whether additional predictors (i.e., psychiatric comorbidity, use of psychotropic medication, and socio-demographic variables) improve the predictive accuracy of established cardiovascular risk factors available in Swedish registries (i.e., age, sex, hypertension, hyperlipidaemia, diabetes mellitus, tobacco use disorder, family history of CVD); and (iii) to compare the performance of standard logistic regression (LR) models with more complex ML models.
Materials and methods
We followed the TRIPOD + AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods [24]. Data management was performed using SAS software version 9.4 and statistical analyses were conducted using R V.4.3.1 (packages ‘ROCR’, ‘CalibrationCurves’, ‘rms’, ‘glmnet’, ‘caret’), and Python 3.11.9 (scikit-learn [25] version 1.5.1, imbalanced-learn [26] version 0.12.3, and XGBoost [27] version 2.0.3libraries). The study protocol is available in Supplementary material, page 18.
Data sources
Data were acquired from a record linkage from several Swedish national registers, using the unique personal number assigned to all Swedish residents. Socio-demographic data were obtained from the Total Population Register, which contains demographic information since 1968 [28], and the Longitudinal integration database for health insurance and labour market studies register (LISA), with data such as education, income, and civils status, since 1990 on all individuals aged 16 and older [29]. We obtained information on diagnoses from the National Patient Register (NPR), containing all inpatient care diagnoses since 1987 and specialized outpatient diagnoses since 2001 [30], and the Cause of Death Register (CDR) [31], which covers all deaths from 1952. Diagnoses in the NPR and CDR are classified per the International Classification of Diseases (ICD) version 8 (1969–1986), ICD-9 (1987–1996), and ICD-10 (1997-present). Information on medication prescription was acquired from the Prescribed Drug Register (PDR) [32] which covers data on all dispensed prescriptions, with a date of prescription and dosage, since July 1st, 2005, using the Anatomical Therapeutic Classification (ATC) system. We linked individuals from our study cohort to their first-degree relatives by using the Multigeneration Register [33], to obtain information on medical family history.
Definition of bipolar disorder
To identify individuals with BD from the NPR, we used an algorithm validated in Sweden, showing high specificity [34], which defines BD as having at least two inpatient or outpatient admissions for a core BD diagnosis (ICD-8: 296.0–296.3, 296.8, 296.9; ICD-9: 296A–296E, 296 W, 296X; and ICD-10: F30, F31), with exclusion of sole diagnoses of ICD-8: 296.2 (manic-depressive psychosis, depressed type) and/or ICD-9: 296B (unipolar affective psychosis, melancholic form). Additionally, individuals with two or more diagnoses of schizophrenia before the start of follow up were excluded from the cohort.
Population and study period
The study cohort included individuals with BD born between 1932–1984, without previous history of CVDs, aged 30 and older. This age cut off was selected following previous CVD risk prediction studies in people with severe mental disorders [15]. Our cohort was subsequently divided into the 80% training data set, or 20% hold-out test sample, by using a stratified random split by the outcome classes. All individuals were followed from the date of a BD diagnosis that was registered after January 1st, 2007, and after age 30 (i.e., first diagnosis of BD after these two dates, with at least one previously recorded diagnosis), but before December 31st, 2014, until diagnosis of CVD, emigration, death, or end of the five years (Fig. 1). The inclusion period started on January 1st, 2007, to allow for enough time for medication prescriptions to be recorded (i.e., at least 18 months from the start of PDR on July 1st, 2005). Although the PRIMROSE model addressed a ten-year CVD risk in people with severe mental disorders [15, 16], we chose the five-year time window as it may be more clinically appropriate for high-risk groups such as psychiatric populations.
Candidate predictors and selection of relevant predictors
Candidate predictors were selected a priori, considering relevant literature and established CVD risk prediction models. Well-established risk prediction models for CVDs commonly consider risk factors such as blood pressure/hypertension/hypertensive therapy, body mass index (BMI), smoking, diabetes, total/LDL/HDL cholesterol [13], and family history of CVDs (i.e., first-degree relative with CVD, before age 60) [12, 13, 35, 36]. Due to the nature of available data in Swedish registers (i.e., electronic health records of diagnoses and medication prescriptions), we included proxy measures of established CVD risk factors: a diagnosis or dispensed medication prescription for hypertension, diabetes mellitus (Type I and Type II), hyperlipidaemia, obesity, and tobacco use disorder; first-degree family history (a diagnosis) of CVDs before age 60, as well as sex and age at the start of follow up (Supplementary tables 1 and 2).
We considered additional predictors as potentially relevant risk factors, including the number of psychiatric and non-psychiatric hospitalisations in the last two years, psychiatric comorbidity of BD disorders (i.e., depression, anxiety, alcohol use disorder, substance use disorder other than alcohol and tobacco, ADHD), previous use of psychotropic medication, relevant socio-demographic factors (e.g., educational attainment, birth country, civil status, having children, residential area [urban/rural] and income), and other health conditions (autoimmune diseases, migraine, epilepsy, etc.), which have been associated with CVD risk [6, 37]. Among psychotropic medications, we included the following categories as separate predictors (details in Supplementary Table 4): anxiolytics, antidepressants, hypnotics and sedatives, antiepileptics (including benzodiazepine derivates clonazepam, and other antiepileptic medication, but excluding mood stabilizers), mood stabilizers (lithium and other mood stabilizers), antipsychotics, medication for ADHD, and medication for the treatment of addictive disorders.
First, we considered a narrower set of predictors, including the lifetime history of diagnoses and medication prescriptions and most recent socio-demographic information, with multivariable LR and ML models (30 candidate predictors, Supplementary Table 1). Second, we investigated a wider set of predictors using ML models (63 predictors, Supplementary Table 2), combining lifetime history of diagnoses and recent history of medication prescriptions (within the last two years). As we did not expect that information on family medical history or socio-demographic variables was missing at random (e.g., foreign born individuals are more likely to have missing data), we created separate predictors for missingness. We also excluded individuals with missing data on multiple predictors (N = 36, or 0.06%). Details on missing data are in Supplementary Table 6.
For the standard LR approach, we applied a limited backward stepwise procedure in the training dataset to determine whether to retain candidate predictors based on their p-values. The established risk factors were kept fixed in the model, while among additional risk factors, variables with the highest p-value were sequentially rejected, until none of them remained with a p-value greater than 0.1 [38, 39]. This approach allowed us to retain established CVD risk factors in the model and to consider including additional risk factors, finally creating a relatively simple model with good face validity. All models were internally evaluated in the test dataset.
Outcomes
We included an incident diagnosis (primary or any secondary diagnosis) of or medication prescription for the following CVDs: ischemic heart disease, cerebrovascular diseases and transient ischemic attack, thromboembolic diseases, heart failure, arteriosclerosis, and arrhythmia, acquired after the start of follow-up and within the five years. We identified individuals with CVDs based on ICD-10 diagnostic codes from the NPR and CDR. CVDs are often diagnosed in primary care, which is not covered by the NPR. We, therefore, also considered CVD dispensed medication prescriptions based on ATC codes from the PDR, which includes dispensed medication prescribed in primary care as well. The full list of ICD and ATC codes is in the Supplementary Table 5.
Statistical analysis
For internal validation of the models, we randomly assigned individuals from the main cohort to either the 80% training data set or the 20% hold-out test sample, by using a stratified random split by the outcome classes. We first applied multivariable LR analysis to assess the associations between CVDs and candidate predictors in the training data. We also applied penalized LR which incorporates a penalty term in the loss function to address overfitting in LR models [40]. We tested LASSO (L1 regularization), Ridge (L2 regularization), and Elastic Net models (L1 and L2 regularization). A 10-fold cross-validation was used to find the best hyperparameter λ. To further investigate the robustness of our findings obtained using the backwards stepwise selection to identify relevant predictors in the LR model, we also employed the Akaike information criterion (AIC) (Supplementary material, page 17) [41]. This approach balances model fit and complexity and penalizes excessive parameters to identify relevant features.
Furthermore, we trained several ML models, including a Random Forest, XGBoost, and Histogram-Based Gradient Boosting model, which implicitly incorporate potential interactions between predictors, as well as Naïve Bayes, and Support Vector Machine. Additionally, we created a soft-voting ensemble model by combining the three top-performing models, as determined by AUC scores in the cross-validation results. To optimize the hyperparameters of each ML model, a 10-fold cross-validation with grid search was conducted, using AUC as the scoring metric. The grid including each hyperparameter space is in the Supplementary Table 8. Continuous variables were scaled for the Support Vector Machine model.
To assess the discrimination of the models, i.e., the ability of the model to differentiate those with and without CVDs, we used the receiver operating characteristic (ROC) curve with the area under the receiver operating characteristic curve (AUC) and 95% confidence intervals using the DeLong algorithm [42]. We also used the area under the precision-recall curve (AUPRC), which is a useful measure in unbalanced models (uneven class distribution in the outcome variable). Furthermore, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and balanced accuracy were analysed for the predefined high-risk threshold of predicted probability set at 10 and 20% [16, 43, 44], as well as an optimal high-risk threshold calculated using the Youden’s index [45]. To assess the calibration of the model (i.e., assess the agreement between predicted and observed values at different levels of predicted probability), we used the Brier score [46], and calibration plots [47, 48]. Additionally, to test the statistical significance of the incremental value of additional predictors in LR models, we first compared the AUCs of the models with the DeLong’s test [49] and with the Net Reclassification Index (NRI), which summarises reclassification of participants when new predictors are added for high-risk thresholds of 10 and 20%. We also used two continuous measures that consider all possible thresholds of predicted probability: the category-free NRI and the Integrated Discrimination Improvement (IDI) index [50, 51].
As the cardiovascular risk differs between men and women, and the risk prominently increases on average from age 50 [52], we tested the performance of the LR models across males and females, and in those younger and older than 50 years of age.
Results
Descriptive information
Our study cohort comprised of 33,933 individuals with BD aged 30–82 years (mean age 47.4, SD = 12.0) at the beginning of follow-up. From the total cohort, 27,147 individuals (80%) were randomly assigned to the training dataset, of whom 2322 developed CVD (8.55%), while 6786 individuals (20%) were assigned to the hold-out or test dataset, of whom 580 (8.55%) developed CVD during the five-year follow up (Fig. 1). Descriptive characteristics are in Supplementary Table 6.
Selected additional predictors of cardiovascular disease in logistic regression models
By using LR and the limited backwards stepwise selection procedure for predictor selection, 17 predictors were retained – nine established predictors, and eight additional risk factors (Table 1) in the training dataset. Among proxy measures of established risk factors, the strongest association with CVDs was found for diagnosis or medication prescription for hypertension (OR = 1.57, 95% CI 1.39–1.76, p < 0.001). Among additional risk factors, substance use disorder other than tobacco and alcohol (OR = 1.37, 1.20–1.57, p < 0.001) and antiepileptic medication (OR = 1.37, 1.14–1.64, p < 0.001) had the strongest association with CVDs. Majority of antiepileptic medication prescriptions (53%) was for benzodiazepine derivate clonazepam (ATC code N03AE01).
Model performance measures and incremental value of additional risk factors in logistic regression models
We first tested the performance of the standard LR model, which included nine established risk factors. In the test data set, the model showed an AUC of 0.76 (0.74–0.78), an AUPRC of 0.25 and a Brier score of 0.07 (Table 2), suggesting good model discrimination and calibration. The LR model containing additional eight predictors (17 predictors in total) showed a similar AUC of 0.77 (0.75–0.79) in the test data set, with an AUPRC of 0.25 (see Table 2, and Supplementary fig. 1 for corresponding ROC and AUPR curve), and a Brier score of 0.07 (calibration curve is in Fig. 2). Penalized LR provided results consistent with the standard LR analysis (Table 2). We compared the performance of the LR model containing only established risk factors versus the model containing additional risk factors using the DeLong test, which showed that the difference in AUC between the two models was not statistically significant (p value 0.296). Furthermore, the model derived based on the AIC criteria yielded results consistent with the model derived based on the stepwise selection process, albeit this model kept only four additional predictors (Supplementary material, page 17).
We tested several high-risk thresholds including the two predefined thresholds of 10 and 20%, as well as an “optimal” threshold obtained using the Youden’s index from the LR model (9% for both LR models). Using the 10% high-risk threshold in the test dataset, which was closely aligned with the calculated optimal high-risk threshold, the model showed a sensitivity of 0.67 (0.63, 0.70), specificity of 0.74 (0.73, 0.75), PPV of 0.19 (0.18, 0.21), NPV of 0.96 (0.95, 0.96) and balanced accuracy of 0.70 (Table 3). We did not find a statistically significant improvement when we compared the performance of the models using a high-risk threshold of 10%, with the NRI being −0.001 (95% CI −0.024, 0.021), or 20%, the NRI was 0.013 (95% CI −0.011, 0.036). Yet, we identified a statistically significant improvement when we used category-free or continuous measures (the IDI index was 0.006, 95% CI 0.003, 0.009, and integrated NRI was 0.006, 95% CI 0.003, 0.009).
LR models performed similarly across males and females (Supplementary Table 9) as in the total test dataset. However, their performance was weaker when it was tested separately in individuals younger than 50, and older than 50 years. To further investigate differences in CVD risk prediction between these two age groups, we performed a post hoc analysis where we retrained the logistic regression model with established risk factors separately within each group (Supplementary material, page 16). This analysis did not reveal improvements in the re-trained models. Moreover, predictor coefficients remained generally similar between the models in younger and older individuals, except for obesity, which appeared to be more relevant for CVD risk in those younger than 50.
Machine learning models compared to the standard logistic regression approach
When evaluating the performance of the tuned ML models trained with 17 predictors, all showed lower AUROC and AUPRC values on the test set compared to LR, except for the ensemble model (comprising a random forest, histogram-based gradient boosting, and LR), which demonstrated comparable performance (AUC of 0.77 vs 0.77; AUPRC of 0.25 vs 0.26) (Table 2). However, when a larger set of predictors was considered, the ensemble model outperformed LR minimally, with an AUC of 0.77 versus 0.77 and an AUPRC of 0.26 versus 0.25. The balanced accuracy of the ensemble model was 0.70 with a high-risk threshold of 10% and 0.59 with a threshold of 20%, comparable with LR models (Table 3). Furthermore, with a high-risk threshold of 20%, the ensemble model outperformed the LR model when it comes to specificity (0.96 vs 0.92) and PPV (0.33 vs 0.28), although sensitivity (0.23 vs 0.31) was somewhat lower compared with LR.
Discussion
We used registry-based data from Sweden to develop and internally validate a five-year risk prediction model of CVD in people with BD. We found that established cardiovascular risk factors and a standard LR approach provided relatively good predictive performance, with an AUC of 0.76 in the test dataset. Additional predictors and ML methods did not provide substantial improvements. As there are no available CVD risk prediction models in people with BD, we compared the achieved AUC to established cardiovascular risk prediction models, developed and validated in the general population [53], and models developed in people with severe mental disorders [15]. These models, which have demonstrated promising results for clinical application, yielded similar results to ours [17].
We observed no improvement in AUC with additional predictors, whether with the narrower (17 predictors) or broader set of predictors (63 predictors). This finding held for both the LR model and the best-performing ensemble ML models, with both achieving an AUC of 0.77. Thus, our study confirmed the relevance of established cardiovascular risk factors in the population with BD, even when they are defined as registry-based data available in Swedish electronic health records, used as proxy measures of detailed clinical indicators. The only non-significant association of CVD was with hyperlipidaemia, defined as either a diagnosis or dispensed medication prescription for hyperlipidaemia. This could be due to a potential underdiagnosis of the condition or successful remission following the treatment. The relatively modest performance of LR models with additional predictors and ML approaches compared to LR with established risk factors only, likely stems from several interconnected factors. First, additional predictors are highly co-occurring with the established risk factors. Thus, established risk factors may provide an approximation of the CVD risk that incorporates the effects of additional predictors as well. For instance, socio-demographic factors and psychiatric comorbidity may be reflected through health-adverse lifestyle choices (e.g., lack of exercise, inadequate diet, etc), which in turn, affect established risk factors. Second, the binary nature of most predictors (i.e., presence/absence of diagnoses and medication use) might have reduced the opportunity for ML to discover complex, non-linear associations and interactions between predictors. This is in line with previous ML studies on the prediction of major chronic diseases with low incidence (e.g., CVDs, chronic kidney disease, diabetes, and hypertension) and predominantly linear associations with simple clinical predictors [54]. Conversely, the advantages of ML models become more pronounced when analysing high-dimensional data with complex interactions, such as continuous physiological, psychosocial and environmental factors, in addition to medical history of diagnoses/medication prescriptions [20, 22, 23]. Finally, the modest number of events relative to the number of predictors may have limited the ability of ML to learn more complex patterns without risking overfitting, making the more parsimonious logistic regression approach equally effective.
Nevertheless, when continuous risk prediction scores were applied in LR models, eight additional predictors provided a statistically significant increment compared to established risk factors only. This finding supports previous research indicating increased cardiovascular risks associated with the use of psychotropic medication [55], or the risk associated with psychiatric conditions treated with these medications (i.e., anxiety, sleep disorders) [56,57,58]. Furthermore, previous research has identified that individuals with severe mental disorders and substance use disorder are at even higher CVD risk than those without substance abuse [59, 60]. Moreover, patients with BD have high rates of psychiatric comorbidity, with a concurrent use of several psychotropic medications (i.e., polypharmacy) [61], and psychiatric polypharmacy has been associated with risks of cardiovascular morbidity and mortality [62]. We did not find that mood stabilizers (including lithium and other mood stabilizers) have a statistically significant association with CVDs. This may be due to the low variability of this predictor in our cohort (i.e., 72% of the cohort had a prescription for mood stabilizers). Further, socio-economic factors (low education) were relevant for CVD risk prediction [63, 64], indicating that patients with lower education may need additional support to adequately manage cardiometabolic risk factor (e.g., diet, exercise, smoking). We also found that being born outside Sweden was associated with a decreased risk of CVD, while having missing information on family history was associated with an increased risk of CVD. These findings seem contradictory as foreign-born individuals were more likely to have missing data for family history. The cardiovascular health of immigrants compared to Swedish-born individuals may depend on their country of origin, reasons for/duration of immigration, or the selection effect [65], while also less severe CVDs may be underdiagnosed/under-reported in immigrants with mental disorders. As we did not consider more detailed information on immigration or other potential reasons for data missingness, it is difficult to translate these findings into specific clinical implications.
All tested models performed better with a high-risk threshold of 10% compared to 20% for sensitivity and balanced accuracy, a useful measure of predictive performance for unbalanced data [66]. This result may be due to the lower thresholds likely reflecting the true probability of the outcome (the mean predicted probability in the test data was 8%, and the incidence of CVD within the follow up period was 8.5%), and the calculated optimal high-risk thresholds being close to 10%. Also, this finding supports the previously used high-risk threshold of 10% in the general population for a five-year CVD risk prediction [44], and the NICE guidelines recommendations of using a 10% threshold for the initiation of preventative therapy with statins [67]. In our study, when a high-risk threshold of 10% was applied, the LR model containing both established and additional risk factors correctly classified 388 out of 580 persons with CVD at the end of follow up (67%), while with the high-risk threshold set at 20%, the same model correctly classified 180 individuals (31%). Due to the unavailability of CVD risk prediction models in people with BD, we could only compare our results with the PRIMROSE models [15]. The PRIMROSE BMI model correctly classified 390 out of 2096 men who developed CVDs (18.6%) as being high-risk, while the PRIMROSE lipid model correctly classified 410 out of 2137 men (19.2%), with similar results for women. However, the PRIMROSE models only applied a 20% high-risk threshold in calculating a 10-year CVD risk in people with severe mental disorders and incorporated both detailed continuous risk factors and history of diagnoses/medication prescriptions.
With regards to the potential clinical implications of our findings, the derived LR models, containing predominantly lifetime history of risk factors, could be applied during a single psychiatric visit in individuals with BD aged 30 and above. The ML models, incorporating a broader set of predictors with both lifetime history of diagnoses and recent history of medication prescriptions, may be more useful for follow-up assessments. Nevertheless, the derived models might need to be updated using more detailed measures from regular follow-up visits to provide adequate repeated risk calculations. Further, although our results confirmed the relevance of established risk factors for CVD in individuals with BD, eight additional risk predictors might provide improvements in predictive accuracy when continuous risk scores are applied. The use of eight additional predictors in LR models would not demand substantial additional clinical resources as these data often routinely collected in psychiatric care. Our results also emphasize the need of not only close monitoring of general health, including established risk factors, but also of psychiatric comorbidity, psychotropic medication use, and socio-economic factors to enable timely and adequate prevention of CVDs in this population. This, albeit small, increment may still be clinically relevant as it has been shown that patients with psychiatric disorders can experience poorer management of cardiovascular risk compared to the general population [68].
Strengths and limitations
In this study we used large-scale data with national coverage of medical records and socio-demographic information in Sweden, and the considered predictors could be collected either during a clinical interview or from medical records. The investigated models, both standard LR and ML models, showed good discrimination. Furthermore, although considered predictors are clinically relevant indicators of CVD risk factors (e.g., hypertension, hyperlipidaemia, diabetes), preclinical stages and continuous measures of risk factors (e.g., cardiometabolic risk indicators such as blood pressure, glucose and lipid levels) may provide more detailed predictions of the outcomes. We also only had access to data from specialist care, while additional access to primary care records could provide a wider coverage including individuals with less severe clinical presentations of considered conditions. This limitation is reflected in our observed underestimation of the population prevalence of obesity and smoking in Sweden [69, 70]. Finally, we did not have access to other relevant CVD risk factors, such as physical activity and nutrition [71]. Thus, obtained models require further evaluation and updating by using additional data sources (e.g., primary care) and more detailed measures of risk factors. Such measures could be potentially obtained from Swedish quality registers, such as the Swedish National Diabetes Register (NDR) that provides a possibility to investigate cardiometabolic conditions registered in the primary and specialist health care, while the Swedish National Quality Register for Bipolar Disorder – BipoläR provides detailed clinical data on individuals with BD.
Furthermore, our models demonstrated reduced performance when tested separately in individuals younger and older than 50, and this did not improve when the model with established risk factors was retrained separately in these two age groups. Regarding individual predictor relevance, only obesity appeared to be more relevant for CVD risk prediction in younger individuals compared to older individuals, while other predictors showed similar relevance across both groups. Therefore, potential effects of age warrant further investigation.
Finally, in our LR modelling strategy, we employed a limited backward stepwise selection for additional risk factors while retaining all established risk factors in the model. This strategy served two purposes: maintaining clinical relevance by including established cardiovascular predictors and identifying additional important factors while preserving face validity for clinicians. To address potential overfitting concerns inherent in the stepwise selection [72], we evaluated our model in a hold-out test dataset. We also employed the AIC approach to select relevant predictors among additional risk factors and conducted a penalized LR analysis to evaluate the performance of the simple LR model with additional predictors selected in the stepwise selection approach. These analyses provided performance metrics consistent with the main findings. This is probably because we have a relatively large data set, and a modest amount of candidate predictors (30 candidate predictors) relative to the number of outcomes (580 individuals in test data). Nevertheless, to ensure the broader applicability of our findings, external validation across different healthcare systems, countries, and data sources remains essential. This validation will be particularly important given the potential variations in healthcare and documentation practices (e.g., without access to universal health care and/or to electronic health records), and patient populations across different settings.
Conclusions
Our study suggests that standard LR using established CVD risk factors achieved satisfactory predictive performance for five-year CVD risk in individuals with BD using Swedish register-based data. This parsimonious approach proved effective, while incorporating additional predictors, i.e., psychiatric comorbidity, use of psychotropic medication and socio-demographic factors, may only provide modest improvements when continuous risk scores are used. Further studies are needed to consider more detailed health-related variables in this population (e.g., continuous measures of cardiometabolic risk, physical activity, and nutrition) to confirm or update our findings. External validation across diverse healthcare settings and rigorous assessment of clinical impact will be crucial next steps before implementing these models in routine clinical practice.
Data availability
The Public Access to Information and Secrecy Act in Sweden prohibits us from making individual level data publicly available. Researchers who are interested in replicating our work can apply for individual level data at Statistics Sweden: www.scb.se/en/services/guidance-for-researchers-and-universities/. The code can be made available upon request.
Code availability
The Public Access to Information and Secrecy Act in Sweden prohibits us from making individual level data publicly available. Researchers who are interested in replicating our work can apply for individual level data at Statistics Sweden: www.scb.se/en/services/guidance-for-researchers-and-universities/. The code can be made available upon request.
References
Nierenberg AA, Agustini B, Köhler-Forsberg O, Cusin C, Katz D, Sylvia LG, et al. Diagnosis and treatment of bipolar disorder: a review. JAMA. 2023;330:1370–80.
Merikangas KR, Jin R, He J-P, Kessler RC, Lee S, Sampson NA, et al. Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Arch. Gen. Psychiatry. 2011;68:241–51.
Clemente AS, Diniz BS, Nicolato R, Kapczinski FP, Soares JC, Firmo JO, et al. Bipolar disorder prevalence: a systematic review and meta-analysis of the literature. Rev. Brasileira de. Psiquiatria. 2015;37:155–61.
Krishnan KRR. Psychiatric and medical comorbidities of bipolar disorder. Psychosom. Med. 2005;67:1–8.
Fiedorowicz JG, Palagummi NM, Forman-Hoffman VL, Miller DD, Haynes WG. Elevated prevalence of obesity, metabolic syndrome, and cardiovascular risk factors in bipolar disorder. Ann. Clin. Psychiatry. 2008;20:131–7.
Coello K, Kjærstad HL, Stanislaus S, Melbye S, Faurholt-Jepsen M, Miskowiak KW, et al. Thirty-year cardiovascular risk score in patients with newly diagnosed bipolar disorder and their unaffected first-degree relatives. Australian N. Zealand J. Psychiatry. 2019;53:651–62.
Paljärvi T, Herttua K, Taipale H, Lähteenvuo M, Tanskanen A, Tiihonen J. Cardiovascular mortality in bipolar disorder: population-based cohort study. Acta Psychiatr Scand. 2024;150:56–64.
Laursen TM. Life expectancy among persons with schizophrenia or bipolar affective disorder. Schizophrenia Res. 2011;131:101–4.
Correll CU, Solmi M, Veronese N, Bortolato B, Rosson S, Santonastaso P, et al. Prevalence, incidence and mortality from cardiovascular disease in patients with pooled and specific severe mental illness: a large-scale meta-analysis of 3,211,768 patients and 113,383,368 controls. World Psychiatry. 2017;16:163–80.
Chan JKN, Wong CSM, Yung NCL, Chen EYH, Chang WC. Excess mortality and life-years lost in people with bipolar disorder: an 11-year population-based cohort study. Epidemiol. Psychiatr. Sci. 2021;30:e39.
Kist JM, Vos RC, Mairuhu ATA, Struijs JN, van Peet PG, Vos HMM, et al. SCORE2 cardiovascular risk prediction models in an ethnic and socioeconomic diverse population in the Netherlands: an external validation study. eClinicalMedicine. 2023;57:101862.
Hippisley-Cox J, Coupland C, Brindle P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. bmj. 2017;357:j2099.
Wilson PW, D’Agostino RB, Levy D, Belanger AM, Silbershatz H, Kannel WB. Prediction of coronary heart disease using risk factor categories. Circulation. 1998;97:1837–47.
Pylypchuk R, Wells S, Kerr A, Poppe K, Riddell T, Harwood M, et al. Cardiovascular disease risk prediction equations in 400 000 primary care patients in New Zealand: a derivation and validation study. Lancet. 2018;391:1897–907.
Osborn DP, Hardoon S, Omar RZ, Holt RI, King M, Larsen J, et al. Cardiovascular risk prediction models for people with severe mental illness: results from the prediction and management of cardiovascular risk in people with severe mental illnesses (PRIMROSE) research program. JAMA psychiatry. 2015;72:143–51.
Cunningham R, Poppe K, Peterson D, Every-Palmer S, Soosay I, Jackson R. Prediction of cardiovascular disease risk among people with severe mental illness: a cohort study. PLoS one. 2019;14:e0221521.
Zomer E, Osborn D, Nazareth I, Blackburn R, Burton A, Hardoon S, et al. Effectiveness and cost-effectiveness of a cardiovascular risk prediction algorithm for people with severe mental illness (PRIMROSE). BMJ open. 2017;7:e018181.
Foguet-Boreu Q, Fernandez San Martin MI, Flores Mateo G, Zabaleta Del Olmo E, Ayerbe García-Morzon L, Perez-Piñar López M, et al. Cardiovascular risk assessment in patients with a severe mental illness: a systematic review and meta-analysis. BMC Psychiatry. 2016;16:141.
Rose S. Intersections of machine learning and epidemiological methods for health services research. Int. J. Epidemiol. 2020;49:1763–70.
Alaa AM, Bolton T, Di Angelantonio E, Rudd JH, Van der Schaar M. Cardiovascular disease risk prediction using automated machine learning: a prospective study of 423,604 UK Biobank participants. PLoS one. 2019;14:e0213653.
Krittanawong C, Virk HUH, Bangalore S, Wang Z, Johnson KW, Pinotti R, et al. Machine learning prediction in cardiovascular diseases: a meta-analysis. Sci. Rep. 2020;10:16057.
Zhao Y, Wood EP, Mirin N, Cook SH, Chunara R. Social determinants in machine learning cardiovascular disease prediction models: a systematic review. Am. J. preventive Med. 2021;61:596–605.
Kakadiaris IA, Vrigkas M, Yen AA, Kuznetsova T, Budoff M, Naghavi M. Machine learning outperforms ACC/AHA CVD risk calculator in MESA. J. Am. Heart Assoc. 2018;7:e009476.
Collins GS, Moons KG, Dhiman P, Riley RD, Beam AL, Van Calster B, et al. TRIPOD + AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. bmj. 2024;385:q902.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 2011;12:2825–30.
Lemaître G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 2017;18:1–5.
Chen T, Guestrin C, editors. Xgboost: a scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; 2016.https://doi.org/10.1145/2939672.2939785
Ludvigsson JF, Almqvist C, Bonamy A-KE, Ljung R, Michaëlsson K, Neovius M, et al. Registers of the Swedish total population and their use in medical research. Eur. J. Epidemiol. 2016;31:125–36.
Ludvigsson JF, Svedberg P, Olén O, Bruze G, Neovius M. The longitudinal integrated database for health insurance and labour market studies (LISA) and its use in medical research. Eur. J. Epidemiol. 2019;34:423–37.
Ludvigsson JF, Andersson E, Ekbom A, Feychting M, Kim J-L, Reuterwall C, et al. External review and validation of the Swedish national inpatient register. BMC Public. Health. 2011;11:450.
Brooke HL, Talbäck M, Hörnblad J, Johansson LA, Ludvigsson JF, Druid H, et al. The Swedish cause of death register. Eur. J. Epidemiol. 2017;32:765–73.
Wettermark B, Hammar N, MichaelFored C, Leimanis A, Otterblad Olausson P, Bergman U, et al. The new Swedish prescribed drug register—opportunities for pharmacoepidemiological research and experience from the first six months. Pharmacoepidemiol drug. Saf. 2007;16:726–35.
Ekbom A. The Swedish multi-generation register. Methods Mol. Biol. 2011;675:215–20.
Sellgren C, Landén M, Lichtenstein P, Hultman C, Långström N. Validity of bipolar disorder hospital discharge diagnoses: file review and multiple register linkage in Sweden. Acta Psychiatr Scand. 2011;124:447–53.
Tunstall-Pedoe H, Woodward M, Tavendale R, A’Brook R, McCluskey MK. Comparison of the prediction by 27 different factors of coronary heart disease and death in men and women of the Scottish Heart Health Study: cohort study. Bmj. 1997;315:722–9.
SCORE2 working group and ESC Cardiovascular risk collaboration. SCORE2 risk prediction algorithms: new models to estimate 10-year risk of cardiovascular disease in Europe. Eur. Heart J. 2021;42:2439–54.
Penninx BW, Lange SM. Metabolic syndrome in psychiatric patients: overview, mechanisms, and implications. Dialogues Clin. Neurosci. 2018;20:63–73.
Royston P, Sauerbrei W. Multivariable model-building: a pragmatic approach to regression anaylsis based on fractional polynomials for modelling continuous variables. West Sussex, England: John Wiley & Sons; 2008.
Fazel S, Wolf A, Larsson H, Mallett S, Fanshawe TR. The prediction of suicide in severe mental illness: development and validation of a clinical prediction rule (OxMIS). Transl. psychiatry. 2019;9:98.
Greenwood CJ, Youssef GJ, Letcher P, Macdonald JA, Hagg LJ, Sanson A, et al. A comparison of penalised regression methods for informing the selection of predictive markers. PLoS One. 2020;15:e0242730.
Cavanaugh JE, Neath AA. The Akaike information criterion: Background, derivation, properties, application, interpretation, and refinements. WIREs Computational Stat. 2019;11:e1460.
Sun X, Xu W. Fast implementation of DeLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal. Process. Lett. 2014;21:1389–93.
Grundy SM, Cleeman JI, Merz CN, Brewer HB Jr., Clark LT, Hunninghake DB, et al. Implications of recent clinical trials for the national cholesterol education program adult treatment panel iii guidelines. Circulation. 2004;110:227–39.
Polonsky TS, McClelland RL, Jorgensen NW, Bild DE, Burke GL, Guerci AD, et al. Coronary artery calcium score and risk classification for coronary heart disease prediction. Jama. 2010;303:1610–6.
Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3:32–5.
Brier GW. Verification of forecasts expressed in terms of probability. Monthly Weather. Rev. 1950;78:1–3.
Harrell FE Jr., Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med. 1996;15:361–87.
Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA. Evaluating the yield of medical tests. Jama. 1982;247:2543–6.
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–45.
Pencina MJ, D'Agostino RB Sr, D’Agostino RB Jr., Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat. Med. 2008;27:157–72.
Steyerberg EW, Pencina MJ, Lingsma HF, Kattan MW, Vickers AJ, Van Calster B. Assessing the incremental value of diagnostic and prognostic markers: a review and illustration. Eur. J. Clin. Invest. 2012;42:216–28.
Visseren FLJ, Mach F, Smulders YM, Carballo D, Koskinas KC, Bäck M, et al. 2021 ESC Guidelines on cardiovascular disease prevention in clinical practice: developed by the task force for cardiovascular disease prevention in clinical practice with representatives of the European society of cardiology and 12 medical societies with the special contribution of the European Association of Preventive Cardiology (EAPC). Eur. Heart J. 2021;42:3227–337.
Siontis GC, Tzoulaki I, Siontis KC, Ioannidis JP. Comparisons of established risk prediction models for cardiovascular disease: systematic review. Bmj. 2012;344:e3318.
Nusinovici S, Tham YC, Chak Yan MY, Wei Ting DS, Li J, Sabanayagam C, et al. Logistic regression was as good as machine learning for predicting major chronic diseases. J. Clin. Epidemiol. 2020;122:56–69.
De Hert M, Detraux J, Van Winkel R, Yu W, Correll CU. Metabolic and cardiovascular adverse effects associated with antipsychotic drugs. Nat. Rev. Endocrinol. 2012;8:114–26.
Jiang F, Li S, Pan L, Zhang N, Jia C. Association of anxiety disorders with the risk of smoking behaviors: a meta-analysis of prospective observational studies. Drug. alcohol. dependence. 2014;145:69–76.
Hennekens CH, Hennekens AR, Hollar D, Casey DE. Schizophrenia and increased risks of cardiovascular disease. Am. heart J. 2005;150:1115–21.
Wang D, Li W, Cui X, Meng Y, Zhou M, Xiao L, et al. Sleep duration and risk of coronary heart disease: A systematic review and meta-analysis of prospective cohort studies. Int. J. Cardiology. 2016;219:231–9.
Onyeka MMBAPIN, Collier Høegh MM, Nåheim Eien CMEM, Nwaru MPBI, Melle MDPI. Comorbidity of physical disorders among patients with severe mental illness with and without substance use disorders: a systematic review and meta-analysis. J. Dual Diagnosis. 2019;15:192–206.
Abosi O, Lopes S, Schmitz S, Fiedorowicz JG. Cardiometabolic effects of psychotropic medications. Hormone Mol. Biol. Clin. Investigation. 2018;36:1–27.
Aguglia A, Natale A, Fusar-Poli L, Amerio A, Costanza A, Fesce F, et al. Complex polypharmacy in bipolar disorder: results from a real-world inpatient psychiatric unit. Psychiatry Res. 2022;318:114927.
Edinoff AN, Ellis ED, Nussdorf LM, Hill TW, Cornett EM, Kaye AM, et al. Antipsychotic polypharmacy-related cardiovascular morbidity and mortality: a comprehensive review. Neurol. Int. 2022;14:294–309.
Clark AM, DesMeules M, Luo W, Duncan AS, Wielgosz A. Socioeconomic status and cardiovascular disease: risks and implications for care. Nat. Rev. Cardiology. 2009;6:712–22.
Khaing W, Vallibhakara SA, Attia J, McEvoy M, Thakkinstian A. Effects of education and income on cardiovascular outcomes: a systematic review and meta-analysis. Eur. J. Preventive Cardiology. 2017;24:1032–42.
Dotevall A, Rosengren A, Lappas G, Wilhelmsen L. Does immigration contribute to decreasing CHD incidence? coronary risk factors among immigrants in Göteborg, Sweden. J. Intern. Med. 2000;247:331–9.
Thölke P, Mantilla-Ramos Y-J, Abdelhedi H, Maschke C, Dehgan A, Harel Y, et al. Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data. NeuroImage. 2023;277:120253.
Minder CM, Blumenthal RS, Blaha MJ. Statins for primary prevention of cardiovascular disease: the benefits outweigh the risks. Curr. Opin. cardiology. 2013;28:554–60.
Ayerbe L, Forgnone I, Foguet-Boreu Q, González E, Addo J, Ayis S. Disparities in the management of cardiovascular risk factors in patients with psychiatric disorders: a systematic review and meta-analysis. Psychol. Med. 2018;48:2693–701.
Folkhälsomyndigheten. Overweight and obesity. The Public Health Agency of Sweden (Folkhälsomyndigheten); 2018.
Folkhälsomyndigheten. Use of tobacco and nicotine products. The Public Health Agency of Sweden (Folkhälsomyndigheten); 2024.
Zhang Y-B, Pan X-F, Chen J, Cao A, Xia L, Zhang Y, et al. Combined lifestyle factors, all-cause mortality and cardiovascular disease: a systematic review and meta-analysis of prospective cohort studies. J. Epidemiol. Community Health. 2021;75:92–9.
Smith G. Step away from stepwise. J. Big Data. 2018;5:32.
Acknowledgements
This study is a part of the project Medical comorbidities in bipolar disorder (BIPCOM): clinical validation of risk factors and biomarkers to improve prevention and treatment. We acknowledge the relevant work of collaborators within the BIPCOM project (in alphabetical order): Maximilian Bayas, Paolo Brambilla, Pietro Carmellini, Elisa Caselani, Julia Clemens, Lorena Di Consoli, Philippe Courtet, Bruno Etain, Andrea Fagiolini, Julian Fuhrer, Stine H. Glastad, Ophélia Godin, Christina Jäger, Ingrid T. Johansen, Florian Klingler, Nene Kobayashi, Marion Leboyer, Marta Magno, Silke Matura, Elisabeth Michaelis, Chiara Möser, Emilie Olié, Agnes Pelletier, Maximilian Pilhatsch, Laura Poddighe, Roberto Poli, Jonathan Repple, Philipp Ritter, Linn Rødevand, Anil Sen, Francesca Siri, Falk Gerrik Verhees, and Sabrina Vogel.
Funding
This project was supported by VINNOVA (Sweden’s Innovation Agency; grant number: 2022-00541) under the framework of ERA PerMed, ERAP‑ERMED2022‑087—BIPCOM. The project has also received funding from the Swedish Research Council (2025- 03176). OAA was supported by Research Council of Norway (#326813), Regional Health Authority (2022-073, 2023-031) Nordforsk (#164218). AR was supported by Hessian Ministry of Science and Arts (HMWK) LOEWE program (LOEWE Centre DYNAMIC). Open access funding provided by Örebro University.
Author information
Authors and Affiliations
Contributions
MD, HL, and MGA designed the study with input from all other co-authors. ML, PA, OAA, MB, RC, GDG, and AR provided clinical expertise. MD conducted data management and the analysis, with input from MGA and RKH. MD drafted the manuscript. All authors assisted in interpreting the results and provided critical feedback on the manuscript and revisions. HL and MGA provided supervision.
Corresponding author
Ethics declarations
Competing interests
ML has received lecture honoraria from Lundbeck pharmaceuticals, outside the submitted work. ZC received speaker fees from Takeda Pharmaceuticals, outside the submitted work. OAA is consultant to Precision Health AS and Cortechs, and has received speakers honorarium from Lundbeck, Janssen, Otsuka and Lilly. AR has received honoraria for lectures and/or advisory boards from Janssen, Boehringer Ingelheim, COMPASS, SAGE/Biogen, LivaNova, Medice, Shire/Takeda, MSD and cyclerion. Also, he has received research grants from Medice and Janssen, all outside the submitted work. HL reports receiving grants from Shire Pharmaceuticals; personal fees from and serving as a speaker for Medice, Shire/Takeda Pharmaceuticals and Evolan Pharma AB; and sponsorship for a conference on attention-deficit/hyperactivity disorder from Shire/Takeda Pharmaceuticals and Evolan Pharma AB, all outside the submitted work. HL is editor-in-chief of JCPP Advances. All other authors declare no competing interests.
Ethics approval and consent to participate
This project was reviewed and approved by the Swedish Ethical Review Authority (Dnr 2020-06540; 2022-06204-02). The requirement for informed consent is waived for anonymised register-based research according to Swedish law. All methods were performed in accordance with the relevant guidelines and regulations.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dobrosavljevic, M., Landén, M., Brikell, I. et al. A five-year risk prediction model of cardiovascular disease in individuals with bipolar disorder: a nationwide register study from Sweden. Mol Psychiatry (2025). https://doi.org/10.1038/s41380-025-03381-7
Received:
Revised:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41380-025-03381-7




