Abstract
Antihypertensives are associated with increased risk of syncope, hypotension, and fractures, but the highest-risk individuals are unclear. This study aimed to develop and validate three models to predict these outcomes in patients with an indication for antihypertensive treatment. A cohort study was conducted using data from Clinical Practice Research Datalink (CPRD). Patients aged 40+ with systolic blood pressure 130-179 mmHg were included. Outcomes were first hypotension, syncope, or fracture leading to hospitalization or death within 10 years. Models were derived from CPRD GOLD data (n = 1,773,224) and validated with CPRD Aurum data (n = 3,805,366). Each model had 31-37 predictors. Validation demonstrated strong discriminative ability (10-year C-statistic: hypotension 0.824; syncope 0.819; fracture 0.790), with close agreement between predicted and observed risks for the hypotension and syncope models. Some underprediction was observed for the fracture model. These models could be used to help reassure patients about the relatively low risk of harm from antihypertensive treatment, or identify the small number of individuals with a higher risk where additional monitoring may be indicated.
Introduction
Hypertension is the leading risk factor for cardiovascular disease, making it an important target for intervention in routine clinical practice1. Blood pressure-lowering with antihypertensive treatment has been shown to be very effective at reducing the risk of cardiovascular events across all age groups2. However, blood pressure lowering is not without harm3. Previous studies have highlighted the increased risk of adverse events such as hypotension, syncope, falls, acute kidney injury, and electrolyte abnormalities, especially in older patients and those with frailty4. For some individuals, where the risk of harm is high, it may not be appropriate to prescribe antihypertensive treatment. In those already treated, interventions such as deprescribing may be considered5,6. To enable informed decision-making, clinicians need to understand an individual’s underlying risk of adverse events, so that this can be weighed against a patient’s likelihood of benefit from new or continued treatment.
In previous studies, clinical prediction models have been developed to estimate the risk of serious falls and acute kidney injury in patients indicated for antihypertensive therapy7,8. Although most patients in these studies with a high risk of acute kidney injury or falls also had a high risk of cardiovascular disease (CVD) a small number were shown to be at high risk of adverse events but low risk of cardiovascular disease8. In these individuals, new or continued treatment may not be appropriate. Using prediction models to understand an individual’s risk of specific adverse events, treatment strategies can be personalised to ensure antihypertensive treatment is only prescribed to those with the most to gain.
In this study, we develop and externally validate three clinical prediction models for adverse events commonly associated with antihypertensive treatment—namely hypotension, syncope, and fracture—using data from over five million patients in the Clinical Practice Research Datalink. These models estimate baseline risk regardless of treatment status and are designed to support personalised prescribing decisions by identifying individuals at higher risk of adverse outcomes.
Results
Study population characteristics
A total of 1,773,224 patients were included in the model development cohort (CPRD GOLD) with a mean age of 59 years (SD 13 years) and a mean systolic blood pressure at study inclusion of 144 mmHg (SD 12 mmHg) (Table 1, figure S1). The 10-year prevalence of hazardous hypotension was 1.6% (n = 28,450), syncope was 2.2% (n = 39,898), and fracture was 4.1% (n = 73,491). The median follow-up time across the whole cohort was 6 years (IQR 2.6 to 10 years).
In the validation cohort, 3,805,366 patients were included, with 63,019 (1.7%) experiencing a hazardous hypotensive event, 84,262 (2.2%) a syncope event and 151,630 (4.0%) a fracture event during 10-year follow-up (table S3). Median follow-up time in the validation cohort was 7.0 years (2.9 to 10). Ethnicity data were more complete in the validation cohort compared to the development cohort (81% vs 44% complete data).
Model Development
A total of 31 predictors were included in the final STRATIFY-Hypotension model, after the exclusion of covariates with little or no association with hypotension (Table 2). High social deprivation, current smoking status, previous hypotension, chronic kidney disease, and Parkinson’s disease were strong predictors of hypotension resulting in hospitalisation or death. Prescription of all types of antihypertensive medication were associated with an increased risk of hazardous hypotension, with ACE inhibitors (SHR 1.41, 95% CI 1.37 to 1.45), angiotensin II receptor antagonists (SHR 1.36, 95% CI 1.30 to 1.43) and alpha blockers (SHR 1.35, 95% CI 1.26 to 1.45) conferring the greatest risk.
A total of 31 predictors were included in the final STRATIFY-Syncope model (Table 2). Covariates predictive of syncope were similar to those predictive of hypotension, with the exception of South Asian ethnicity, other ethnicity, dementia and heart failure which were all associated with a reduced risk of syncope requiring hospitalisation or leading to death, and antipsychotic medication prescription which was associated with increased risk.
A total of 37 predictors were included in the final STRATIFY-Fracture model (Table 2). Of these, 14 were unique predictors of fracture that were not included in the STRATIFY-Hypotension and STRATIFY-Syncope models. Strong predictors of fracture included heavy drinking, female sex, chronic liver disease, previous fracture, multiple sclerosis, epilepsy, osteoporosis and rheumatoid arthritis. All antihypertensive medications had a weak or no association with the risk of fracture. Other medications were associated with an increased risk of fracture, with the exception of hormone replacement therapy which conferred a lower risk of fracture (Table 2).
Age was not linearly related with any of the outcomes so transformations were used. Miscalibration was observed across all models at 5 and 10 years so they were recalibrated to the observed pseudo-values in the development dataset (figure S2).
External validation
The distribution of the prognostic index for the derivation and external validation datasets can be seen in the appendix (figure S3), and model performance statistics are given in Table 3, S4 and S5 and Fig. 1. The final STRATIFY-Hypotension model exhibited strong discriminative ability at 10 years (C-statistic 0.824, 95% CI 0.823 to 0.826) and close agreement between predicted and observed risks depending on time horizon (Observed/Expected [O/E] at 10 years 0.983, 95% CI 0.961 to 1.005). The STRATIFY-Syncope model also showed strong discriminative ability (C-statistic at 10 years 0.819, 95% CI 0.817 to 0.821) and close agreement between predicted and observed risks (O/E ratio at 10 years 1.028, 95% CI 1.009 to 1.047). The STRATIFY-Fracture model showed good discrimination (C-statistic at 10 years 0.790, 95% CI 0.789 to 0.792) and close agreement between predicted and observed risks but with some underprediction for low probabilities (O/E ratio at 10 years 1.13, 95% CI 1.11 to 1.14). Model performance varied more among smaller practices, with more consistent performance seen as practice size increased (figures S4 to S6).
Green line corresponds to the line of equality, red line the calibration curve with 95% CI and yellow dots the deciles of predicted risk. The 5- and 10-year models were re-calibrated using the derivation dataset. For one-year re-calibration was not required. Groups represent tenths of the linear predictor, as created between deciles. Histograms underneath each calibration plot show the distribution of predicted probabilities.
Using a threshold of 5% across all models 732,598 (41%) of the patients were classified as high risk for at least one of the three adverse events at 10 years. Amongst the patients who were classified as high risk for at least one adverse event, 280,326 (38%) were classified as high risk for all three, 287,345 (39%) were classified high risk only for fracture, 31,289 (4.3%) only for syncope and 2,829 (0.4%) only for hypotension. ~24,195 (3.3%) were at high risk for both syncope and hypotension (Fig. 2). Results were similar when using a 10% threshold to define high risk patients in each model (figure S7).
Venn diagrams showing the overlap of high-risk patients ( ≥ 5%) as classified using the final STRATIFY-Hypotension, STRATIFY-Syncope, and STRATIFY-Fracture in the CPRD Gold cohort (derivation). Denominator population is the total number of patients with either a high ( ≥ 5%) STRATIFY-Hypotension, STRATIFY-Syncope or STRATIFY-Fracture risk.
Decision curve analysis indicated that all three models had clinical utility across all three time points (Fig. 3). For example, using the STRATIFY-Hypotension model with a 10-year time horizon to guide decisions on prescribing would result in a higher net benefit compared to a “deprescribe/don’t treat anyone” strategy, and the same was true for the STRATIFY-Syncope model and the STRATIFY-Fracture model.
Decision curves, showing the smoothed, standardised net benefit of using the prediction models across different threshold probabilities for assigning treatment. Treat all corresponds to introducing adverse event prevention measures (which may include deprescribing) for all patients and treat none corresponds to not introducing adverse event prevention measures for all patients.
Subgroup analyses of the 10-year risk models showed similar performance in younger ( < 65 years) and older patients ( ≥ 65 years) and in females and males (figures S8, S10; tables S6 and S7). There was some evidence of under-prediction of hypotension risk in patients of white, black and South Asian ethnicity (figures S11, S12, table S8), although net benefit was consistent across ethnic minority groups for all three models (figure S13).
Comparison with CVD risk
When using a 10% risk threshold for both cardiovascular disease and adverse events, no patients had a high risk of adverse events but low risk of cardiovascular disease. At the 5% threshold, among those patients with a low risk of cardiovascular disease at 10 years, 244 (0.01%) had a high risk of hypotension, 2656 (0.2%) had a high risk of syncope and 17,040 (1%) had a high-risk fracture. Most patients had a high risk of cardiovascular disease but low risk adverse events (Fig. 4).
Discussion
This study developed three clinical prediction models for adverse events related to antihypertensive treatment, which estimate the baseline risk of hypotension, syncope and fracture over the next 1, 5 and 10 years. The models demonstrated good discrimination and suggested that individuals were most likely to be classified at high risk of a fracture, followed by syncope and hypotension. Calibration showed close agreement between predicted and observed risks for all models across all time horizons, with the exception of the hypotension model at 1 year, which tends to underestimate the risk. Some minor underprediction was also observed for the fracture model at 10 years for low predicted probabilities. Given this, we would suggest caution in using the 1-year hypotension model at this stage.
When compared to risk of cardiovascular disease, only a very small proportion of patients ( < 1%) were found to be at high risk of adverse events and low risk of cardiovascular disease. A large proportion of patients (39-56%) were identified to have high risk of cardiovascular disease and low risk of adverse events. This suggests that for the majority of people, the potential benefits of treatment will outweigh the risk of hypotension, syncope or fracture. The information from these models may therefore be useful in helping patients make informed decisions about their treatment options, potentially reducing unnecessary worry or apprehension.
As part of the clinical utility assessment, the predicted risks from each model were compared with the risk of cardiovascular disease, generated by the QRisk2 algorithm for specific thresholds9. This latter model was recommended by NICE during the study period and estimates the risk of cardiovascular disease in patients aged 35-84 years over a 10-year period, but does not consider the competing risk of death10. This can lead to overestimation of cardiovascular risk, particularly over shorter timeframes and in older patients with multiple health conditions where the competing risk of death from other causes is higher11. As a result, analyses showing that the risk of cardiovascular disease outweighs the risk of adverse events should be interpreted with caution, particularly over shorter timeframes.
Of the adverse events examined in the present study, fracture has most commonly been studied in previous risk prediction modelling studies12. Common examples include the FRAX score, Garvan Fracture Risk Calculator and the QFracture tool13,14,15. These typically focus on hip and osteoporotic fracture (FRAX and QFracture) and display varying performance upon external validation due to differences in underlying population and input variables12,14,15. However, unlike the present STRATIFY models, none of these previous models takes into account the competing risk of death and this has been shown to lead to significant over-prediction of fracture risk in older patients with multi-morbidity16. This is important when considering adverse event risk in particular, where one treatment strategy for high risk patients might include deprescribing, or not starting therapy which still carries benefit. Few studies have examined the risk of hypotension or syncope, but these tend to focus on risk prediction during emergency department admission and inpatient stays in hospital17,18,19,20. One study examined the risk of postural hypotension in the community and found moderate discrimination, but this model was not externally validated and calibration was not assessed21.
Clinical guidelines for the management of hypertension are increasingly recommending consideration of deprescribing antihypertensive therapy in specific circumstances, where the benefits of treatment may be outweighed by the harms22. In the UK, the National Institute for Health and Care Excellence currently advises that clinicians should use clinical judgement in blood pressure lowering treatment decisions in the presence of multimorbidity10. The purpose of developing these risk prediction models was to help clinicians estimate the baseline risk of adverse reactions. The models can be applied to both patients that are on antihypertensive treatment or patients for which treatment is being considered. Clinicians can then combine the estimated baseline risk with relative treatment effects obtained from well conducted randomised clinical trials or observational studies to estimate how the risk gets modified by starting, changing or altering the dosage of a medication3,4. Based on this updated risk different treatment strategies can be considered depending on the outcome of interest. For example, in patients at high risk of hypotension and fracture, modification of antihypertensive treatment may be considered, whereas in patients at risk of fracture alone, other prevention strategies may be more appropriate. These tools should be used alongside CVD risk estimation tools to get a more complete picture of the harm/benefit profile of the patient. This can enable better informed decisions regarding when to prescribe, continue or deprescribe antihypertensive treatment. To this end, these algorithms could easily be integrated into electronic health records systems to work alongside existing risk stratification tools such as QRisk9.
All three models suggested net clinical benefit when compared to usual care (with the exception of the 1-year hypotension model), which typically would not involve modifying treatment to account for adverse event risk. However, very few patients who were at high risk of serious hypotension or syncope were also observed to be at low risk of cardiovascular disease (between 0.01% and 1%), thus implementing interventions which withhold or deprescribe treatment due to the risk of hypotension or syncope alone is only likely to be considered for a very small number of patients.
Slightly more individuals were at high risk of fracture and low risk of cardiovascular disease (1%) when using a fracture and cardiovascular risk threshold of 5%, however, the direct association between antihypertensive treatment and fractures is disputed and is likely to be small3,4,23,24. Therefore enthusiasm for intervening in such patients should be tempered by a likely small effect from withholding or deprescribing treatment. For most patients, these models should be employed to provide reassurance that the risk of adverse events is low for the vast majority of the population even after taking into consideration treatment effects from other studies. This information can be useful for clinicians and patients in helping them to make informed decisions about their treatment options, potentially reducing unnecessary apprehension about starting treatment. Where patients are considered at high risk of adverse events such as fracture, other strategies besides modifying antihypertensive treatment may be more appropriate such as monitoring or addressing some of the other risk factors that might be increasing the risk.
This analysis has several strengths, including the robust analytical approach with both internal and external validation across multiple practices, which demonstrate each of the models’ reliability and generalisability within the UK primary care population25. There are also some limitations. These data may not accurately capture all events of interest (due to incorrect or incomplete coding), potentially affecting the model’s performance, especially if certain events are systematically underreported or misclassified26. Findings from clinical utility analyses should be interpreted with caution, as they estimate factual risk (i.e., risk based on baseline characteristics). The net benefit in Decision Curve Analysis (DCA) reflects the clinical utility of using the model’s predictions for treatment decision-making, without accounting for the causal effect of altering antihypertensive medication on outcomes. This study primarily aims to predict baseline risk and does not evaluate how treatment changes will modify these risks. While the model enhances risk stratification and supports decision-making by identifying high- and low-risk patients, it does not provide direct insight into how starting or adjusting antihypertensive treatments affects patient outcomes. Thus, any conclusions regarding treatment effects should rely on additional evidence, such as randomised controlled trials or observational studies. However, it is reasonable to assume that a model with higher net benefit may still improve clinical outcomes by better targeting interventions
The present study used large datasets of electronic health records to derive and externally validate three clinical prediction models to estimate the baseline risk of adverse events associated with antihypertensive therapy. These models were shown to perform well but revealed that only a small proportion of patients eligible for antihypertensive treatment are at high risk of adverse events in the short to medium term. Therefore, these models are most likely to be of clinical utility in providing reassurance to patients considering antihypertensive treatment, potentially reducing unnecessary worry or apprehension.
Methods
Ethics approval
The study protocol was approved by CPRD’s Independent Scientific Advisory Committee in February 2019 before obtaining the data relevant to the project (protocol given in the eAppendix in the Supplement). All data are fully anonymised so consent was not required. A project summary is published on the CPRD website (https://www.cprd.com/isac). Elements of the methodology used in this manuscript have been previously reported in related publications from the same research programme, using the same dataset and protocol and are summarised here for completeness7,8,27.
Design
We conducted a retrospective observational cohort study to develop three clinical prediction models using data from the Clinical Practice Research Datalink (CPRD) GOLD. This dataset includes primary care records from general practices that use the Vision electronic health record system (Cegedim Healthcare Solutions, London, England). The cohort comprised 11.33 million patients from 674 general practices, of whom 4.4 million were active (alive)28. For external validation, we used a second retrospective observational cohort based on CPRD Aurum, which contains data from practices using Egton Medical Information Systems (EMIS, Leeds, England) which t included 19 million patients from 738 practices, with 7 million active patients29. Both CPRD GOLD and Aurum datasets are representative of the UK population in terms of age, sex, ethnicity, and deprivation28,29. Primary care data from both sources were linked to additional datasets, including Office for National Statistics (ONS) mortality data, Hospital Episode Statistics (HES), and the Index of Multiple Deprivation (IMD). The study protocol was approved by the Independent Scientific Advisory Committee (ISAC) for CPRD (protocol number 19_042; see Protocol S1 in the supplementary material).
Population
Patients were eligible for inclusion if they were registered at a general practice in England contributing linked data to CPRD GOLD between 1 January 1998 and 31 December 2018. To avoid duplicate entries, individuals appearing in both CPRD GOLD and CPRD Aurum due to transitions between electronic health record systems during the study period were excluded from the CPRD Aurum (validation) dataset. Inclusion criteria required patients to be aged 40 years or older at the time of data entry (with no upper age limit), registered with a CPRD “up-to-standard” practice (for GOLD only), and to have records available during the defined study period.
Patients entered the cohort at the point they became potentially eligible for antihypertensive therapy, defined as the date of their first systolic blood pressure measurement ≥130 mmHg following the start of the study period. Follow-up continued for a maximum of 10 years.
The 130 mmHg threshold was selected to align with the varying treatment initiation criteria outlined in international hypertension guidelines22,30. Patients with a systolic blood pressure measurement ≥180 mmHg were excluded from the cohort, as treatment would be indicated for them regardless of their estimated risk of adverse outcomes.
Baseline patient characteristics and model predictors were assessed at the index date, defined as 12 months after cohort entry. The same eligibility criteria and procedures for determining baseline characteristics were applied consistently to both the development and validation cohorts.
Patients exited the cohort at the end of follow-up (31 December 2018) or upon transferring out of a CPRD-registered practice, death, or occurrence of the specific outcome of interest.
Outcomes
For each model, the primary outcome was defined as any hospitalisation or death with a primary diagnosis of (hazardous) hypotension, syncope, or fracture occurring within 10 years of the index date. This 10-year time horizon aligns with those used in established cardiovascular risk prediction models9. Outcomes were identified using ICD-9 and ICD-10 codes recorded in Hospital Episode Statistics (HES) and Office for National Statistics (ONS) mortality data (see Table S1 for code lists). Pre-specified secondary outcomes included hypotension, syncope, or fracture (defined identically) occurring within 1 and 5 years of the index date, to account for potential short-term clinical relevance.
Model covariates
Potential predictors of hazardous hypotension, syncope, and fracture were identified based on published literature and consultation with clinical experts. A detailed summary of included variables is provided in Supplementary Table S2. A total of 40 predictors were assessed for the hypotension model, 41 for the syncope model, and 44 for the fracture model. These included demographic characteristics (such as age, sex, smoking history, and alcohol intake), medical history (e.g., prior relevant adverse events, diabetes, chronic kidney disease, stroke, atrial fibrillation, arrhythmias, osteoporosis, rheumatoid arthritis, epilepsy), and current medications (including but not limited to antihypertensives, opioids, sedatives, antidepressants, corticosteroids, and proton pump inhibitors; see Table S2). All comorbidities and clinical history were defined using relevant Read codes recorded any time before the index date. In contrast, medication exposure was defined by at least one prescription issued in the 12 months prior to index.
Sample size
A pre-specified sample size calculation was used to guide model development, yielding an estimated events-per-variable (EPV) range from 7 for the hypotension model to 20 for the fracture model. These estimates were based on assumptions of event rates between 18 and 51 per 10,000 person-years, a median follow-up duration of 7 years, an anticipated Nagelkerke’s R² value of 0.15, and a maximum of 40 candidate predictor parameters per model4,31. Under these assumptions, the required number of outcome events was estimated to range from 277 to 784. The development cohort from CPRD GOLD substantially exceeded these requirements.
For external validation, the syncope model required a minimum sample of ~8000 individuals, including at least 400 events, to ensure a 95% confidence interval width of 0.2 around the estimated calibration slope32. This estimate was derived under several assumptions: a skew-normal distribution for the linear predictor with mean 0.16, variance 0.5, skewness 1, and kurtosis 4; an exponential survival time distribution with a baseline hazard rate of 0.008 (corresponding to 89% survival at 10 years); and an exponential distribution for censoring times with a rate of 0.2 (implying ~87% censoring by 10 years). Comparable sample size estimates were obtained for the other outcomes. The CPRD Aurum validation dataset also exceeded these sample size thresholds.
Statistical analysis
The analysis was conducted using the statistical software R versions 4.02 and 4.1.1 and STATA 16. All models are reported in line with the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) guidelines for reporting of clinical prediction models (see Guideline S1 in appendix)33. Descriptive statistics were calculated for baseline characteristics in the model development and external validation cohorts separately.
Model development
Model development and internal validation were carried out by researchers at the University of Oxford (CK, AW, JPS). For each imputed dataset, multivariable models were constructed using Fine-Grey subdistribution hazard regression to account for the competing risk of death from other causes34. This approach was selected to avoid overestimating the cumulative incidence of adverse events in the presence of competing mortality35. Model coefficients are presented as subdistribution hazard ratios (SHRs) with corresponding 95% confidence intervals. Baseline cumulative incidence functions were estimated post-estimation using a Breslow-type method as outlined by Fine and Grey34. Analyses were undertaken using the fastcmprsk package in RStudio36. Automated selection algorithms were not employed; all predictors were predefined based on prior literature and expert clinical judgement. Given the large sample size, most variables would be statistically significant, so further filtering was used to improve parsimony. Specifically, predictors with SHRs close to 1 (i.e., between 0.95 and 1.05) and low prevalence were excluded in the final model-fitting stage. Shrinkage or penalisation methods to adjust for overfitting were not necessary due to the very large sample size.
To assess the linearity of continuous predictors (age, systolic and diastolic blood pressure, frailty index), fractional polynomial transformations were used37. The best-fitting transformation for each continuous covariate was applied uniformly across all imputations to ensure consistent coefficient estimates.
Potential interactions between age, sex, and antihypertensive therapy were explored but were excluded due to convergence issues, model instability and for the sake of parsimony. The proportional hazards assumption for each covariate was evaluated using Schoenfeld residuals38.
Apparent validation using development data
Apparent calibration of the models was evaluated using calibration plots that compared predicted and observed risks at 1, 5, and 10 years. Observed outcome probabilities were estimated using pseudo-values—jackknife-based estimators that quantify an individual’s contribution to the cumulative incidence function for each outcome while accounting for the competing risk of death, derived using the Aalen–Johansen method39. To enhance stability, pseudo-values were computed separately within 50 groups stratified by linear predictor values. These calculations accounted for both competing risks and non-informative right censoring40,41. Calibration plots were constructed from the pseudo-values, incorporating a non-parametric smooth curve (symmetric nearest neighbour smoothing) with 95% confidence intervals to visualise the agreement between predicted and observed risks across the full risk spectrum42. Plots were generated separately for each imputed dataset, and consistency across imputations was assessed.
When miscalibration was observed at any time point (1, 5, or 10 years), the original model was recalibrated for that time point by fitting a generalised linear model with a logit link to the observed pseudo-values in the development cohort. The recalibration model used only the original model’s linear predictor as the independent variable and allowed for non-linear recalibration via fractional polynomials. These recalibrated models were then subjected to external validation using the independent validation dataset.
External validation
External validation of the prediction models was conducted independently by researchers at the University of Birmingham (LA, KIES, RDR), separate from the model development team. The full prediction algorithms are presented in the supplementary appendix (Equations S1–S3) and were applied to individuals in the validation dataset to generate predicted probabilities of hypotension, syncope, or fracture within 1, 5, and 10 years, accounting for the competing risk of death from other causes43. Calibration was assessed by comparing predicted risks to observed event probabilities, estimated using pseudo-values as described previously39.
Model performance was summarised using calibration plots, observed-to-expected (O/E) ratios, Harrell’s C-statistic, and Royston’s D-statistic along with its associated R² each calculated using the pseudo-values as described above. To evaluate variability in model performance across general practices, we used random-effects meta-analysis with restricted maximum likelihood estimation (REML), acknowledging that case mix and event incidence may differ between sites25,44. The O/E ratio was pooled on the natural log scale, the C-statistic on the logit scale (standard errors derived via the delta method), and the D-statistic was pooled on its original scale45,46. Pooled estimates are reported with prediction intervals (PI) to give an indication of expected model performance in a new GP practice.
Clinical utility was examined using decision curve analysis for a range of potential threshold probabilities probabilities47. A decision threshold is defined as the probability at which a patient is classified as high risk and thus a decision has to be made (i.e., treat, refer for further investigation, etc.). The range of probabilities should reflect potential decision thresholds for the STRATIFY models. In the UK a CVD risk of 10% is considered high and given that the STRATIFY models should be used alongside a CVD risk tool a range of threshold probabilities of up to 20% was considered reasonable. The final models for each outcome were compared at 1, 5 and 10 years to ‘model-blind methods’ of (a) introducing adverse event prevention measures (which may include deprescribing) for all patients or (b) not introducing adverse event prevention measures for all patients, regardless of risk. If the models have higher net benefit than the strategies (a) or (b) then this would suggest using the models to inform prescribing would be preferable.
Direct comparison with existing cardiovascular risk prediction tools (e.g., QRisk2) using decision curve analysis is not possible. Therefore, to further explore clinical utility and potential implementation, we examined the relationship between predicted risk of hypotension, syncope, and fracture (at 1, 5, and 10 years) and 10-year cardiovascular risk using QRisk2 at 5% and 10% thresholds9. The overlap in patients identified as high-risk ( ≥ 5% and ≥10%) by the STRATIFY models versus QRisk2 was quantified and visualised using Venn diagrams.
Missing data
To address missing data in both the development and validation cohorts, we employed multiple imputation using chained equations, generating ten imputed datasets for each cohort. Separate imputation procedures were conducted independently for the development and validation datasets. The imputation models included all model covariates within each dataset, along with the Nelson-Aalen estimator for the cumulative baseline cause-specific hazards for hypotension, syncope or fracture and for the competing event of death, and binary event indicators for each of these possible event types48,49. For comorbidity diagnoses and prescribed medications, missingness was handled under the assumption that absence of data reflected absence of diagnosis or prescription. Variables imputed included ethnicity, body mass index (BMI) category, smoking status, alcohol consumption, and (in the validation cohort only) the deprivation score.
Imputations were assessed for consistency and validity by comparing density plots, histograms, and summary statistics across imputations and back to complete values. Following imputation, model coefficients and performance metrics were estimated separately within each imputed dataset and then combined using Rubin’s Rules50. In instances where Rubin’s Rules were inappropriate due to non-normal posterior distributions, summary measures across imputations were reported using the median and interquartile range (IQR)51. A sensitivity analysis using a full case approach was also employed to compare with the imputed models.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
This study is based on data obtained through an institutional license from the Clinical Practice Research Datalink (CPRD). Access to these data is restricted and permitted only to researchers with approved study protocols reviewed by CPRD’s Independent Scientific Advisory Committee (ISAC). Data access may involve a fee and is subject to specific governance and licensing conditions. Full details on how to request access, including contact information and procedures, are available at: (https://www.cprd.com/data-access).
Code availability
The developed algorithms are freely available for research use and can be downloaded from (https://process.innovation.ox.ac.uk/software/). Code lists used to define variables included in the dataset are available at (https://github.com/jamessheppard48/STRATIFY-BP (https://doi.org/10.5281/zenodo.15481343).
References
Lewington, S., Clarke, R., Qizilbash, N., Peto, R. & Collins, R. Age-specific relevance of usual blood pressure to vascular mortality: A meta-analysis of individual data for one million adults in 61 prospective studies. Lancet 360, 1903–1913 (2002).
Rahimi, K. et al. Age-stratified and blood-pressure-stratified effects of blood-pressure-lowering pharmacotherapy for the prevention of cardiovascular disease and death: an individual participant-level data meta-analysis. Lancet 398, 1053–1064 (2021).
Albasri, A. et al. Association between antihypertensive treatment and adverse events: systematic review and meta-analysis. BMJ 372, n189 (2021).
Sheppard, J. P. et al. The association between antihypertensive treatment and serious adverse events by age and frailty: a cohort study. PLoS Med 20, e1004223 (2023).
Sheppard, J. P., Benetos, A. & McManus, R. J. Antihypertensive deprescribing in older adults: a practical guide. Curr. Hypertens. Rep. 24, 571–580 (2022).
Sheppard, J. P. et al. Effect of antihypertensive medication reduction vs usual care on short-term blood pressure control in patients with hypertension aged 80 years and older: the OPTIMISE randomized clinical trial. JAMA 323, 2039–2051 (2020).
Archer, L. et al. Development and external validation of a risk prediction model for falls in patients with an indication for antihypertensive treatment: retrospective cohort study. BMJ 379, e070918 (2022).
Koshiaris, C. et al. Predicting the risk of acute kidney injury in primary care: derivation and validation of STRATIFY-AKI. Br. J. Gen. Pract. 73, e605 (2023).
Hippisley-Cox, J. et al. Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2. BMJ 336, 1475–1482 (2008).
Overview | Hypertension in adults: diagnosis and management | Guidance | NICE. https://www.nice.org.uk/guidance/ng136 (2023).
Livingstone, S. et al. Effect of competing mortality risks on predictive performance of the QRISK3 cardiovascular risk prediction tool in older people and those with comorbidity: external validation population cohort study. Lancet Healthy Longev. 2, e352–e361 (2021).
Kanis, J. A. et al. Overview of fracture prediction tools. J. Clin. Densitom. 20, 444–450 (2017).
Kanis, J. A., Johnell, O., Oden, A., Johansson, H. & McCloskey, E. FRAX and the assessment of fracture probability in men and women from the UK. Osteoporos. Int 19, 385–397 (2008).
Nguyen, N. D., Frost, S. A., Center, J. R., Eisman, J. A. & Nguyen, T. V. Development of prognostic nomograms for individualizing 5-year and 10-year fracture risks. Osteoporos. Int 19, 1431–1444 (2008).
Hippisley-Cox, J. & Coupland, C. Derivation and validation of updated QFracture algorithm to predict risk of osteoporotic fracture in primary care in the United Kingdom: prospective open cohort study. BMJ 344, e3427 (2012).
Livingstone, S. J. et al. Effect of competing mortality risks on predictive performance of the QFracture risk prediction tool for major osteoporotic fracture and hip fracture: external validation cohort study in a UK primary care population. BMJ Med. 1, e000316 (2022).
Yoon, J. H. et al. Prediction of hypotension events with physiologic vital sign signatures in the intensive care unit. Crit. Care 24, 1–9 (2020).
Del Rosso, A. et al. Clinical predictors of cardiac syncope at initial evaluation in patients referred urgently to a general hospital: the EGSYS score. Heart 94, 1620–1626 (2008).
Quinn, J. V. et al. Derivation of the San Francisco Syncope Rule to Predict Patients with Short-Term Serious Outcomes. Ann. Emerg. Med 43, 224–232 (2004).
Reed, M. J. et al. The ROSE (risk stratification of syncope in the emergency department) study. J. Am. Coll. Cardiol. 55, 713–721 (2010).
Clark, C. E. et al. Detecting risk of postural hypotension (DROP): derivation and validation of a prediction score for primary care. BMJ Open 8, e020740 (2018).
Mancia, G. et al. 2023 ESH guidelines for the management of arterial hypertension the task force for the management of arterial hypertension of the European Society Of Hypertension: endorsed by the International Society of Hypertension (ISH) and the European Renal Association (ERA). J. Hypertens. 41, 1874–2071 (2023).
Butt, D. A. et al. The risk of hip fracture after initiating antihypertensive drugs in the elderly. Arch. Intern Med 172, 1739–1744 (2012).
Ruths, S. et al. Risk of hip fracture among older people using antihypertensive drugs: a nationwide cohort study. BMC Geriatr. 15, 153 (2015).
Riley, R. D. et al. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges. BMJ 353, i3140 (2016).
Herrett, E. et al. Completeness and diagnostic validity of recording acute myocardial infarction events in primary care, hospital care, disease registry, and national mortality records: cohort study. BMJ 346, f2350 (2013).
Wang, A. et al. Developing prediction models for electrolyte abnormalities in patients indicated for antihypertensive therapy: evidence-based treatment and monitoring recommendations. J. Hypertens. https://doi.org/10.1097/HJH.0000000000004032 (2025).
Herrett, E. et al. Data resource profile: Clinical Practice Research Datalink (CPRD). Int J. Epidemiol. 44, 827–836 (2015).
Wolf, A. et al. Data resource profile: Clinical Practice Research Datalink (CPRD) Aurum. Int J. Epidemiol. 48, 1740–1740g (2019).
Whelton, P. K. et al. 2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ ASH/ASPC/NMA/PCNA guideline for the prevention, detection, evaluation, and management of high blood pressure in adults a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Hypertension 71, E13–E115 (2018).
Riley, R. D. et al. Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Stat. Med 38, 1276–1296 (2019).
Riley, R. D. et al. Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome. Stat. Med 41, 1280–1295 (2022).
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD Statement. BMC Med 13, 1–10 (2015).
Fine, J. P. & Gray, R. J. A proportional hazards model for the subdistribution of a competing risk. J. Am. Stat. Assoc. 94, 496 (1999).
Feakins, B. G., McFadden, E. C., Farmer, A. J. & Stevens, R. J. Standard and competing risk analysis of the effect of albuminuria on cardiovascular and cancer mortality in patients with type 2 diabetes mellitus. Diagn. Progn. Res 2, 1–9 (2018).
Kawaguchi, E. S., Shen, J. I., Li, G. & Suchard, M. A. A fast and scalable implementation method for competing risks data with the R package fastcmprsk. R. J. 12, 1–18 (2019).
Royston, P., Ambler, G. & Sauerbrei, W. The use of fractional polynomials to model continuous risk variables in epidemiology. Int J. Epidemiol. 28, 964–974 (1999).
Schoenfeld, D. Partial residuals for the proportional hazards regression model. Biometrika 69, 239 (1982).
Aalen, O. O. & Johansen, S. An empirical transition matrix for non-homogeneous markov chains based on censored observations. Scand. J. Stat. 5, 141–150 (1978).
Graw, F., Gerds, T. A. & Schumacher, M. On pseudo-values for regression analysis in competing risks models. Lifetime Data Anal. 15, 241–255 (2009).
Andersen, P. K. & Pohar Perme, M. Pseudo-observations in survival analysis. Stat. Methods Med Res 19, 71–99 (2010).
Royston, P. Tools for checking calibration of a Cox model in external validation: Approach based on individual event probabilities. Stata J. 14, 738–755 (2014).
Austin, P. C. & Fine, J. P. Practical recommendations for reporting Fine-Gray model analyses for competing risk data. Stat. Med 36, 4391–4400 (2017).
Rahman, M. S., Ambler, G., Choodari-Oskooei, B. & Omar, R. Z. Review and evaluation of performance measures for survival prediction models in external validation settings. BMC Med Res Methodol. 17, 1–15 (2017).
Snell, K. I. E., Ensor, J., Debray, T. P. A., Moons, K. G. M. & Riley, R. D. Meta-analysis of prediction model performance across multiple studies: which scale helps ensure between-study normality for the C-statistic and calibration measures? Stat. Methods Med Res 27, 3505–3522 (2018).
Debray, T. P. A. et al. A framework for meta-analysis of prediction model studies with binary and time-to-event outcomes. Stat. Methods Med Res 28, 2768–2786 (2019).
Vickers, A. J., Van Calster, B. & Steyerberg, E. W. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ 352, i6 (2016).
White, I. R. & Royston, P. Imputing missing covariate values for the Cox model. Stat. Med 28, 1982 (2009).
Lau, B. & Lesko, C. Missingness in the setting of competing risks: from missing values to missing potential outcomes. Curr. Epidemiol. Rep. 5, 153–159 (2018).
Rubin, D. B. et al. Multiple Imputation for Nonresponse in Surveys. https://doi.org/10.1002/9780470316696 (1987).
Marshall, A., Altman, D. G., Holder, R. L. & Royston, P. Combining estimates of interest in prognostic modelling studies after multiple imputation: current practice and guidelines. BMC Med Res Methodol. 9, 1–8 (2009).
Acknowledgements
We thank Lucy Curtin for administrative support throughout the project, and Simon Griffin and Juliet A Usher-Smith (Department of Public Health and Primary Care, Primary Care Unit, University of Cambridge) for their contributions as STRATIFY Investigators to the project. We thank Lucy Curtin for administrative support throughout the project, and Simon Griffin and Juliet A Usher-Smith (Department of Public Health and Primary Care, Primary Care Unit, University of Cambridge) for their contributions as STRATIFY Investigators to the project. The Hospital Episode Statistics data used in this analysis are re-used with permission from NHS Digital who retain the copyright for those data. We thank the Office for National Statistics for providing data on mortality. The Office for National Statistics, and NHS Digital bear no responsibility for the analysis or interpretation of the data. Finally, we are very grateful to all those patients who permit their anonymized routine NHS data to be used for this approved research. For this research, JPS and CK were funded in whole, or in part, by the Wellcome Trust/Royal Society via a Sir Henry Dale Fellowship held by JPS (ref: 211182/Z/18/Z) and the National Institute for Health Research (NIHR) School for Primary Care (project 430). LA, KIES and RR are supported by funding from the NIHR Birmingham Biomedical Research Centre at the University Hospitals Birmingham NHS Foundation Trust and the University of Birmingham. CEC is part supported by a NIHR School for Primary Care grant (project 580). RJMcM is supported by an NIHR Senior Investigator award and by NIHR ARC Oxford Thames Valley. FDRH acknowledges part support from the NIHR ARC Oxford Thames Valley, and the NIHR Oxford OUH BRC. KIES is funded by an NIHR SPCR Launching Fellowship. AB has received research funding from Astra-Zeneca, NIHR, BMA Medical Research Foundation and UKRI. RAP receives funding from the NIHR. AC is part funded by NIHR ARC Yorkshire & Humber and Health Data Research UK, an initiative funded by UK Research and Innovation Councils, National Institute for Health Research and the UK devolved administrations, and leading medical research charities. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. The sponsor and funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Author information
Authors and Affiliations
Consortia
Contributions
J.P.S. conceived the project and wrote the protocol with F.D.R.H., R.J.M., R.J.S., and R.D.R. C.K., S.S. and A.W. extracted data for analysis. CK and AW developed the models under supervision of JS and RS; FDRH, CEC, RJM, AC, AB and RAP advised on model development. LA validated the model under supervision of RDR and KIES. JS wrote the first draft of the manuscript. MO contributed as the Patient and Public Involvement (PPI) representative. All authors revised the manuscript and approved the final version. JPS is the guarantor for this work and accepts full responsibility for the conduct of the study, had access to the data, and controlled the decision to publish. The principal investigator (JPS) attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.
Corresponding authors
Ethics declarations
Competing interests
All authors have completed the ICMJE uniform disclosure form at (www.icmje.org/coi_disclosure.pdf) and declare: authors had financial support from the Wellcome Trust, Royal Society, and National Institute for Health Research for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.
Peer review
Peer review information
Nature Communications thanks Sante D. Pierdomenico, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Koshiaris, C., Wang, A., Archer, L. et al. Predicting hypotension, syncope, and fracture risk in patients indicated for antihypertensive treatment: the STRATIFY models. Nat Commun 16, 9371 (2025). https://doi.org/10.1038/s41467-025-64408-9
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-64408-9



