Introduction

Polycythemia vera (PV) is a rare hematologic cancer belonging to the family of chronic myeloproliferative neoplasms (MPN) and driven by mutations in the Janus kinase 2 (JAK2) gene, most frequently the JAK2V617F. The presence of an activating JAK2 mutation drives the aberrant proliferation of hematopoietic cells [1, 2] leading to erythrocytosis, thrombocytosis, leukocytosis and increased cytokine production resulting in systemic symptoms typical of PV [3, 4].

Patients with PV face a substantial risk of thrombotic complications, with a 2.7- to 13.1-fold higher likelihood of developing arterial and venous thrombosis, respectively, compared to age- and sex-matched controls [5]. Therefore, the clinical management of PV is focused on reducing the risk of such events, which remain the leading cause of morbidity and mortality.

Advanced age and a history of previous thrombotic events are two well-established clinical risk factors for predicting future thromboembolic episodes [6,7,8,9,10]. Based on these two risk factors, patients with PV are currently classified as low-risk (LR) (absence of both risk factors) and high-risk (HR) (history of thrombosis and/or age ≥60 years) [4, 11]. However, a growing body of literature suggests that advanced age and prior thrombosis history may rather reflect a limited view of the overall risk profile and are sometimes perceived as suboptimal for proper management of PV. The fact that various biomarkers are currently investigated for risk assessment of PV (including but not limited to allelic ratio [12, 13], red blood cell distribution width (RDW)/neutrophil-lymphocyte ratio [14, 15], and others) emphasizes the importance of improved risk stratification.

In light of this evidence, the European LeukemiaNet (ELN) has recently indicated the use of cytoreductive therapy also in LR patients presenting specific clinical signs and symptoms (CSSs), namely, intolerance to phlebotomy, uncontrolled hematocrit, symptomatic progressive splenomegaly, persistent and/or progressive leukocytosis, extreme thrombocytosis, relevant cardiovascular risk factors [16]. However, the prognostic role of these factors remains so far elusive.

Hydroxyurea (HU) is currently the most widely used first-line cytoreductive therapy for PV patients [17,18,19], offering effective control of the disease and playing a key role in thrombosis prevention with generally acceptable tolerance [20, 21].

In a large multicenter cooperative study, we assessed the impact of ELN clinical signs and symptoms on the thrombotic risk of PV patients homogeneously treated with first-line HU.

Materials and methods

Patients’ collection and study design

This is an observational, retrospective cohort study (PV-ARC, NCT06134102) that was promoted by the IRCCS Azienda Ospedaliero-Universitaria S. Orsola-Malpighi, Bologna, Italy.

The study now involves 1162 patients with PV diagnosed between January 1985 and December 2023. Among them, 739 patients were treated with first-line HU and had data on CSSs available at the start of HU.

Following approval of each IRB, participating centers collectively submitted diagnostic and follow-up information. The totality of medical files from each center was reported via data input into an electronic database that was developed to label all study data with an alphanumeric code after the de-identification of patients to protect personal privacy.

The data collected included patient demographics, medications, clinical/laboratory tests at diagnosis and during follow-up, type of PV therapy and thromboses. All information about concomitant diseases, body mass index (BMI), cardiovascular risk factors (CVRFs), thrombosis history, and drug usage was recorded in each case history and, thereafter, used for retrospective evaluation. Any treatment decision, including the use of phlebotomies, HU and antiplatelet/anticoagulant drugs, was at the physician’s discretion and independent from participation in this study.

After the first data entry, follow-up information was validated with revision of the clinical data, and specific queries were addressed to the participating centers in cases of inconsistent data. All patients were followed until death or the data cut-off date (March 2024).

Statistical analysis

Statistical analysis was carried out at the biostatistics laboratory of the MPN Unit, IRCCS Azienda Ospedaliero-Universitaria S. Orsola-Malpighi, Bologna.

Continuous variables are expressed as mean ± standard deviation (SD), whereas categorical variables are presented as frequencies and percentages. Comparisons of quantitative variables between two groups of patients were carried out using the Wilcoxon–Mann–Whitney rank-sum test and among three groups using the Kruskal–Wallis equality-of-populations rank test. Associations between categorical variables were tested using the χ2 test.

The mean thrombotic risk scores associated with each CVRF were analyzed by a one-way analysis of variance (ANOVA).

A Poisson regression model was applied to calculate the incidence rate ratios (IRR) of thrombotic events. The IRR was described as the number of events per 100 patient-years (%p-y), considering the time between HU start and the first thrombotic event or last contact.

Variables associated with thrombotic events were identified using univariate and multivariate (MVA) Cox proportional hazards model. Hazard ratio (HR) has been reported with 95% confidence intervals (CI).

Thrombosis-free survival (TFS) was assessed from HU start to the first thrombosis or last contact, using Kaplan–Meier analysis, adjusted for delayed entry, considering the time between PV diagnosis and the start of HU. Differences were evaluated with the log-rank test.

Tests were two-sided and p values < 0.05 were considered significant. Analyses were performed using STATA/SE software version 18.0 (StataCorp).

Definitions

PV was diagnosed according to the WHO 2022 classification [22]. In patients diagnosed before 2022, bone marrow biopsies were internally reviewed to adhere the data to current criteria [22]. In patients without a bone marrow histology (n. 49, 6.6%), PV diagnosis was based on the presence of elevated hemoglobin levels (i.e., >18.5 and 16.5 g/dL in males and females, respectively), the JAK2 Exon 14 or Exon 12 mutations, and low serum erythropoietin. Conventional criteria were used for the diagnosis of post Polycythemia Vera Myelofibrosis (PPV-MF) [23]; leukemic transformation was defined by blast cells being at least 20% in peripheral blood and/or bone marrow [22].

According to conventional criteria, LR patients were defined by age <60 years and no previous thrombotic history. HR patients were defined by age ≥60 years and/or a history of thromboembolism [24]. In this analysis, HR patients were further stratified in two groups: HR-AGE (patients at high risk only due to age ≥60 years) and HR-THRO (patients at high risk due to previous thrombosis, regardless of age).

Thromboses were defined according to the International Classification of Diseases (9th revision) [25] and graded according to the Common Terminology Criteria for Adverse Events (CTCAE) v5.0 [26]. Only grade ≥2 thromboses occurring during HU therapy were recorded and analyzed in the present study.

At HU start, presence and type of CSSs were initially searched as per the ELN definition [16]. However, three criteria were adapted based on real-life experience. In particular: a platelet (PLT) count >1500 × 109/L was never observed in our cohort, so the threshold has been lowered to a PLT count >1000 × 109/L; for the same reason, the need for at least 6 phlebotomies (PHLs) for at least 2 years has been modified to at least 6 PHLs and/or a persistently higher hematocrit (HCT) of 50% for at least 1 year before HU start. This threshold was selected according to the results of the Cyto-PV trial, where patients randomized to maintain HCT 45–50% had poorer outcomes [27].

In absence of a unified definition of “relevant” CVRF [16, 28], the ANOVA test was used to identify the combination of CVRFs most strongly correlated with the risk of thrombosis. Five predefined CVRFs, namely active smoking, hypertension, overweight (BMI ≥ 25), diabetes, and dyslipidemia, were included in the analysis [29].

In summary, CSSs were defined as follows: (1) persistent/progressive leukocytosis: 100% increase if white blood cells (WBC) < 10 × 109/L or 50% increase if WBC ≥ 10, or WBC > 15 × 109/L before HU start; (2) extreme thrombocytosis: PLT > 1000 × 109/L at HU start; (3) progressive splenomegaly: increase by more than 5 cm below costal margin (BCM) in the year before HU start; (4) inadequate HCT control: ≥6 PHLs/year or HCT ≥ 50% at two evaluations within 1 year before HU start or PHL intolerance; (5) “relevant” CVRFs: co-occurrence of active smoking, hypertension and overweight (with or without the presence of diabetes and/or dyslipidemia); (6) severe itching.

In patients at low thrombotic risk who did not receive cytoreduction during follow-up, the presence of CSSs was assessed at least 2 years after diagnosis to ensure the most accurate assessment of the criteria for inadequate hematocrit control.

The presence and severity of itching at the start of HU were assessed during clinical visits. In recent years, the 10-item Myeloproliferative Neoplasm Symptom Assessment Form Total Symptom Score (MPN10-TSS) [30] has been used and “severe” itching was considered with a score ≥5 according to ELN definition. Alternatively, the itching score was considered ≥5 if refractory to specific medication (e.g., antihistamines).

Additional reasons to start hydroxyurea were defined as follows: spleen palpable at 2–5 cm below costal margin, mild leukocytosis or thrombocytosis (WBC > 11 × 109/L or PLT > 450 × 109/L but not meeting the definition of persistent/progressive or extreme), presence of microvascular disturbances despite appropriate phlebotomies and antiplatelet drug therapy (including vascular headaches, dizziness, visual disturbances, burning pain sensation in the palms of the hands and soles of the feet, distal paresthesia and acrocyanosis) [31], ≥3 PHLs in the 12 months before HU start.

Results

Patient cohort

Overall, 739 patients received treatment with first-line HU and were included in this analysis. At therapy start, 137 (18.5%) patients were categorized as LR and 602 (81.5%) as HR. Among them, 424 (70.4%) were HR-AGE and 178 (29.6%) were HR-THRO, with 46 (25.8%) patients only due to previous thrombosis and 132 (74.2%) due to both older age and previous thrombosis (Supplementary Fig. 1).

At least one CVRF was present in 577 (78.1%) patients; specifically, 276 (37.3%), 181 (24.5%), 82 (11.1%) and 38 (5.1%) patients had one, two, three or ≥4 CVRFs. The frequency of individual CVRFs and their inter-relationships are shown in Table 1 and in Fig. 1a.

Fig. 1: The frequency and the pair-wise co-occurrence of cardiovascular risk factors and Clinical sign and symptoms represented by Circos diagrams.
figure 1

Co-occurring CVRFs (a) and CSSs (b) are indicated in the clockwise direction. In the Circos representation, the length of the arc corresponds to the frequency of the CVRF/CSS (color coded) and the width of the ribbon corresponds to the frequency of patients who also had a second CVRF/CSS. The frequency of each individual CVRF/CSS in this cohort is shown on the right side. Specifically, b shows the prevalence proportion of individual CSS.

Table 1 Distribution of clinical signs and symptoms and clinical characteristics at hydroxyurea start.

The ANOVA scores for correlation with thrombosis were: 1.21, active smoking (IRR: 2.26%p-y); 1.15, hypertension (IRR: 1.83%p-y); 0.58, overweight (IRR: 1.93%p-y); 0.14, diabetes (IRR: 1.78%p-y) and 0.03, dyslipidemia (IRR: 1.79%p-y). In accordance with the findings of this test, patients with concomitant active smoking, hypertension and overweight were included in the “relevant CVRFs” category (ANOVA score: 4.73; p = 0.03). Overall, “relevant” CVRFs were present in 41 (5.5%) patients; in 15 of these patients, diabetes and/or dyslipidemia were also present.

At HU start, CSSs were present in 443 (60.0%) patients, more frequently in LR (n. 95, 69.3%) compared to HR patients (n. 348, 57.8%, p = 0.02), mainly due to higher presence of persistent/progressive leukocytosis (p = 0.006), progressive splenomegaly (p = 0.004) and severe itching (p = 0.002) (Table 1). Patients not presenting CSSs (296, 40.0%) were more frequently female (44.2% vs. 36.6%, p = 0.04), whereas age at HU start was comparable to CSSs cohort (66.7 ± 11.1 vs. 65.6 ± 11.6 years, p = 0.1).

The frequency of CSSs was comparable between HR-AGE (n. 239, 56.4%) and HR-THRO (n. 109, 61.2%) patients (p = 0.27). This was also observed in HR-THRO patients due to both older age and previous thrombosis (62.1%) and HR-THRO patients due to previous thrombosis only (58.7%, p = 0.69).

Overall, more than one CSS was present in 155 (out of 443, 35.0%) patients, mainly in low risk (43.2% vs. 32.8% HR, p = 0.05). The frequency of individual CSS and their inter-relationships are shown in Fig. 1b.

At HU start, median leukocytes and platelet counts, and mean spleen length BCM were significantly higher in LR compared to HR patients, with no significant differences between HR-AGE and HR-THRO patients.

The mean time from PV diagnosis to HU initiation in the CSS low-risk cohort was 3 years, with 22.1% of patients initiating HU more than 5 years after diagnosis. Conversely, all HR patients started HU soon after entering the high-risk category (mean: 2.3 months).

Median HU starting dose was comparable across the three risk categories, while LR patients more frequently received a maximum dose ≥1 g (Table 1). Anticoagulants were used more frequently in HR patients (12.3% vs. 4.4% LR, p = 0.007). Conversely antiplatelets, excluding patients receiving anticoagulants, were used comparably across risk categories (LR: 95.4% vs. HR: 98.8%, p = 0.21).

Additional reasons for starting HU were present in 92.9% of patients without CSSs. Distribution of additional reasons according to conventional risk categories is shown in Supplementary Table 1.

Thrombosis during hydroxyurea therapy

The median time on HU was 6.2 years (SD ± 4.3). During HU therapy, 72 patients had a thrombosis, for a total of 92 thrombotic events. Thromboses were equally distributed between arterial (n. 45, 48.9%) and venous (n. 47, 51.1%). Supplementary Table 2 details type and site of thrombotic events.

Globally, the IRR of thrombosis during HU was 2.02%p-y and significantly higher (3.0%p-y) in HR-THRO patients (Table 2). Among HR-THRO patients, patients older than 60 years had the highest IRR of thrombosis (3.5%p-y).

Table 2 Incidence ratio rate (IRR) per 100 patient-years of thromboses during treatment with hydroxyurea.

Thrombotic events in low-risk patients with CSSs and not receiving HU

To explore whether providing HU therapy in LR patients with CSSs may reduce the thrombotic risk, we evaluated an additional cohort of 44 LR patients who did not receive cytoreduction despite the presence of CSSs. Characteristics and thrombotic outcomes of these patients are reported in Supplementary Table 3.

Overall, in these 44 patients, 6 thrombotic events occurred, resulting in an IRR of thrombosis of 2.1%p-y. This figure was numerically higher than 1.6%p-y observed in low-risk patients with CSSs who received HU (Table 2).

Impact of clinical signs and symptoms on the incidence of thrombosis during hydroxyurea therapy

The IRR of thrombosis was estimated according to the presence of CSSs, without considering additional reasons for starting HU.

In the total cohort, CSSs were associated with a significantly increased IRR of thrombosis (2.2 vs. 0.7%p-y in patients without CSSs, p < 0.001). This was confirmed across all the risk categories (Table 2).

TFS at 5 years was 88.6% in patients with CSSs compared to 96.2% in those without CSSs (p < 0.001) (Fig. 2a). Examining the three risk categories separately, the TFS curve of patients with CSSs was always significantly worse than that of patients without CSSs (Fig. 2b–d). Across all risk categories, the best TFS at 5 years was observed in LR and HR-AGE (LR with no CSSs, 100%; HR-AGE with no CSSs: 98.1%). LR and HR-AGE patients with CSSs had comparable TFS than HR-THRO patients without CSSs (89.2%, 92.1% and 88.8%, respectively). HR-THRO patients with CSSs had the most dismal TFS (80.2%) (Supplementary Fig. 2).

Fig. 2: Thrombosis-free survival comparison between patients presenting at least one Clinical Sign and Symptom (CSS) (dashed lines) and patients not presenting CSSs (continuous lines).
figure 2

a Overall cohort; b low risk (LR); c high risk only for age (HR-AGE); d high risk for previous thrombosis (HR-THRO). HU hydroxyurea.

Prognostic impact of each individual clinical sign and symptom on the thrombotic risk

In univariate Cox analysis, history of thrombosis (p < 0.001), age ≥60 years (p = 0.04), extreme thrombocytosis (p = 0.01), progressive splenomegaly (p = 0.05), inadequate hematocrit control (p = 0.01), relevant CVRFs (p = 0.01) were significantly associated with a higher thrombotic risk.

In MVA, four factors maintained significant association with the thrombotic risk: inadequate hematocrit control (HR: 2.32, 95% CI: 1.45–3.72, p < 0.001); relevant CVRFs (HR: 2.87, 95% CI: 1.36–6.06, p = 0.006); progressive splenomegaly (HR: 4.02, 95% CI: 1.18–13.65, p = 0.03) and history of thrombosis (HR: 3.76, 95% CI: 2.32–6.10, p < 0.001) (Fig. 3).

Fig. 3: Multivariate analysis of clinical signs and symptoms to evaluate the impact on thrombotic risk in the overall patient cohort.
figure 3

CVRFs cardiovascular risk factors, HU hydroxyurea, HR hazard ratio, CI confidence interval. Red dots identify variable statistically significantly associated with a higher probability of thrombosis during hydroxyurea in both multivariate and univariate analysis; Yellow dots identify variable statistically significantly associated with a higher probability of thrombosis during hydroxyurea only in univariate analysis.

Discussion

HU is the primary cytoreductive agent used in the treatment of PV and plays a crucial role in the overall anti-thrombotic strategy. Its use significantly reduces the incidence of thrombotic events; however, a notable number of events still occur, with an incidence rate of 3.3% p-y, which increases with age [32]. The effectiveness of HU is greater in preventing initial or recurrent arterial events, but it has a higher failure rate in preventing venous events [31, 33, 34]. Therefore, it is essential to identify patient groups undergoing cytoreductive therapy who remain at an increased risk for thrombosis.

In our cohort, the overall incidence of thrombosis during HU treatment is lower than what was estimated in the aforementioned meta-analysis [32], likely due to the younger median age of our patients, which includes LR patients. However, the incidence of recurrent thrombosis in the HR-THRO group (3% p-y) is comparable to the rate reported in the meta-analysis (4.5% p-y), indicating that the patients we studied reflect real-world scenarios.

To ensure homogeneous therapeutic exposure and reduce variability in thrombotic risk factors, only patients treated with HU as first-line therapy were included. Indeed, interferon therapy is mainly used in younger patients and has been made available over the years in different formulations, each potentially providing different degrees of thrombotic protection [35,36,37,38]. As for ruxolitinib, in Italy this agent is only used as second-line therapy. Therefore, ruxolitinib-treated patients represent a different cohort, characterized by more advanced disease, in whom HU had been initiated earlier and had already failed [39].

It has been more than 2 years since the publication of the ELN recommendations on the relevance of specific clinical signs and symptoms to consider for earlier therapy in conventionally defined low-risk PV patients [16]. However, the strength of these recommendations remains weak due to a lack of specific data on the prognostic impact of such CSSs on the thrombotic risk.

Our results show that CSSs are common and significantly influence thrombotic risk, isolating a cohort of patients at further increased risk within each risk category. Additionally, three CSSs were more significantly associated with thrombosis: inadequate hematocrit control, relevant cardiovascular risk factors and progressive splenomegaly.

Elevated hematocrit is one of the major risk factors for thrombosis in patients with PV [3, 11]. In the CYTO-PV study, patients with hematocrit below 45% had a lower incidence of deaths from cardiovascular causes or major thrombotic events (1.1%p-y) than those with a target hematocrit of 45–50% (4.4%p-y) [40]. A higher risk of thrombosis has also been observed in the subset of HU-treated patients requiring ≥3 phlebotomies per year [41].

There is a significant association between major thromboembolic events and hypertension, diabetes, hyperlipidemia, obesity and smoking [6, 8,9,10, 13, 42,43,44,45]. Nevertheless, a consensus on which cardiovascular risk factors and/or combinations thereof should be designated as “relevant” in patients with PV remains elusive. As observed in this study, patients frequently exhibit multiple concomitant cardiovascular risk factors. These CVRFs often co-occur with other clinical signs and symptoms that correlate with the thrombotic risk, such as inadequate hematocrit control (Fig. 1b). Furthermore, the impact of individual CVRFs can sometimes be masked by the thrombotic risk associated with PV, complicating the identification of a single CVRF that is associated with a heightened thrombotic risk. We ultimately adopted a composite definition, requiring the concurrent presence of at least three CVRFs, namely smoke, hypertension, and overweight. This definition was derived from statistical associations with thrombotic risk and relative incidence ratio rates of thrombosis associated with each factor. From a biological perspective, this composite definition is more likely to reflect the synergistic effects of CVRFs. The potential for cumulative risk actions associated with the concurrent presence of two risk factors cannot be disregarded. However, this definition aligns with previous guidelines and may serve as a practical clinical guidance tool [16, 46]. Overall, these findings recommend that CVRFs in all PV patients should be adequately controlled and reinforce the necessity of a multidisciplinary approach [47].

We also observed a correlation between progressive splenomegaly and an increased risk of thrombosis. This correlation has been reported by previous groups [9] but not confirmed in other cohorts [48]. The assessment of spleen size by palpation may be inaccurate and/or not adequately recorded in medical charts. However, the ELN criteria focus exclusively on patients with persistent and progressive splenomegaly, whose clinical burden is unlikely to be overlooked, and recommend physical examination of the spleen, to enable its application in daily clinical practice. However, some patients with progressive but minimally symptomatic splenomegaly may have been excluded from this analysis. Overall, further investigation into the dynamic evaluation of the spleen may provide valuable insights for the management of PV.

Here, we could not confirm the association between thrombocytosis/leukocytosis and thrombosis. The association between the number of platelets and PV outcomes is controversial, with some studies [6, 49], but not others [31], reporting an association between higher platelet count and thrombosis. While in the ECLAP and in the CYTO-PV studies an association between leukocytosis and vascular events was noted [6, 49], hyperleukocytosis at diagnosis or during follow-up was not found to correlate with thrombotic risk in other studies [50, 51]. Neutrophil-to-lymphocyte ratio (NLR) values ≥ 5 was independently associated with an increased risk of venous thrombosis, which suggests that NLR, rather than leukocyte count alone, may be a valuable prognostic biomarker for thrombosis in PV and should be further investigated [15, 52].

The JAK2V617F variant allele frequency is also a significant prognostic factor in PV. Nonetheless, the 2024 and 2025 National Comprehensive Cancer Network (NCCN) guidelines, as well as the ELN recommendations, do not yet include molecular evaluation as a criterion for initiating cytoreductive therapy [16, 53, 54]. In this context, the results of this study remain valid and valuable. However, the collection of molecular data would provide valuable additional insights and should be included in future research agendas of cooperative groups.

Our study confirmed the pivotal role of thrombotic history [6, 8,9,10, 42, 55]. The loss of statistical significance of older age in multivariable analysis should not imply that age is not an independent risk factor for thrombosis. Indeed, the IRRs of thrombosis demonstrated a linear correlation between thrombotic risk and age. It is notable that the IRR for low-risk patients (aged under 60 with no previous history of thrombosis) is 1.1%p-y, while the IRR for high-risk patients aged 60 or over (but with no previous thrombosis) is higher at 1.3%p-y. This incidence is comparable to that of patients at high risk for a previous thrombotic history (but aged under 60 years), who have an IRR of 1.9%p-y. The simultaneous presence of both risk factors identifies the highest risk category, with an IRR of 3.5%p-y. The results are comparable to those of the ECLAP study, which found that the IRRs of thrombosis were similar in older age groups without thrombosis (4.9%p-y) and in younger age groups with thrombosis (5.0%p-y) [55].

Finally, the IRR of thrombosis in low-risk patients with CSSs but who did not receive HU showed that lack of cytoreduction was associated with a numerically higher rate of thrombosis. This finding may further support the relevance of CSSs in clinical decision making but should be prospectively confirmed. Indeed, in the present study the decision to start HU was at the discretion of the treating hematologist and occurred at different follow-up times. As a result, the exposure to CSSs was not uniform across patients’ cohorts, potentially confounding their thrombotic risk.

Intrinsic limitations of the present study are due to its retrospective nature and include uncontrolled patient adherence to HU therapy, consistency in the assessment of CSSs, and reporting of thrombotic events, which may vary and affect the generalizability of the results.

In conclusion, CSSs may identify an increased risk phenotype across all thrombotic risk categories and may be actionable markers that can inform more aggressive and personalized management strategies.