Introduction

Early-interim positron emission tomography/computerized tomography (PET/CT) scan, performed after the first two cycles of chemotherapy (PET-2), is nowadays widely accepted as a prognostic tool in advanced stage (AS) classical Hodgkin lymphoma (cHL), thus confirming the role of an early chemosensitivity assessment as surrogate of the final treatment outcome. Several prospective trials have confirmed that a response–adapted approach, using PET-2 to select patients who might benefit from intensifying or de-escalating therapy, is safe and results in improved progression-free survival (PFS) or reduced toxicity compared to results obtained in historical controls with six cycles of ABVD (doxorubicin hydrochloride [Adriamycin], bleomycin sulfate, vinblastine sulfate, dacarbazine) or six to eight cycles of escalated BEACOPP (bleomycin, etoposide, doxorubicin hydrochloride [Adriamycin], cyclophosphamide, vincristine [Oncovin], procarbazine, prednisone) (eBEACOPP) [1,2,3,4,5,6,7,8,9,10,11,12].

PET positivity after the first two cycles of ABVD (PET-2+), defined as a score of 4 or 5 on the five-point Deauville scale (DS) [13, 14], is highly predictive for refractory outcomes when continuing with ABVD [15]. The early treatment intensification with eBEACOPP, as tested in the RATHL, SWOG S0816, and GITIL/FIL HD0607 trials, proved able to overcome the ABVD chemoresistance in PET-2 positive patients, resulting in PFS of 60% to 67.5% at 2 or 3 years [5, 6, 8].

Patients with PET-2 DS 5 represented no more than 3%–6% of the entire patient population enrolled in these trials and, due to small numbers, to date no clinical characteristic nor biological marker predictive of response has been identified.

To define the overall outcome and identify possible predictive markers in this patient subset, we performed a pooled analysis of individual patient data of the three multicenter studies, RATHL, SWOG S0816, and GITIL/FIL HD0607, focusing on patients with PET-2 DS 5 who had intensified treatment with eBEACOPP or BEACOPP-14 after two ABVD courses. The primary aim of the study was to define both PFS following treatment intensification and the overall life expectancy of this cohort of high-risk patients. Moreover, we retrospectively evaluated new and old predictive/prognostic factors recorded at baseline, in order to identify possible clinical and laboratory characteristics able to early predict response to salvage treatment and the overall long-term outcome in this small patient subset.

Patients and methods

Patients

Individual patient data from the databases of three multicenter trials, GITIL/FIL HD0607 (n = 782), RATHL (n = 1203), and SWOG S0816 trial (n = 336), evaluating PET-adapted treatment in AS cHL, were analysed. All PET-2 DS 5 patients were included in this study: 58 patients from RATHL, 29 patients from SWOG and 49 patients from HD0607 trials. Patient demographics along with the classic prognostic factors at baseline (i.e., date of birth, sex, date of diagnosis, stage, IPS, bulky disease, number of involved nodal and extranodal sites, leukocyte, lymphocyte, neutrophil, monocyte counts, hemoglobin values) were collected. Among new prognostic factors total metabolic tumor volume (TMTV) and Total lesion glycolysis (TLG) were assessed. Furthermore, date of diagnosis, date of ABVD start, date of PET-2- DS 5, date of treatment intensification (eBEACOPP(BEACOPP-14) start, date and results of PET-3 according to DS, date of therapy change (if any), information on subsequent therapies, date of ASCT(s) and allo-SCT(if performed), date of CMR achievement, based on the Lugano 2014 criteria [16], date of progression or relapse, date and cause of death, date and type of second malignancy (if any), date of last follow-up and status at last FU were collected.

Treatment

In the three studies, patients received 2 cycles of ABVD followed by early-interim PET scan (PET-2). In the RATHL trial, PET-2+ patients received either three cycles of eBEACOPP or four cycles of BEACOPP-14, chosen in advance by each treatment center. Thereafter, patients were reassessed with a third PET-CT scan (PET-3) and patients with a negative PET-3 received one more cycle of eBEACOPP or two further cycles of BEACOPP-14. Patients with a positive PET-3 underwent further salvage treatment in accordance with local protocols [5].

In SWOG S0816, PET-2+ patients received 6 cycles of eBEACOPP [7].

In GITIL/FIL HD0607, 4-eBEACOPP plus 4-BEACOPP baseline with or without Rituximab were administered to PET-2+ patients [10].

All patients provided written, informed consent before enrolment in each study. Data use agreements were made between the RATHL Network, the SWOG Cancer Research Network, the HD0607 Network and investigators at the European Institute of Oncology; the study was approved by the Ethics Committees of the participating centers and was done in accordance with the Declaration of Helsinki and the International Conference on Harmonization Guidelines for Good Clinical Practice.

PET imaging and calculation of TMTV

PET/CT scans at diagnosis and after two ABVD cycles were acquired on accredited scanners in all the three trials as previously described [5, 7, 10]. All PET/CT scans were centrally reviewed during these trials. The scans of the RATHL trial were preliminary reviewed because the cutoff value used to distinguish a score 4 from a score 5 was different in the original RATHL publication compared to SWOG S0816 and HD0607: in the former the cutoff value was set to 300% of the standardized maximum liver uptake value (SUVMax) while in the two latter studies the threshold was set to 200% of the liver SUVMax. For the purpose of the present study DS 5 was assigned to lesions having uptake at least twofold higher than the maximum liver uptake and/or the appearance of new lesions compatible with lymphoma. For the RATHL and HD0607 trials the data were analysed also according to the definition of DS 5 scored more than threefold the maximum liver uptake and/or the appearance of new lesions. For this project, three nuclear medicine physician(s) (SB, SH, LG) additionally reviewed the scans, blinded to patient history, clinical data, and treatment outcome, identifying the location of the most active lesion in PET-2 (reference lesion) that was visually scored according to DS as described elsewhere [13].

TMTV was segmented on PET/CT images using a fixed standardized uptake value of 4.0 using the recently published benchmark method [17]. First, automated segmentation was performed using the software, then, if necessary, manual addition of lesions and editing to subtract physiological uptake was undertaken.

Only focal extranodal and splenic lesions were included in the volume of interest, whereas diffuse increase in 18F-FDG uptake in the spleen and/or bone marrow was excluded. The TMTV was obtained by summing the metabolic volumes of all nodal and extranodal lesions. TLG was calculated as the product of TMTV and average SUV in the contoured lesions.

Study objectives

The primary objectives were to evaluate PFS and OS in AS cHL patients with a PET-2 DS 5 in a pooled database analysis of all three multicenter studies, comparing patients who progressed (non-responders) after intensified treatment with eBEACOPP or BEACOPP-14, with those who did not progress or relapse (responders). Secondary objectives were to retrospectively evaluate potential predictors of treatment outcome, including patient characteristics, histology, disease extent, international prognostic score, laboratory parameters, PET TMTV and TLG, at diagnosis, comparing non-responders to responders.

Statistics

Descriptive statistics were used. Differences in categorical parameters were evaluated using chi-squared of Fisher’s exact test, as appropriate. Differences in continuous parameters were evaluated using Mann–Whitney U test. Receiver operating characteristic curve (ROC) analyses were performed to evaluate the prognostic value of TMTV and TLG. The analysis of differences in TMTV and TLG among the three trials were evaluated by Wilcoxon pairwise comparisons.

PFS was measured from the date of PET-2 until the date of disease progression, disease relapse after CR, or lack of CR at the end of per protocol therapy, or death from any cause. If none of these events occurred, patients were censored at the date of information on HL status at last follow-up.

Duration of CR (DCR) was measured from the date of CR or CMR achievement to the date of disease progression, relapse, or death from any cause. If none of these events occurred, DCR was calculated as the date of information on HL-free status at last follow-up.

OS was measured from the date of PET-2 until the date of death of any cause. Patients alive were censored based on the date of last information on survival status.

Survival outcomes were analyzed using the Kaplan-Meier method and log-rank test. The association between factors with PFS or OS events was evaluated using univariate Cox proportional hazards regression models. Statistical significance was set at a two-tailored P value of 0.05. All analyses were performed with R software, version 4.3.0.

Results

From 2321 patients recruited in the three trials with available baseline and PET-2 scans, 136 patients (6%) with PET-2 DS 5 were included in this study.

Patient demographics and characteristics, laboratory parameters, TMTV, and TLG values at diagnosis are detailed in Table 1.

Table 1 Characteristics of PET-2 DS 5 cohort at diagnosis: Data of a pooled analysis from three multicenter trials (RATHL, SWOG S0816, and GITIL/FIL HD0607).

Since the SWOG trial only recruited patients with stage III or IV, whereas RATHL and HD0607 trials included also stage II with adverse features, the following differences in patient characteristics emerged among the trials: stage III and IPS > 3 were significantly more frequent by study design, median hemoglobin level was lower and TMTV and TLG values were higher in the patients enrolled on the SWOG S0816 trial compared to RATHL and HD0607 subgroups. B symptoms were more frequently reported in patients enrolled on the HD0607 trial.

Overall, 56 out of 136 PET-2 DS 5 patients (41.2%) achieved a metabolic complete remission (CMR) with a negative PET at the end of treatment intensification (EOT-PET) with eBEACOPP or BEACOPP-14, 32 patients had partial remission (23.5%), 5 patients (3.7%) had stable disease, 31 (22.8%) progressed. One toxic death, due to septic shock following Klebsiella pneumoniae, occurred. Data of response to intensified treatment were not available in 7 patients (5.1%) and 4 patients (2.9%) were withdrawn after PET-2 and did not receive intensified treatment due to patient or principal investigator decisions.

As shown in Fig. 1A, B, after a median follow-up of 41 months (range, 1–36), the 3-year PFS and OS were 32% (95%CI, 25–42) and 82% (95%CI, 75–89), respectively. The 3-year PFS and OS analysed in 106 patients of the RATHL and HD0607 trials according to the definition of DS 5 scored >300% the maximum liver uptake, were 31% and 79%, respectively, without significant differences compared to DS 5 scored 200–300% of the liver, as shown in Supplementary Fig. 1A, B. Most patients (96%) who achieved a CR and did not relapse within 2 years, had a prolonged PFS (Supplementary Fig. 2). The 3-year DCR for intensified treatment was 70% (95% CI, 58–84) (Fig. 2). Patients who achieved CR with intensified treatment (n = 56) had a longer 3-year OS compared to those not in CR (n = 69) (91% vs 74%, P = 0.0073) (Fig. 3). No factor at baseline was associated with the probability of achieving CR after intensified treatment as shown in Supplementary Tables 1 and 2.

Fig. 1: 3-year Survival from the date of PET-2 positive DS 5.
figure 1

A 3-year Progression-free Survival B 3-year Overall Survival.

Fig. 2
figure 2

Duration of complete remission after intensified treatment with eBEACOPP or BEACOPP-14.

Fig. 3
figure 3

3-year overall survival related to achievement of CMR (dotted line) or not CMR (continuous line) after intensified treatment with eBEACOPP or BEACOPP-14.

Patients progressing after BEACOPP-like intensification regimens were not homogeneously treated and missing data for most patients prevented a detail description of salvage treatments received.

Overall, in 136 PET-2 DS 5 patients, seven second malignancies occurred, after a median of 34 months (range, 11–61) from the date of PET-2: one medullary thyroid carcinoma, one kidney carcinoma, one melanoma, one nasopharyngeal carcinoma, one non-small cell lung cancer, one anaplastic large cell lymphoma (ALCL) and one acute lymphoblastic leukemia. All second cancers were reported in patients who achieved CR after intensified treatment; six patients were alive after a median of 28 months (range, 0–85) from the diagnosis of second cancer, while one patient died 10 months after the diagnosis of ALCL.

As detailed in Table 2, in univariate analysis low lymphocyte count (<600/μL) was significantly associated with poor PFS (<600/μL vs ≥ 600/μL: HR 3.19; P = 0.0007) as well as absence of marked leukocytosis (<15 × 103/μL vs ≥ 15 × 103/μL: HR 0.63; P = 0.0385). On the other hand, age ≥ 45 years, leukocyte cells count <15 × 103/μL, and low monocyte (<0.5 × 103/μL) counts significantly shortened OS. As shown in Table 3, in multivariable analysis low lymphocyte count was an independent negative prognostic factor for PFS, whereas age ≥ 45 years and leukocyte cells count <15 × 103/μL adversely affected OS.

Table 2 Progression-Free Survival and Overall Survival, calculated from the date of PET-2, by patient characteristics at diagnosis relapse or progression and by response to salvage therapy: Univariable analysis.
Table 3 Multivariable analysis of factors predicting PET-2 DS 5 patient outcome.

Overall, the median TMTV4 at baseline was 243 cm3 (range, 1.6–4266) and baseline median TLG 4 was 1476 cm3 (range, 7.3–29386) for the whole patient population. However, a significant difference in these values emerged among the three trials, as shown in Supplementary Table 3, with higher TMTV and TLG values in patients included in the SWOG trial, who had more advanced disease (stage III or IV). The ROC curve analysis of the prognostic performance of TMTV 4 and TLG for 3-year PFS showed an AUC of 0.41 and 0.40, respectively (Supplementary Fig. 3).

Discussion

In this retrospective study, we report survival outcomes for 136 out of 2231 (6%) patients with AS cHL enrolled in three multicenter studies, RATHL, SWOG S0816, and GITIL/FIL HD0607 GITIL/FIL, who had a positive PET DS 5 after two ABVD cycles. The results of the pooled analysis confirm the unsatisfactory response of these patients to intensified treatment with eBEACOPP or BEACOPP-14, with a 3-year PFS of 32% (95% CI, 25–42). Nevertheless, the 3-year OS was 82% (95% CI, 75–89), due to effective further salvage treatments.

In univariate analysis, low lymphocyte count (<600/μL) and leukocyte cells count <15 × 103/μL measured at diagnosis were able to detect refractory outcomes to intensification therapy whereas age ≥ 45 years and leukocyte cells count <15 × 103/μL were significantly associated with short OS. The unfavorable prognostic value of leukocyte cells count <15 × 103/μL is an unexpected finding. However, it is confirmed for OS at the multivariable analysis and seems a peculiar feature for this cHL subset.

PET-2 DS 5 patients behave quite similarly to patients with primary refractory disease, defined as progression or non-response during induction treatment or within 90 days of completing treatment. Indeed, PET-2 DS 5 patients have a high likelihood of a PFS event, and they frequently require several subsequent therapies, including autologous stem cell transplantation or even allogeneic transplantation. The long-term iatrogenic effects in survivors, in particular therapy-related myelodysplastic syndromes or acute leukemias and solid second malignancies are well documented [18].

Early interim PET demonstrated prognostic relevance in all trials that have tested PET-adapted treatment strategies in AS HL [5,6,7,8,9,10,11,12]. The assessment of TMTV measured on baseline PET scans has been shown to be an independent prognostic factor in a post-hoc analysis of the EORTC H10 trial in early-stage HL: 5-year PFS was 71% in the high-TMTV (>147 cm3) group vs 92% in the low-TMTV group (≤147 cm3) segmented using a threshold of 41% of the maximum SUV [19]. Furthermore, in a retrospective study on 115 patients with stage IIB, III, and IV HL treated with ABVD, TMTV was predictive of overall 5-year PFS (p = 0.0079) with an HR of 2.29 but only in the subgroup of patients who received ABVD and not in the group who received eBEACOPP [20]. Among 65 transplant-eligible patients with relapsed/refractory HL, baseline MTV and refractory disease predicted outcome, with 3-year PFS of 100% in patients with low MTV (<109.5 cm3) and relapsed disease [21]. However, the results of our study show that baseline PET parameters of TMTV and TLG did not provide additional predictive value in patients with a positive PET DS 5 after two cycles of ABVD chemotherapy who have inferior outcomes even after intensified conventional chemotherapy with BEACOPP-based regimens, and who still represent an unmet clinical need.

New biomarkers at diagnosis are warranted to better prognosticate outcome and personalize treatment for this cohort of patients. The evaluation of the difference between baseline and PET-2 TMTV/TLG, or the combination of TMTV assessment with other sensitive biomarkers, such as circulating tumor DNA [22], so called liquid biopsy, serum thymus and activation-regulated chemokine levels [23], Epstein-Barr viral whole blood or plasma load [24], small micro-RNA (miRs) contained in extracellular vesicles [25], should be evaluated to improve the accuracy of identification of those low-risk PET-2 DS 5 patients for whom escalating therapy with eBEACOPP is safe.

We are aware that our study has some limitations, the most relevant of which was the inhability to evaluate gene expression profiling of biopsies at diagnosis of PET-2 DS 5 cohort, as well as the lack of PET-2 TMTV and TLG evaluation, which both could refine the ability to be predict response to intensification to BEACOPP-based schedules.

To our knowledge, this is the largest cohort of PET-2 DS 5 patients reported so far. The results of our study, evaluating data from three large randomized international trials, may contribute to increase our insight on characteristics, risk factors and outcome of the cohort of patients with an interim PET scored DS 5 after 2 ABVD cycles.

Patients progressing after BEACOPP-like treatment intensification underwent one or more lines of multiple salvage therapies, ranging from intensified chemotherapy with or without autograft, allogeneic hematopoietic transplantation, brentuximab vedotin, anti-PD1 monoclonal antibodies. The high heterogeneity along with missing data for most patients prevent to analyze in detail all salvage treatments delivered. However, we can speculate that the remarkable high OS rate may be related to the use of new drugs in the salvage setting that became available in recent years.

The landscape of first-line and salvage treatment for AS and relapsed or refractory cHL has changed with the availability of effective new drugs, such as anti-CD30 antibody-drug conjugate brentuximab vedotin (BV) and immune checkpoint inhibitors (ICIs), nivolumab, and pembrolizumab. In the Echelon-1 trial, treatment of stage III-IV cHL with the combination of doxorubicin, vinblastine, dacarbazine, and brentuximab vedotin (A-AVD) yielded superior results compared to ABVD in terms of 6-year modified PFS (82.3% vs 74.5%) and OS (93.9% vs 89.4%) [26]. Although the majority (89%) of patients treated with A-AVD had a negative PET-2, patients with a positive PET-2 had a 6-year PFS of 61%. PET-2 positive patients in this trial were not escalated to eBEACOPP, although those with a PET-2 DS 5 could switch to alternative therapy at the treating physician’s discretion.

The BrECADD regimen, developed by the GHSG with the addition of BV, to the backbone of doxorubicin, etoposide, and cyclophosphamide, omitting vincristine/bleomycin, replacing procarbazine with dacarbazine, and substituting prednisone with a short course of dexamethasone, has shown improved efficacy and tolerability when compared to eBEACOPP, as reported in 1482 AS cHL patients aged 18–60 years [11]. At 4 years, OS was superimposable for the two treatment groups (98.6% vs 98.2%), with higher PFS 94.3% vs 90.9% (HR 0.66, 95% CI, 0.45–0.97; p = 0.035), even in PET-2 positive patients (90.3% vs 87.8%). Furthermore, treatment-related morbidity was significantly lower and gonadal function recovery was more common with BrECADD compared to eBEACOPP.

The SWOG S1826 trial randomly comparing AVD (doxorubicin, vinblastine, dacarbazine) in combination with Nivolumab (N-AVD) with A-AVD in patients with stage III or IV cHL, aged 12 years or older, showed superior 2-year PFS with N-AVD (92% vs 83%; HR 0.45; 95%CI, 030–0.65; P < 0.001), with superimposable OS (99% vs 98%) and numerically fewer deaths (7 vs 14), albeit with a short median follow-up of 2.1 years months but this study was not PET-adapted [27]. Lynch et al. documented in a small single-arm study of concurrent pembrolizumab with AVD (doxorubicin, vinblastine, and dacarbazine; APVD) a 2-year PFS of 97% and OS of 100% in 30 untreated cHL patients, despite a negative PET-2 rate of only 57%, probably due to an unspecific FDG uptake due to a restored immunity of the patients [28]. Other studies evaluating PET responses after single agent checkpoint blockade in patients with newly diagnosed Hodgkin lymphoma failed to demonstrate the utility of PET as a robust biomarker in this situation [29, 30].

Transient immunologic flares with increase in FDG-uptake, which can resemble disease progressions, have been reported after treatment with ICIs and new response criteria, for which the so-called Lymphoma Response to Immunomodulatory Therapy Criteria were developed [31].

The recently updated NCCN guidelines still recommend, in patients with AS cHL for whom BV or ICIs are not available or contraindicated, a PET-adapted strategy with ABVD as the preferred initial treatment with the shift to BrECADD in PET-2+ patients [32]. Given the low PFS rates reported with chemotherapy only approaches, such as eBEACOPP or BEACOPP-14, the shift to BrECADD or the addition of ICIs to conventional chemotherapy, if not introduced in the front-line treatment, should be a future focus for patients with Hodgkin lymphoma and DS5 on PET-2.

Future studies should incorporate novel methodologies like this for review of imaging studies in Hodgkin lymphoma, as well as explore novel parameters on PET imaging, including metabolic tumor volume and disease dissemination, rather than only Deauville scores [33,34,35].

The efficacy of PD-1 blockade, if not introduced in front-line treatment, should be explored in the early salvage treatment of PET-2 DS 5 patients. Given the low PFS rates reported with intensified chemotherapy approaches, such as eBEACOPP or BEACOPP-14, and the lack of clear-cut baseline prognostic factors that add to the predictive value of interim PET, the addition of ICIs to conventional chemotherapy should be a future focus for patients with Hodgkin lymphoma and DS5 on PET-2.

Conclusion

cHL patients with PET-2 DS 5 after ABVD have similar outcomes to patients with refractory disease, with only 32% PFS at 3 years. Low lymphocyte count measured at diagnosis is the only parameter able to detect the later failures within the DS 5 cohort in three large phase III randomized controlled trials. Nevertheless, patients with DS-5 PET-2 can benefit from a prolonged OS, albeit at the cost of multiple subsequent treatments, including autologous and eventually allogeneic transplant. Based on the excellent results in relapsed or refractory cHL with new agents, brentuximab vedotin and PD-1 inhibitors-containing salvage regimens, the early introduction when DS 5 at PET-2 is documented, should be evaluated to improve the cure rate while sparing additional chemotherapeutic treatments, known to have low therapeutic efficacy along with long-term toxicity.