Introduction

Allogeneic hematopoietic cell transplantation (HCT) is the preferred post-remission therapy for adult patients with acute lymphoblastic leukemia (ALL) in first complete remission (CR1) who have a reduced likelihood of long-term survival using conventional chemotherapy alone [1, 2]. Transplant outcomes in these patients are influenced by various pre-transplant factors, including patient and disease characteristics, as well as the effects of prior treatments on transplant-related mortality and relapse rates [3, 4]. These factors are crucial for optimizing transplant strategies. However, the impact of the number of induction cycles required to achieve CR on transplant outcomes in ALL remains unclear.

This study aimed to evaluate how the number of induction cycles to obtain CR1 influences outcomes in adult ALL patients undergoing allogeneic HCT. Using data from the European Society for Blood and Marrow Transplantation (EBMT) registry, we analyzed patients with ALL in CR1 who received HCT, focusing on the relationship between induction cycle count on post-transplant outcomes.

Patients and methods

Study design and data source

This is a retrospective, registry-based analysis conducted on behalf of the Acute Leukemia Working Party (ALWP) of the EBMT. The EBMT is a voluntary working group of more than 650 transplantation centers, all of which are required to report all consecutive stem cell transplantations and follow-up data annually. The EBMT registry maintains an internal quality control program to ensure data accuracy and consistency, with regular audits performed to address missing or incorrect data and to prompt follow-up. All transplantation centers are required to obtain written informed consent before data submission to the EBMT, in accordance with the 1975 Declaration of Helsinki. The ALWP of the EBMT approved this study.

Inclusion criteria

The study included all adult patients (aged ≥18 years) with a diagnosis of ALL who underwent their first allogeneic HCT from any donor between January 2000 and December 2022, as reported to the EBMT registry. For this analysis, we focused on transplants performed in CR1 after one, two, and three or more induction therapies. To maintain a more homogeneous study population undergoing standard allogeneic HCT, we excluded patients who had received cord blood transplantation or ex vivo T-cell-depleted grafts.

Endpoints and definitions

The primary endpoint was leukemia-free survival (LFS) after one, two, and three or more induction therapies. Secondary endpoints were acute GVHD (aGVHD) and chronic GVHD (cGVHD), relapse incidence, non-relapse mortality (NRM), GVHD-free, relapse-free survival (GRFS), and overall survival (OS) within the same subgroups and the analysis of risk factors for each outcome.

Neutrophil recovery was defined as the first day of an absolute neutrophil count of 0.5 ×109/L lasting for ≥3 consecutive days. aGVHD and cGVHD were defined and graded according to standard criteria [5, 6]. Relapse was defined as disease recurrence and appearance of blasts in the peripheral blood or bone marrow (>5%), or by clinical and/or radiologic confirmation of leukemic involvement at extramedullary sites after achieving CR. LFS was calculated until the date of first relapse, death from any cause, or the last follow-up. NRM was defined as death from any cause other than relapse. The composite endpoint GRFS was defined as survival without the following events: stage III–IV aGVHD, severe cGVHD, disease relapse, or death from any cause after HCT [7]. Myeloablative conditioning (MAC) was defined as a regimen containing either total body irradiation (TBI) with a dose >6 Gy, a total dose of oral busulfan >8 mg/kg, or a total dose of intravenous busulfan >6.4 mg/kg [8]. All other regimen intensities were defined as reported by the centers.

Statistical analysis

For univariable survival analysis, the Kaplan-Meier method were used to calculate OS, LFS, and GRFS. Cumulative incidence functions were used to estimate relapse incidence, NRM, aGVHD and cGvHD. Competing risks were death for relapse incidence and relapse for NRM, and relapse or death for aGvHD and cGvHD. Survival probabilities are given at 2 years as percentages and 95% confidence intervals (CIs).

For multivariable analysis, a Cox proportional hazards model was performed including the following variables for adjustment: number of induction cycles to CR1, Karnofsky performance status score, age of the patient at transplant (per 10 years), year of transplantation (per 5 years), reduced intensity conditioning (RIC) or MAC regimen, female donor to male patient combinations, cell source (bone marrow or peripheral blood) and type of ALL. To take into account the heterogeneity in the effect of a characteristic or a treatment across centers, we introduced a random effect into the Cox multivariate models. Hazard ratios (HR) were calculated together with corresponding 95% confidence intervals (95% CI).

The significance level was fixed at 0.05, and P-values were two-sided. P-values for secondary endpoints should be cautiously interpreted due to multiple comparisons. Statistical analyses and adjusted survival curves were performed using the R statistical software version 4.2.3 (R Foundation for Statistical Computing, Austria, Vienna; available online at http://www.R-project.org).

Results

Patient and transplant characteristics

Patient, disease, and transplant characteristics for the overall study population, as well as according to the number of induction cycles, are summarized in Table 1. A total of 2444 patients were included in the study, with 2038 transplanted after one induction cycle, 296 after two cycles, and 110 after three or more cycles. Among the latter group, 15 patients received four induction cycles and 12 patients received five. The median patient age was 40 years (range 18–73), and 58% were male. ALL was of B-cell origin in 1931 (79%) patients and 948 (39%) were Philadelphia chromosome (Ph) positive, 672 (27%) Ph negative and the Ph data were missing for 311 patients. Matched sibling donors were the most common donor type (n = 1035, 47%), followed by matched unrelated donors (n = 730, 33%) and haploidentical donors (n = 172, 8%). Peripheral blood was the stem cell source in 1734 (71%) patients. Regarding conditioning regimens, this was TBI-based in 1806 (74%) and 2007 (82%) received MAC. Anti-thymocyte globulin serotherapy for in vivo T-cell depletion was administered to 1089 (45%) patients. Most patients (n = 1460, 60%) received GVHD prophylaxis with a calcineurin inhibitor plus methotrexate.

Table 1 Patient and transplant characteristics in the whole cohort and stratified by induction cycles.

Patients who achieved CR1 after one induction cycle had a higher proportion of Ph positive B-ALL (p < 0.001), while those requiring three or more cycles were more often transplanted in recent years (p < 0.001) and more frequently received peripheral blood grafts (p = 0.032). No significant differences were observed in patient characteristics, donor type, conditioning regimen, or GHVD prophylaxis between the three cohorts.

Engraftment

The cumulative incidence of neutrophil recovery at 30 days was 96% (95% CI 95–96) for the single-induction cohort, 95% (95% CI 92–97) for the two-induction cohort, and 95% (95% CI 88–97) for the cohort receiving three or more cycles (Table 2). The 60–day cumulative incidence of platelet recovery in similar order was 94% (95% CI, 93–95), 93% (89–95), and 93% (95% CI, 86–97) (Table 2).

Table 2 Univariable analysis of transplant outcomes according to number of induction cycles.

GVHD

The cumulative incidence of aGVHD grades II–IV and III-IV at 180 days was 36% (95% CI; 34–39) and 13% (95% CI; 11–14) for the single-induction cohort, 31% (95% CI; 26–36) and 10% (95% CI; 7–14) for the two-induction cohort, and 36% (95% CI; 27–45) and 12% (95% CI; 6–19) for the cohort receiving three or more cycles (Table 2). In the multivariable analysis (Table 3), more recent transplants were associated with a reduced risk of aGVHD grades II–IV (HR 0.9; 95% CI, 0.83–0.97; p = 0.008) and grades III-IV (HR 0.87; 95% CI, 0.76–0.99; p = 0.04). The use of MAC-TBI was associated with an increased risk of aGVHD grades II–IV (HR 1.53; 95% CI, 1.18–1.99; p = 0.001) compared to RIC.

Table 3 Multivariate analysis of transplant outcomes.

The 2-year cumulative incidence of cGVHD was 41% (95% CI; 38–43) for the single-induction cohort, 40% (95% CI; 34–46) for the two-induction cohort, and 42% (95% CI; 31–52) for the cohort receiving three or more cycles (Table 2). The cumulative incidence of extensive cGVHD was 19% (95% CI; 17–21) for the single-induction cohort, 17% (95% CI; 12–22) for the two-induction cohort, and 20% (95% CI; 12–29) for the three or more cycles cohort (Table 2). In the multivariable analysis (Table 3), factors associated with a higher risk of cGVHD included female donor to male recipient transplants (HR 1.35; 95% CI, 1.12–1.63; p = 0.001), increasing patient age per decade (HR 1.11; 95% CI, 1.04–1.19; p = 0.002), and the use of MAC-TBI (HR 1.31; 95% CI, 1.03–1.67; p = 0.026). The use of peripheral blood as the graft source was associated with an increased risk of both cGVHD (HR 1.36; 95% CI, 1.11–1.66; p = 0.003) and extensive cGVHD (HR 1.65; 95% CI, 1.21–2.25; p = 0.002). Conversely, earlier year of transplantation (per 5-year period) was associated with a lower risk of cGVHD (HR 0.78; 95% CI, 0.72–0.85; p < 0.001) and extensive cGVHD (HR 0.87; 95% CI, 0.78–0.98; p = 0.02).

Relapse

The median time to relapse was 7.1 months (interquartile range [IQR] 3.8–13). The cumulative incidence of relapse at 2 years was 23% (95% CI; 21–25) for patients who achieved CR1 after one induction cycle, 31% (95% CI; 26–37) for those after two cycles, and 32% (95% CI; 23–42) for those after three or more cycles (Table 2) (Fig. 1). In multivariable analysis (Table 3), compared to achieving CR1 after one induction cycle, both two cycles (HR 1.45, 95% CI, 1.11–1.91; p = 0.007) and three or more cycles (HR 1.64, 95% CI, 1.06–2.52; p = 0.025) were associated with a higher risk of relapse. Other factors associated with a higher risk of relapse included earlier year of transplantation per 5 years (HR 0.84, 95% CI, 0.76–0.92; p < 0.001) and use of RIC rather than MAC-TBI (HR 0.48, 95% CI, 0.37–0.63; p < 0.001).

Fig. 1: Cumulative incidence of relapse by number of induction cycles.
figure 1

In yellow patients who achieved CR1 after one induction cycle, in blue for those after two cycles, and in pink for those after three or more cycles.

NRM and causes of death

The 2-year cumulative incidence of NRM was 17% (95% CI; 15–19) in the single-induction cohort, 18% (95% CI; 14–23) in the two-induction cohort, and 16% (95% CI; 9–24) in the cohort receiving three or more cycles (Table 2) (Fig. 2). In multivariable analysis (Table 3), factors associated with increased NRM included older patient age (per 10-year increase, HR 1.31; 95% CI 1.18–1.46; p < 0.001), earlier year of transplantation (per 5-year period, HR 0.69, 95% CI, 0.61–0.78; p < 0.001), use of MAC-TBI (HR 1.62, 95% CI, 1.13–2.34; p = 0.009), and use of MAC-chemotherapy (HR 1.73, 95% CI, 1.11–2.69; p = 0.015).

Fig. 2: Cumulative incidence of non-relapse mortality by number of induction cycles.
figure 2

In yellow patients who achieved CR1 after one induction cycle, in blue for those after two cycles, and in pink for those after three or more cycles.

At the last follow-up, 965 (39.5%) patients had died, of whom 502 (52%) were due to a variety of non-relapse causes, distributed as follows: 415 (53%) in the single-induction cohort, 68 (47%) in the two-induction cohort, and 19 (43%) in the cohort receiving three or more cycles. The main causes of transplant-related deaths were infections and GVHD, accounting for 290 (70%) in the single-induction cohort, 46 (68%) in the two-induction cohort, and 15 (79%) in the cohort receiving three or more cycles, results not shown.

Survival

For the entire cohort, the 2-year LFS, OS, and GRFS were 58% (95% CI 56–60), 67% (95% CI 65–69), and 40% (95% CI 39–43), respectively.

In the single-induction cohort, the 2-year LFS was 60% (95% CI; 57–62), compared to 51% (95% CI; 45–57) in the two-induction cohort and 52% (95% CI; 41–62) in patients receiving three or more cycles (Table 2) (Fig. 3). Multivariate analysis (Table 3) identified that patients undergoing two cycles (HR 1.35, 95% CI,1.09–1.68, p = 0.007) or three or more cycles (HR 1.67, 95% CI,1.18–2.35, p = 0.004) had a higher risk of worse LFS compared to those receiving a single cycle. Additional risk factors associated with a lower LFS included increasing patient age per decade (HR, 1.16; 95% CI, 1.09–1.24; p < 0.001), earlier year of transplantation (per 5-year period, HR 0.77, 95% CI, 0.72–0.84; p < 0.001), and the use of RIC rather than MAC-TBI (HR 0.77, 95% CI, 0.62–0.95; p = 0.015). When comparing patients receiving two cycles versus three or more cycles, earlier year of transplantation (per 5-year period HR 0.81, 95% CI, 0.69–0.95; p = 0.011) and the use of RIC instead of MAC-TBI (HR 0.52, 95% CI, 0.27–0.1; p = 0.05) were also associated with reduced LFS.

Fig. 3: Probability of leukemia-free survival by number of induction cycles.
figure 3

In yellow patients who achieved CR1 after one induction cycle, in blue for those after two cycles, and in pink for those after three or more cycles.

For OS, the 2-year rate was 68% (95% CI; 66–70) in the single-induction cohort, 61% (95% CI; 54–66) in the two-induction cohort, and 60% (95% CI; 49–70) for patients receiving three or more cycles (Table 2) (Fig. 4). In the multivariate analysis (Table 3), receiving two cycles (HR 1.34, 95% CI,1.06–1.7, p = 0.016) or three or more cycles (HR 1.86, 95% CI,1.28–2.68, p = 0.001) was associated with lower OS compared to those who received one cycle. Other factors associated with a worse OS were increasing patient age per decade (HR, 1.24; 95% CI, 1.15–1.33; p < 0.001), earlier year of transplantation (per 5-year period HR 0.72, 95% CI, 0.66–0.79; p < 0.001), and T-cell lineage ALL compared to Ph positive B-ALL (HR 0.76, 95% CI, 0.6–0.96; p = 0.019).

Fig. 4: Probability of overall survival by number of induction cycles.
figure 4

In yellow patients who achieved CR1 after one induction cycle, in blue for those after two cycles, and in pink for those after three or more cycles.

Regarding GRFS, the 2-year rates were 41% (95% CI; 39–44) for the single-induction cohort, 36% (95% CI; 30–42) for the two-induction cohort, and 36% (95% CI; 26–46) in patients who received three or more cycles (Table 2). In the multivariable analysis (Table 3), increasing patient age per decade (HR 1.08; 95% CI, 1.02–1.14; p = 0.008), and earlier year of transplantation (per 5-year period, HR 0.87; 95% CI, 0.82–0.93; p < 0.001) were associated with a lower GRFS.

Discussion

This study reveals that adult ALL patients undergoing allogeneic HCT in CR1 after a single induction cycle experience significantly lower relapse rates and improved LFS and OS compared to those requiring two or more cycles. Notably, NRM remained consistent across groups, and outcomes did not differ meaningfully between patients needing two versus three or more cycles to achieve CR1. These findings underscore the number of induction cycles as a key prognostic factor for HCT outcomes. Clinically, this insight could refine risk stratification, enhance transplant candidate selection, and guide tailored post-transplant management strategies for ALL patients.

As a retrospective, registry-based study, some potential biases cannot be ruled out. To reduce one major source of bias, we restricted the cohort to patients in CR1. However, patients who required more induction cycles inherently had to survive long enough to reach CR1 and proceed to transplant, introducing the risk of immortal time bias. This may lead to an overestimation of benefit in early responders or an underestimation of risk in late responders. Furthermore, differences in conditioning regimens, ALL subtypes, and graft sources—such as a higher frequency of peripheral blood stem cell use in patients receiving three or more induction cycles—as well as the use of anti-thymocyte globulin and variations in GVHD prophylaxis, were not fully balanced between cohorts. These factors introduce heterogeneity that may affect outcomes independently of induction response. Although the multivariable analysis adjusted for several key covariates, important data such as extended genetic profiles, detailed pretransplant treatment history, minimal residual disease status at transplant, use of tyrosine kinase inhibitors in Ph-positive ALL, and donor lymphocyte infusion were not available in the registry. Despite these limitations, the study’s design and large sample size allowed for a robust assessment of the association between the number of induction cycles and post-transplant outcomes in adult with ALL.

Research on the impact of induction cycle number on HCT outcomes has primarily focused on acute myeloid leukemia (AML) [9, 10], with limited data specific to ALL. Our observation of higher relapse rate driving reduced survival in patients requiring two or more cycles aligns with AML studies [9, 10] and likely reflects underlying disease resistance. Patients needing multiple induction cycles to reach CR may harbor leukemias with high-risk genetic features or aggressive biology that increase chemotherapy resistance. Notably, relapse risk did not increase progressively beyond two cycles, as outcomes were comparable between the two-cycle and three or more-cycle groups. This plateau effect deserves further ALL-specific investigation to clarify its implications and guide treatment strategies.

Beyond the influence of induction cycle number, our study also confirmed a marked improvement in nearly all outcomes over time, consistent with prior EBMT findings in this setting [11]. This trend likely reflects advances in transplant practices and supportive care. We also confirmed established prognostic factors, including recipient age, conditioning intensity, and ALL subtype, with T-cell ALL showing poorer OS than Ph+ B-ALL. Together, these observations position induction cycle number as a valuable predictor within a broader landscape of evolving HCT outcomes.

In conclusion, this study highlights the number of induction cycles required to achieve CR1 as a significant determinant of HCT outcomes in adult ALL patients, with a single cycle linked to reduced relapse and improved survival. While these findings offer a practical tool for risk stratification and clinical decision-making, the retrospective design and unmeasured variables, such as MRD, underscore the need for cautious interpretation. Consistent with prior EBMT observations, our results affirm the evolving success of allogeneic HCT and position induction cycle number as an accessible marker within this landscape. Prospective studies integrating genetic and residual disease data are essential to validate these insights and optimize treatment strategies for ALL patients undergoing transplantation.