The genomic landscape of relapsed infant and childhood KMT2A-rearranged acute leukemia

Ahlgren, Louise; Pilheden, Mattias; Sturesson, Helena; Song, Guangchun; Walsh, Michael P.; Yang, Minjun; Maillard, Maud; Zhao, Huanbin; Cheng, Zhongshan; Singh, Varsha; Castor, Anders; Pronk, Cornelis Jan; Marquart, Hanne Vibeke; Lausen, Birgitte; Schneider, Pauline; Barbany, Gisela; Pokrovskaja Tamm, Katja; Abrahamsson, Jonas; Lohi, Olli; Fogelstrand, Linda; Menendez, Pablo; Pieters, Rob; Zhang, Jinghui; Lindkvist-Petersson, Karin; Yang, Jun J.; Gruber, Tanja A.; Stam, Ronald W.; Ma, Jing; Hagström-Andersson, Anna K.

doi:10.1038/s41467-025-64190-8

Download PDF

Article
Open access
Published: 08 October 2025

The genomic landscape of relapsed infant and childhood KMT2A-rearranged acute leukemia

Nature Communications volume 16, Article number: 8964 (2025) Cite this article

6523 Accesses
2 Citations
16 Altmetric
Metrics details

Subjects

Abstract

To study the mechanisms of relapse in KMT2A-rearranged (KMT2A-r) acute lymphoblastic (ALL) and acute myeloid leukemia (AML), we performed whole-genome and exome sequencing of infants and children with relapsed ALL/AML (n = 36), and longitudinal deep-sequencing of 257 samples in 30 patients. Somatic alterations in drug-response genes, most commonly in TP53 and IKZF1 (64%), were highly enriched in early relapse ALL (79%, 9-36 months after diagnosis), but rare in very early relapse ALL (<9 months, 9%). A marked chemotherapy-exposure signature was detected for mutations in early relapse ALL but not in very early ALL or AML relapse, in line with different mechanisms of relapse. Longitudinal analyses could track residual leukemia cells, clonal drug responses, and the upcoming relapse. These results highlight that KMT2A-r ALL and AML evade therapy differently and provide insights into the mechanisms of relapse in this highly lethal form of pediatric acute leukemia.

Single-cell transcriptomics reveals a distinct developmental state of KMT2A-rearranged infant B-cell acute lymphoblastic leukemia

Article Open access 14 March 2022

Proteasome inhibition targets the KMT2A transcriptional complex in acute lymphoblastic leukemia

Article Open access 13 February 2023

Clonal origin of KMT2A wild-type lineage-switch leukemia following CAR-T cell and blinatumomab therapy

Article Open access 20 July 2023

Introduction

Acute leukemia is the most common pediatric cancer with a 5-year overall survival rate above 90% for acute lymphoblastic leukemia (ALL)¹ and almost 80% for acute myeloid leukemia (AML)². However, ALL in infants, i.e., children aged 0-12 months, with KMT2A-rearrangements (KMT2A-r), still have a dismal prognosis with a 21-45% 6-year event-free survival (EFS) rate compared to 74% EFS in non-KMT2A-r ALL³. This poor prognosis extends to children above 1 year of age with KMT2A-r ALL (69% EFS)⁴. The underlying pathology behind these lower cure rates is not well understood. However, new therapies including Blinatumomab⁵ and chimeric antigen CAR T-cell (CAR-T)⁶ therapy will hopefully improve future survival.

KMT2A-r infant ALL comprises 4% of ALL in children and whole genome sequencing (WGS) has shown few additional mutations, but that activating kinase-PI3K/RAS-mutations occur in 50% of cases^7,8. Over 90% of infants reach clinical remission (CR)³, but are prone to rapid relapse, with 90% of relapses occurring within 2 years from diagnosis and 66% within one year^3,5, and our understanding of the mechanisms driving relapse remains limited. Mutations in CREBBP, FPGS, IKZF1, MSH2/6, NR3C1/2, NT5C2, PRPS1/2, TP53, and WHSC1 are enriched at relapse across genetic subtypes in childhood ALL (17% in very early, <9 months after diagnosis; 65% early relapse, 9-36 months; 32% late relapse ALL, >36 months)^9,10,11. In relapsed childhood AML, mutations in FLT3, WT1, and UPTF are enriched at relapse^12,13.

To gain insights into the mechanisms of relapse and clonal dynamics during treatment, we studied a cohort of 36 relapsed KMT2A-r infant and childhood ALL/AML cases with WGS and whole exome sequencing (WES), providing the largest cohort of pediatric KMT2A-r acute leukemia reported to date. In addition, we performed targeted deep-sequencing of 257 samples from 30 cases, including 14 cases from the relapse cohort, during the disease course. Mutational signature analysis suggested that chemotherapy was the primary cause of mutations in early relapse ALL but not in very early relapse ALL or AML. 79% of early relapse ALL gained alterations in drug-response genes at relapse, most commonly in IKZF1 and TP53 (64%). By contrast, such alterations were rare in very early relapse ALL (9%), in line with inherent resistance, emphasizing the different mechanisms of relapse depending on relapse time. In AML, the same mutational processes were active at diagnosis and relapse, suggesting that the cells that gave rise to relapse were dormant during therapy and evaded therapy differently than the ALL cells. Combined, our data highlight the different mechanisms of relapse in KMT2A-r ALL and AML. Finally, longitudinal sequencing uncovered unique clonal responses, providing the foundation for future therapeutic interventions.

Results

Genomic landscape of relapsed KMT2A-r ALL

To gain insights into mechanisms of relapse, we performed WGS and WES on 36 cases of relapsed KMT2A-r ALL (n = 25, age range 1 day–1.3 years, average 164 days) or AML (n = 11, age range 121 days-17.7 years, average 4.7 years) including 26 infants (24 ALL, 2 AML) and 10 children (1 ALL, 9 AML) (average coverage: WGS 43X, WES 140X, Fig. 1a, b, Supplementary Fig. 1a, b, and Supplementary Data 1, 2). The mutational burden of single nucleotide variants (SNVs), insertion and deletions (indels), structural variants (SVs), and copy number alterations (CNAs), increased from diagnosis to relapse (Supplementary Fig. 1c, d, and Supplementary Data 3-6). KMT2A-r ALL and AML had a similar mutational burden at diagnosis, while at relapse, ALL displayed a higher frequency of mutations (p = 0.031) (Supplementary Fig. 1d). Three patients were hypermutated at relapse and therefore excluded from statistical comparisons (P11, P17, P137). There was no correlation between the number of mutations and age at diagnosis, or to the number of gained mutations at relapse and relapse time, specific KMT2A‑r or age (Supplementary Fig. 1e–h).

**Fig. 1: The genetic landscape at relapse in *KMT2A*-r ALL.**

In ALL, 20 mutated genes in five pathways were identified at relapse, that primarily affected cases that relapsed 9 months after diagnosis (early relapse, based on Li et al.¹¹) with cases that relapsed before 9 months or had refractory disease (very early), having a paucity of such alterations: B-cell maturation (very early 27%, early 64%), cell cycle (very early 27%, early 50%), glucocorticoid receptor signaling (very early 0%, early 21%), purine metabolism (very early 0%, early 21%), and kinase/PI3K/RAS-signaling (very early 18%, early 50%). Importantly, 79% of early relapse ALL (11/14) harbored IKZF1 (n = 8), TP53 (n = 6), NT5C2 (n = 2), CREBBP (n = 1), PRPS2 (n = 1), WHSC1 (n = 1) or NR3C1 (n = 1) alterations at relapse, while such alterations were rare in very early relapse ALL (IKZF1 n = 1, 9%, 1/11) (Fig. 1c–f, Supplementary Fig. 1i, 2a–d, and Supplementary Data 7). TP53 and IKZF1 were the most frequently altered genes (64%, 9/14 early relapse) and co-occurred in 56% suggesting genetic cooperativity, with a trend towards a lower overall survival if either one was altered (p = 0.091) (Supplementary Figs. 2e, 3, 4 and Supplementary Data 8). TP53 and IKZF1 alterations were mainly detected in cases with the KMT2A::AFF1-fusion (80%). All TP53-alterations were gained at relapse, and most had multiple such alterations (1-3) with three cases having a complete loss of TP53 (Supplementary Data 7). Similarly, only 2/9 IKZF1-alterations could be detected in the paired diagnostic sample with one identified by PCR only (P58), and one case lacking a paired diagnostic sample (Supplementary Fig. 5a). In addition, alterations in genes implicated in purine metabolism (NT5C2^R39Q/R367Q, PRPS2^P320L) and glucocorticoid signaling (CREBBP^G119fs, WHSC1^E1099K, focal deletion of NR3C1), co-occurred with IKZF1 or TP53 in 4/6 cases (Fig. 1d). The PRPS2^P320L has not been described before and when analyzing the Alpha Fold model, it was in close proximity of R302 (Supplementary Fig. 5b). The R302 residue affects the stability of hexamer formation when mutated and thereby also PRPS2 activity¹⁴. However, knockout of wild-type PRPS2 alongside with overexpression of PRPS2^P320L revealed that PRPS2^P320L had minimal effects on viability in the presence of 6-mercaptopurine (6-MP) (Supplementary Fig. 5c–h). Finally, CDKN2A/B deletions were detected in 2/3 ALLs with refractory disease, and 1/14 early relapse ALL had copy number neutral loss of heterozygosity of chromosome 9p with no accompanying mutation in CDKN2A/B or PAX5 (Fig. 1d and Supplementary Data 7). Targeted deep-sequencing did not detect these mutations at diagnosis and inspection of the WGS reads failed to detect the NR3C1 deletion, suggesting acquisition during treatment or beyond our level of detection (Supplementary Data 9). To validate our findings, we analyzed data from three studies^7,8,11, including data from 80 diagnostic KMT2A-r infant ALL cases of which 18 had a paired relapse. This showed that 6/12 (50%) of infants with early relapse, and none of the 6 infants with very early relapse, had such alterations at relapse (Supplementary Data 10). Further, TP53-alterations were rare at diagnosis (2/80 cases), with one of the cases having a very early relapse, but data from the paired relapse was not available.

At diagnosis, 32% of patients (7/22) harbored signaling mutations and 36% at relapse (9/25), and mutations were enriched at early relapse (50% versus 18% of very early). Diagnostic FLT3-mutations were enriched at early relapse (23%) and may mark cases with high relapse risk. Only two cases maintained their signaling mutation from diagnosis to relapse and in the remaining cases, they were gained at relapse. Subclonal signaling mutations (variant allele frequency, VAF < 0.3) were more common at diagnosis (63% versus 25% at relapse) (Supplementary Fig. 5i, j and Supplementary Data 7). Combined, 79% of early relapse ALL had at least one alteration in TP53, IKZF1, CREBBP, NT5C2, WHSC1 or NR3C1 at relapse and the general paucity of such alterations in very early relapse/resistant ALL (9%) suggest that their mechanisms of relapse are different.

Genomic landscape of relapsed KMT2A-r AML

Mutations in five pathways were enriched at AML relapse; cell cycle, transcription, WNT-signaling, epigenetic and signaling with only six genes recurrently altered including deletion of 12p affecting CDKN1B and ETV6 (n = 3), mutations in CCND3 (n = 2), WT1 (n = 2), SETD2 (n = 2), and FLT3 (n = 2) (Fig. 2a–d and Supplementary Fig. 6a–e, 7a). Cell cycle alterations affected around 40–50% of early and late relapse AML, respectively and included heterozygous 12p deletions, and mutations in CCND3, DZIP, and TP53. The targets for the 12p deletions have been suggested to be CDKN1B, a negative cell cycle regulator, and/or the ETS-transcription factor ETV6¹⁵. WT1-mutations were gained at late relapse (2/6, 33%) and detected in cases with rare KMT2A-r (KMT2A::AFDN, KMT2A::ELL), whereas early relapse harbored mutations in other WNT-signaling pathway genes (40%, APC, CTNNB1). Around 40-50% of early and late relapse harbored epigenetic mutations which mainly included epigenetic writers (MYST4/KAT6B, PRDM16, SMYD2, and SETD2). SETD2 was also inactivated by a translocation in a 12p-deleted case, which led to an in-frame SETD2::DCP1B product, where the C-terminal part of SETD2 containing the critical SRI-domain fused with 12p; consistent with a truncating mutation (Fig. 2e). Alterations in transcription factor genes were enriched at early relapse (60% vs 33%) and included ETV6 (12p-del), and mutations in NFE2, POU2F2, AHRR, and SREBF2.

Signaling mutations were more common at diagnosis than at relapse in AML (73% versus 36%). These mutations were more often subclonal at relapse (67%, 4/6) than at diagnosis (33%, 3/9) and cases with subclonal signaling mutations at relapse had more than one such mutation at different VAFs (Supplementary Fig. 7b–d and Supplementary Data 7). Additionally, while diagnostic KRAS/NRAS mutations were maintained at early relapse they were lost at late relapse, which instead gained receptor tyrosine kinase mutations (FLT3, CSF1R). Within all enriched relapse pathways, early relapse AML maintained most mutations from diagnosis to relapse (60%, 12/20), whereas late relapse gained (38%, 8/21) and lost (29%, 6/21) more mutations (Fig. 2f). This suggests that early relapse AML, similar to very early relapse ALL, may exhibit an intrinsic drug resistance, whereas late relapse AML cases gain mutations that promote relapse.

Mutational signatures at relapse fingerprints cellular history

We next analyzed all SNVs in non-repetitive regions showing that C > T transitions were the most common mutation at both diagnosis (45%) and relapse (44%), with no significant difference between ALL and AML (Supplementary Fig. 8a–c and Supplementary Data 11). Based on the trinucleotide context, 60 single-base substitution (SBS) signatures have been identified and attributed to both known and unknown etiologies¹⁶. To understand which mutational processes that were active, we examined the mutational signatures at relapse. In ALL, 9/14 cases had chemotherapy exposure as their primary signature and 36%-86% of their mutations were attributable to chemotherapy exposure (early relapse 89% vs very early 20%), particularly SBS87 which has previously been correlated to thiopurine treatment (Fig. 3a)¹⁷. Five cases did not have chemotherapy as their primary signature; in P28, P135, and P136 which had refractory disease, the primary signature was of unknown origin, with chemotherapy being the primary known signature for P28, and mismatch repair (MMR) for P135 and P136. P17 had MMR as the primary signature but exhibited a smaller subset of chemotherapy-associated mutations at an elevated VAF (Supplementary Fig. 8d). Finally, P11 was hypermutated at relapse, with 77% of mutations having a clock-like signature. Cases with mutations affecting purine metabolism (P69, P58 and P3), had the most prominent chemotherapy-induced signatures and very early relapse, the least prominent.

**Fig. 3: Mutational signatures and clonal evolution patterns.**

As opposed to ALL, the most common mutational signature at AML relapse was of unknown origin (6/9) with chemotherapy-associated signatures virtually absent (Fig. 3a). To investigate this further, we performed an analysis of relapse-specific mutations only and compared them to the diagnostic profiles, showing that in contrast to ALL, the cause of mutations at AML relapse did not differ from that at diagnosis and thus mirrored the natural acquisition of mutations (AML Spearman r² = 0.75 vs ALL r² = 0.36) (Supplementary Fig. 8e). Moreover, the mutational processes in AML remained stable across multiple relapses (Supplementary Fig. 9a). Combined, early relapse ALL was characterized by mutations that associated with chemotherapy exposure, indicating that the surviving cells accumulate mutations because of cytotoxic cell damage. Relapsed AML lacked chemotherapy-exposure signatures suggesting that the relapse originated from an evolutionary early cell that remained unaffected by chemotherapy.

Clonal evolution patterns from diagnosis to relapse

We next investigated the pattern of clonal evolution from diagnosis to relapse, showing that all relapses lost diagnosis-specific variants indicative of branching evolution (Supplementary Figs. 9b, 10a). By comparing the diagnostic and relapse VAFs, it was shown that in 55% of cases (12/22), relapse was seeded by multiple diagnostic clones, and in the remaining 45%, by a single sweeping clone that was detected at diagnosis in 40% of those cases (Fig. 3b, c, and Supplementary Data 3). Most patients had additional subclones at relapse, in line with a continuous evolution creating new clones. The lack of detection of the relapse clone at diagnosis does not exclude that it was present but below our resolution. In ALL, the clonal evolution pattern correlated to time to relapse with multiple diagnostic clones seeding relapse seen in 80% of very early and 33% of early relapse ALL, whereas in AML the frequency was the same (early, 66% and late, 60%) (Fig. 3c).

One ALL and four AML had multiple BM relapses, allowing us to study how the genetic landscape evolved across consecutive relapses (Fig. 3d, Supplementary Fig. 11a–d and Supplementary Data 3). The mutational burden increased after each relapse and mutations in driver genes were not lost when once gained at relapse suggesting a fitness advantage. This stepwise replacement of a fitter clone is illustrated in P89, an infant ALL with three relapses (Fig. 3e and Supplementary Fig. 11e). At the first relapse, the diagnostic PAX5^P207fs-clone was dominant and a new subclone emerged in 22% of cells, which acquired a UBR7^Y92* at the second relapse, gained new mutations and expanded in a sweep at the third relapse, where a new subclone emerged. Moreover, 64 days before the first relapse, the patient was in CR by flow cytometry (FC), but NGS detected the relapse clone in 2% of cells.

Residual leukemia cells and unique clonal responses

To uncover clonal responses to treatment and study measurable disease at the clinical measurable residual disease (MRD) timepoints, we studied 19 relapse (10 ALL, 9 AML) and 11 remission (7 ALL, 4 AML) cases using patient-specific mutations, including the KMT2A-r, by deep-sequencing (average coverage 3250/site, average 17 mutations/patient, 257 samples) (Fig. 1b, Supplementary Figs. 12–16b, and Supplementary Data 1, 12–14). CR was defined as <5% blasts in patients treated on protocols from year 2000 or older and by FC < 0.1% for newer protocols. The VAFs between the WGS/WES and targeted gene resequencing correlated (r² = 0.66) and a dilution series showed a sensitivity around VAF 0.002 or 0.4% of leukemia cells (Supplementary Fig. 16c–e).

At day 15, the first MRD-measurement during induction therapy, 13/15 ALL cases had measurable disease, with a similar average VAF in relapse and remission cases, whereas at the end of induction (EOI, day 29), relapse cases had a trend towards higher values (p = 0.069) (Fig. 4a, b and Supplementary Fig. 16f–j, 17a). This was mainly driven by very early relapse cases and if no mutations were detected, patients remained in remission. Cases with KMT2A::AFF1 and rare KMT2A-r still had measurable disease at the EOI while those with KMT2A::MLLT3 were negative (Supplementary Fig. 17b). A subset of cases had measurable disease at day 15 or 29 despite being in CR (Supplementary Fig. 17c, d and Supplementary Data 14).

**Fig. 4: Longitudinal analyses and clonal dynamics during treatment.**

Around half of AML cases had measurable disease after the first induction (EO1I) and the average VAF was similar in remission and relapse patients, with two having measurable disease despite being in CR (Fig. 4c and Supplementary Fig. 17e-h). After the second induction (EO2I, days 36-76), 7/12 cases had measurable disease with 3 cases being in CR by clinical MRD, and 6/7 cases relapsed (Fig. 4d and Supplementary Fig 17i). Further, 3/4 early relapse AML had higher values after EO2I as compared to EO1I, and 4/6 KMT2A::MLLT3-r cases had decreased values, concordant with their favorable outcome (Supplementary Fig. 17j, k).

Low-frequency KMT2A-fusion positive cells were found at CR outside of the MRD time points in 11/30 patients (3 remission, 27%; 8 relapses, 42%) (Supplementary Data 12). This included four infant ALLs with measurable disease across all time points until relapse (6-13 time points), one had resistant disease (P28), but the others entered CR (P12, P58, P89) (Figs. 3e, 4e and Supplementary Fig. 12a, c). Excluding these cases, in 4/16 cases with a BM relapse, we detected the relapse clone after induction therapy a long as 25–92 days before relapse. Some diagnostic mutations/SVs were always present and sufficient to detect the relapse. Combined, molecular monitoring with patient-specific mutations readily detected the leukemia cells.

The longitudinal data allowed analysis of clonal responses to treatment and revealed several interesting findings (Fig. 4f–i and Supplementary Fig. 18a–d). First, chemotherapy can affect genetically distinct clones differently such as in P69, an infant ALL that relapsed in maintenance 1 during treatment with 6-MP, with an NT5C2^R39Q in 28% of the relapse cells, where a change to the ALLR3 protocol without thiopurines eradicated the NT5C2-containing clone. Second, a diagnostic clone that initially had the most rapid response to chemotherapy, reappeared to cause relapse after 147 days in remission (P11). Third, the relapse clone can be dormant for a long time before causing relapse, such as in P17, where it was undetectable for 378 days until it reappeared in 1% of cells, almost 100 days before expanding in a clonal sweep. To gain further insight into the clonal composition and mutational co-occurrence in this patient, single cells were sorted and selected targets were assessed (Fig. 4j, Supplementary Fig. 18e–k and Supplementary Data 15). This showed that mutations in CDKN1B and KRAS co-occurred in the dominant clone, with two subclones, one with mutations in TP53, PTPN11 and CREBBP in 40% of cells, and one without additional mutations in the enriched pathways (20% of cells). We conducted a similar analysis for P57 and verified clonal expansion of a population exhibiting two TP53-mutations (Supplementary Fig. 12d). Finally, different signaling mutations likely have distinct molecular consequences as a minor diagnostic NF1-containing clone outcompeted a larger PTPN11-containing clone at relapse (P77). These data demonstrate unique clonal responses to the treatment given.

Discussion

We here demonstrate different mechanisms of relapse in KMT2A-r ALL and AML. In ALL, early relapse is likely driven by surviving cells that accumulate mutations because of chemotherapy exposure and in 79%, mutations in drug-response genes were detected at relapse, most commonly in TP53 and IKZF1. By contrast, 91% of very early relapse ALL lacked such mutations, in line with inherent resistance. Relapsed AML had the same mutational processes active at diagnosis and relapse and lacked a chemotherapy-exposure signature, suggesting relapse from a cell that has remained unaffected by therapy (Fig. 5).

**Fig. 5: Illustration of different relapse mechanisms in *KMT2A-*r leukemia cells.**

Multiple diagnostic clones seeding relapse were seen in 55% of patients overall and enriched in very early relapse/resistant KMT2A-r ALL (80% versus 33% in early relapse). This evolutionary pattern is seen in 20% of relapsed childhood ALL across genetic subtypes and thus enriched in KMT2A-r ALL¹¹. Together with the general lack of new driver mutations, this implies that very early KMT2A-r infant ALL relapse is driven by intrinsic resistance where other factors, including the host genetics, microenvironment and/or cell state drive relapse¹⁸. Early relapse KMT2A-r ALL commonly evolved through a clonal sweep (67%) and harbored alterations in chemoresistance-associated genes including TP53, IKZF1, NT5C2, NR3C1, WHSC1, and CREBBP in 79%. Similar alterations are detected in 17% of very early relapse and 65% of early relapse ALL across genetic subtypes in pediatric ALL (n = 94 non-KMT2A-r, n = 9 KMT2A-r) and restricting the analyses to non-KMT2A-r cases, in 22% and 70%, respectively¹¹. Importantly, co-existing IKZF1 and TP53 alterations at relapse were not seen in any of these cases¹¹, suggesting that this may be unique to KMT2A-r ALL. IKZF1 and TP53 associate with inferior prognosis in childhood ALL^19,20,21 and recently, diagnostic TP53 and IKZF1 alterations were identified in 14% and 8% of adult KMT2A-r ALL respectively, and found to be associated with poor outcome²². These alterations are rare at diagnosis in infant and childhood KMT2A-r leukemia (4% and 0%, respectively) and highly enriched at relapse herein (64%)⁷. To validate our findings, we analyzed data from 18 KMT2A-r ALL trios^7,8,11, showing a frequency of chemotherapy resistance-associated alterations on a par with our data in early relapse cases (50%, 6/12), and a paucity of such changes in very early relapse cases (0/6). Still, the number of patients analyzed remains low, and some lacked complete genomics data, thus larger studies are needed to confirm our results. In agreement with the patient data, functional studies have demonstrated that alterations in TP53 and IKZF1 drive resistance to numerous cytostatic, suggesting that alternative treatments are needed to successfully treat these patients^{11,23,24,25,26,27,28,29}.

In line with the possible acquisition of TP53-alterations during treatment^11,25,30, they were not detected at diagnosis with targeted sequencing including duplex-sequencing in P58³⁰. Similarly, remaining drug-associated mutations were gained during maintenance therapy with thiopurines and/or anthracyclines, and not detected at diagnosis^9,10,31. This is, to the best of our knowledge, the first study demonstrating that IKZF1 and TP53 are tightly connected to early relapse in KMT2A-r ALL, detected in 64% of cases, suggesting monitoring during treatment for the rise of these high-risk lesions may allow for early intervention(s).

Early relapse AML maintained most mutations within the enriched pathways while later relapses had a higher mutational turnover. Diagnostic TP53 or CCND3 alterations were detected in 40% of early relapse AML and may mark patients with a high relapse risk, warranting future studies. CCND3 mutations are rare in KMT2A-r AML (8.9%)³², and enriched in relapse cases (16.7% herein). Recurrent 12p deletions affecting ETV6 and CDKN1B were identified at relapse, and as the minimally deleted region does not always contain ETV6, CDKN1B may be the target gene¹⁵. 12p deletions associate with inferior survival in childhood AML, are often cytogenetically cryptic/complex and may lead to ETV6-fusions³³. Also our cases had complex karyotypes and did not survive, and although no ETV6-fusions were identified, a SETD2::DCP1B-fusion was detected which caused loss of SETD2. Only one SETD2-fusion has been described in leukemia, a SETD2::CCDC12, that we identified in a child with KMT2A-r ALL⁷. Finally, while RAS-signaling mutations were maintained at early relapse, they were lost at late relapse, which instead gained kinase receptor mutations, suggesting variable dependence on signaling pathways.

The mutational signature analysis revealed a difference in the mechanisms of relapse in KMT2A-r ALL and AML and between very early and early relapse ALL. Relapsed ALL had a large number of new mutations and in early relapse ALL, most mutations gained at relapse had a chemotherapy-exposure signature and likely occurred due to treatment, as also suggested by others^11,34. Thus, surviving cells accumulate mutations during treatment until a point where they expand to cause relapse, often accompanied by mutations in drug-response genes (79%)^11,25,31. By contrast, the chemotherapy-exposure signature was not prominent in very early relapse infant ALL, and their short time to relapse rather points towards an inherent resistance¹⁸. Interestingly, the primary signature in the three refractory ALL cases was of unknown origin, indicating a potential common yet unidentified cause. In relapsed AML, the pattern of mutations was similar at diagnosis and relapse and mirrored the natural acquisition of mutations, rather than being caused by chemotherapy (two infants and seven children were analyzed). This demonstrates the absence of additional mutational pressure and that an evolutionary early and inherently resistant or dormant cell, which remained unaffected by treatment, caused the relapse. This finding is consistent with recent single-cell RNA-sequencing data showing a shift at relapse toward a more primitive cellular state in KMT2A-r AML³⁵. Although the strong chemotherapy-induced signature in ALL and the more diagnosis-like signature in AML have been seen previously, this difference has not been highlighted before^11,36. These different modes of resistance are important as adjusted doses might be required for persistent cells that acquire mutations along treatment, whereas for inherently resistant cells, new treatments are likely needed, and dormant cells might be targeted by being pushed into cell division. Combined, new therapies including Blinatumomab, CAR-T, or Menin inhibitors, are likely needed to conquer all types of relapsed disease.

Few have analyzed childhood acute leukemia during treatment with NGS^11,37. Our data extend these studies to include infant leukemia and showed that a change in therapy can favor the eradication of one clone and expansion of another demonstrating variable sensitivity to the given drugs. Further, a diagnostic clone that initially was the most sensitive clone to therapy can reappear to cause relapse, suggesting awakening by changes in the microenvironment, signaling, immune system or a combination thereof³⁸. Mutations in the same pathway likely provide different cellular advantages and can outcompete each other at relapse and finally, a stepwise replacement of a fitter clone was seen in cases with multiple relapses.

Given the prognostic value of MRD in childhood ALL^39,40, we investigated measurable disease using personal primers, and very early relapse ALL still had high levels of leukemia cells at the EOI, whereas cases that stayed in remission lacked detectable leukemia cells. In AML, the fraction of leukemia cells after the second induction has prognostic significance⁴¹ and herein 6/7 cases with measurable disease at that time relapsed. All but two cases were in CR by FC, highlighting the sensitivity of NGS^42,43. However, the limited cohort size may affect the results, underscoring the need for validation in larger studies. Molecular monitoring with personal mutations detected relapse up to 3 months before clinical relapse and could be an important clinical tool as earlier detection increases the treatment window and chances of survival⁴⁴.

Collectively, this study provides unique insights into the mechanisms of relapse in a highly aggressive leukemia. Mutations in chemoresistance-associated genes were identified in 79% of early relapse ALL with co-existing TP53 and IKZF1-alterations being highly enriched (64%) while almost all very early relapse ALL lacked such alterations. Our results suggest that some of the mutations that drive early KMT2A-r relapse ALL accumulate in persisting leukemia cells because of chemotherapy treatment, while very early relapse ALL is driven by an inherent resistance. In contrast to ALL, KMT2A-r relapse AML stems from an evolutionary early and likely dormant cell that remains unaffected by chemotherapy exposure. These results have implications for future treatment strategies and prediction of relapse.

Methods

Patient cohort

Informed consent was obtained according to the Declaration of Helsinki and the study was approved by the local Ethics Committee of Lund University, Sweden. The serial samples were made possible by a practice of periodical bone marrow monitoring during therapy. The cohort consisted of 52 KMT2A-r patients diagnosed between 1992 and 2010 and treated on the Nordic Society of Paediatric Haematology and Oncology (NOPHO), Interfant protocols, AALL0631, or TINI (Supplementary Data 1). The median time from diagnosis to relapse was 405 days for infant ALL (range 60–1024), 419 days for childhood ALL (range 378–459), 372 days for infant AML (range 60–683) and 205 days for childhood AML (range 104–337). The ALL cohort was divided into those that stayed in remission (n = 7), very early relapse/resistant disease if relapse occurred <9 months after diagnosis or if they never reached CR (n = 11) and early relapse if relapse occurred 9–36 months from diagnosis (n = 16, note that for P22 and P56, the paired relapse sample was not available as they occurred in the testis and CNS, respectively), based on the definition by Li et al.¹¹. The AML cohort was divided into those that stayed in remission (n = 4), early or late relapse if relapse occurred before (n = 6) or after 1 year in CR (n = 8), based on the clinical definition⁴⁵.

DNA extraction

DNA was extracted from 267 BM and 45 peripheral blood (PB) samples as a part of clinical procedures (n = 119), from TRIZOL (n = 42), fixative (n = 23), cell pellets (n = 87), or glass slides (n = 40) using standard protocols including TRIzolTM Reagent (Thermo Fisher Scientific, Waltham, MA, USA), Gentra Puregene Blood Kit (Qiagen, Hilden, Germany), QIAamp DNA Micro kit (Qiagen) or an in-house protocol for fixative (Supplementary Data 12). From cells in TRIzolReagent (Thermo Fisher Scientific), DNA was isolated from the interphase with an in-house protocol. In brief, 500 μl Back extraction buffer (50 mM Sodium Citrate (Sigma-Aldrich, St Louis, MO), 1 M Tris (Thermo Fisher Scientific) and 4 M Guanidintiocyanat (Sigma-Aldrich) were added to the interphase, followed by 55 °C for 5 min, 10 min mixing and 12,000 x g, 30 min. The water phase was mixed with 1 μl Glycogen (10 µg/µl, Invitrogen, Waltham, MA, USA) and 800 µl 2-propanol (Sigma-Aldrich), incubated for 5 min, followed by 12,000 x g, 15 min at 4 °C. The pellet was washed with 80% Ethanol (Solveco Rosersberg, Sweden) and resuspended in nuclease-free water (Invitrogen). The phase lock gel system (Light, QuantaBio, Beverly, MA, USA) was used for clean-up after extraction.

For samples in fixative, cells in 200 µl freshly made fixative were neutralized with 140 µl 5 M NaOH (Sigma-Aldrich), lysed for 10 min at RT with 0.32 M Sucrose (Merck), 10 mM Tris (VWR, PA, USA), 5 mM MgCl₂ (Sigma-Aldrich), and 1% Triton X-100 (Sigma-Aldrich), followed by 350 x g, 5 min, resuspension in 50 µl Extraction buffer (1 M Tris (pH 8), 0.5 M EDTA (Sigma-Aldrich), 20% SDS and Proteinase K (0.2 µg/µl, Qiagen) and 1 h at 60 °C, followed by clean-up using the phase lock gel system.

Slides were incubated with 100% xylene (Sigma-Aldrich) until the cover slide detached, rinsed with Ethanol (70, 85, 99.5%) and then milli-Q water. The cell smears were scraped off into 180 µl ATL-buffer and 30 µl proteinase-K and extracted using the QIAamp DNA Micro kit (Qiagen). DNA quantity and quality were assessed by the Qubit 4 fluorometer and NanoDrop One (both Thermo Fisher Scientific), respectively.

Whole-genome, whole-exome sequencing and bioinformatic analysis

Whole-genome libraries were performed using the TruSeq Nano protocol (Illumina Inc, San Diego, CA, USA) and 150 bp paired-end sequencing was performed using Illumina HiSeqX or NovaSeq 6000. Whole-exome libraries were performed using the Nextera Rapid Capture Exome kit (Illumina) (Nextera rapid capture exome 15037436 Rev.D) or Twist Comprehensive Exome (Twist Bioscience, San Francisco, CA, USA), and 300 bp paired-end sequencing was performed on Illumina NextSeq500 or NovaSeq6000.

Methods used for mapping, coverage and quality assessment, SNV and Indel-analysis, tier annotation, and Loss of Heterozygosity (LOH) detection have been described in ref. ¹⁷. In short, for the annotation of transcripts, Ensembl (build 54_36) and Genbank (build downloaded on 21 May 2009) were used. The following four tiers were used to classify sequence variants (i) tier 1: coding synonymous, nonsynonymous, splice-site and noncoding RNA variants; (ii) tier 2: conserved variants (conservation score cutoff of greater than or equal to 500, based on either the phastConsElements28way table or the phastConsElements17way table from the UCSC Genome Browser) and variants in regulatory regions annotated by UCSC (regulatory annotations included are targetScanS, ORegAnno, tfbsConsSites, vistaEnhancers, eponine, firstEF, L1 TAF1 Valid, Poly(A), switchDbTss, encodeUViennaRnaz, laminB1 and cpgIslandExt); (iii) tier 3: variants in non-repeat masked regions; and (iv) tier 4: remaining SNVs. For structural variations, CREST⁴⁶ was used and for CNA, CONSERTING⁴⁷. Both SVs and CNAs were manually inspected. Zoomed-in allelic imbalance plots were generated for PAX5, IKZF1, and CDKN2A/B, to ensure that focal CNAs were not missed.

A subset of samples was analyzed as follows (St Jude, CAB pipeline); Paired-end reads were aligned to human reference genome build hg38 using BWA-mem that is implemented in the tool ‘fq2bam’ from NVIDIA Clara Parabricks GPU accelerated toolkit. For tumor/normal variant calling, an ensemble strategy with optimized filtering (SNV called by at least 2 callers) was employed to identify SNVs/indels using Mutect2 (v4.1.2.0)⁴⁸, SomaticSniper (v1.0.5.0)⁴⁹, VarScan2 (v2.4.3)⁵⁰, MuSE (v1.0rc)⁵¹, and Strelka2 (v2.9.10)⁵². Variant annotation was performed using Annovar⁵³. These consensus calls were further undergoing manual assessment and evaluation of read depth, mapping quality, and strand bias, to eliminate additional artifacts. Tumor/normal matched samples were further subjected to copy number variant (CNV) calling using CONSERTING⁴⁷ and CNVKit⁵⁴, as well as SV calling by Gridss⁵⁵, Manta⁵⁶,and DELLY⁵⁷. The results were assembled and QCed to keep high-quality SVs. For P30, SNP-array data was included for the diagnostic sample⁵⁸. P137 and 138 were analyzed with Dragen.

P133 was analyzed using another pipeline³⁶. Briefly, pair-end reads were aligned to hg38 with BWA⁵⁹. Duplicate reads were marked with Picard and Indel realignment was performed with GATK⁴⁸. Somatic SNVs and indels were called using MuTect, MuTect2 and MuSE^48,51. Variants were included if either 1) Identified in both WGS and WES by at least two out of three variant callers, or 2) Identified by either WGS or WES by all three variant callers (MuSE, Mutect, and Mutect2). SVs were identified by Manta⁵⁶, DELLY⁵⁷, novoBreak⁶⁰, and SvABA⁶¹, all with default settings. Patchwork was used for CNV-calling⁶². Mutations were pre-filtered based on sequencing depth in tumor and normal sample, sequencing reads for the mutant allele, and allele frequency in the population.

Patients lacking a germline sample were analyzed against a random germline and only genes that were coupled to relapse herein, or in published cohorts^11,37 were studied. Putative SNVs were excluded if they had a normal allele frequency >0.01 (Exome Aggregation Consortium, reference variants (rs) and SWEfreq) if the germline VAF was >0.2, and if mutant reads were only on one strand. For structural variants only known alterations were included.

Mutational signatures

The R-package MutationalPatterns⁶³ 3.14.0 was utilized to assess the relative contribution of single base substitutions in non-repetitive regions identified through WGS to the Cosmic SBS mutational profiles. Each signature was classified as Clock-like, APOBEC/AID, Mismatch Repair (MMR), UV, Chemotherapy treatment, Biological, ROS, Tobacco, Chemical exposure or Unknown based on their description in Cosmic, and each patient was assigned to a “primary signature” based on their most common classification (Supplementary Data 11)¹⁶. When calculating the average VAF per signature, each mutation was first assigned to the most likely signature: All 96 possible motifs were designated a most likely signature, by mapping each one to the Cosmic SBS mutational signature based on where the motif has the highest contribution.

Pathway analysis and relapse-specific mutations

Pathways were included if they contained >1 relapse-specific alteration in paired samples. Glucocorticoid receptor signaling was included as it is connected to resistance. All mutations were evaluated using DriverPower⁶⁴ and Annovar⁵³. The heatmap and location of relapse mutations were generated using St Jude’s webtool ProteinPaint (https://pecan.stjude.cloud/proteinpaint/study/). For CNAs, focal CNAs (≤5 genes), broad CNAs (>5 affected genes but less than 400), and whole-arm changes were included if detected at relapse or if in a recurrently mutated pathway. The genomic random interval (GRIN) analysis was used to identify recurrently mutated genes⁶⁵. Whole chromosome gains were excluded unless there was also a focal deletion/amplification, and CNAs connected to KMT2A-rearrangements were excluded. Alterations in unpaired samples were included if they were present in a recurrent pathway and were known cancer-associated mutations.

2D plots

SciClone⁶⁶ was used to determine clonal evolution patterns. Validated tier 1 and high-quality tier 2 and 3 SNVs were included together with manually inspected CNAs. If the variant had been studied by targeted sequencing, the VAF with the highest coverage was used and relapse variants detected in a low fraction of diagnostic cells in the targeted sequencing were also included. The minimum depth for a variant was 10 reads.

Primary Template-directed Amplification (PTA)

For 13 samples with no measurable or limited amounts of DNA ( < 0.1–3.2 ng), the “ResolveDNA Whole Genome Amplification v.1 kit” (Bioskryb Genomics, Durham, UK) was used for whole genome amplification (WGA) (Supplementary Data 12). PTA was performed according to the manufacturer’s instructions with modifications. The lysis step was performed for 12.5 min, 1400 rpm at RT (MPS-1 plate mixer, BioSan, Riga, Latvia). During clean-up, the BioSkryb ResolveDNA beads and WGA-DNA were resuspended by mixing for 2 min, 1500 rpm at RT (MPS-1, Biosan) followed by 1 min, 19xg at RT. After ethanol wash, resolved WGA-DNA was resuspended by mixing for 2 min, 1700 rpm at RT and 1 min, 19xg at RT. WGA-DNA was quantified with a dsDNA high sensitivity kit using Qubit 4 (Thermo Fisher Scientific) and fragment size determined with D5000 ScreenTape assay using the 4200 TapeStation (Agilent Technologies, Santa Clara, CA, USA). A non-targeting control and 100 pg genomic DNA were used as a negative and positive control, respectively.

Longitudinal deep-sequencing

Between 6 and 35 tier 1-3 mutations from WGS/WES data were selected per patient. Primers were designed using Primer3⁶⁷ and PrimerTK (https://github.com/stjude/PrimerTK), where costume scripts were used to check for dimer formation and an In-Silico PCR tool for off-target products and ordered from Integrated DNA Technologies (IDT, Coralville, Iowa, USA) (Supplementary Data 16). PCR was performed using Qiagen Multiplex PCR Kit (Qiagen) with 5-20 ng input DNA and products were quantified using Qubit 4 (Thermo Fisher Scientific) and prepared for sequencing using Nextera XT DNA Sample Preparation Kit, Index Kits (Illumina), and purified using AMPure XP beads (Beckman Coulter Inc., Brea, CA). 2 × 150 bp paired-end sequencing was performed using Illumina NextSeq 500, MiniSeq or MiSeq. Reads were trimmed using Trimmomatic (0.38)⁶⁸ and paired-end reads were aligned to hg19 using BWA (0.7.15)⁵⁹. PCR duplicates were marked using Picard (2.6.0, Broad Institute, Cambridge, MA, USA). Variant calling was performed using Freebayes⁶⁹ (1.1.0) (parameters: --min-alternate-fraction 0, --strand-filter 0, --p-value 1, --min-mapping-quality 50).

Dilution series to determine the sensitivity of the multiplex-PCR

Kasumi-1 (ACC-220, DSMZ, Braunschweig, Germany), carrying a homozygous TP53 (chr17:7577538, C > T) mutation was diluted with REH (ACC-22, DSMZ), lacking TP53 mutations. Cells were cultured under standard conditions and DNA was extracted using the DNeasy Blood & Tissue kit (Qiagen) and diluted to 11 ng/µl. Kasumi was diluted with REH in nine steps, mixed and incubated for 15 min before the next dilution to an expected VAF of 0.0019. This included up to four multiplex PCR replicas. Two additional TP53-mutations were found in Kasumi-1 (chr17:7577407, A > C and chr17:7577427, G > A).

Single-cell sorting and whole genome amplification

Relapse cells from P17 and P57 were thawed in 10% FBS (HyClone, Logan, UT, USA) with DNase I (1 mg/ml, Roche, Basel, Switzerland) and RPMI1640 (Thermo Fisher Scientific), 1% Penicillin-Streptomycin (Thermo Fisher Scientific), centrifuged for 5 min, 350 x g at 4 °C and resuspended in PBS + 2% FBS (HyClone). Cells were blocked with Human TruStain FcX (BioLegend, San Diego, CA, USA), according to the manufacturer’s description, and stained for 30 min at 4 °C with antibodies and isotype controls (Supplementary Data 17). Cells were washed and resuspended in PBS + 2% (HyClone) and incubated with 7-AAD (BD Biosciences, Franklin Lakes, NJ, USA) for 10 min at 4 °C before flow analysis.

Single cells were isolated by FACS Aria Fusion (BD Biosciences) with FACSDiva v8.0.2 (BD) using index sorting. The precision was set according to “A Rapid Method to Verify Single-Cell Deposition Setup for Cell Sorters”⁷⁰. Cells were sorted into twin.tec LoBind PCR plates (Eppendorf) with 1.5 µl Cell Buffer (ResolveDNAv.1, BioSkryb), using the 100 µm nozzle, covered with plastic film (Thermo Fisher Scientific), spun down (MPC-25), mixed for 15 s, 2300 rpm (MPS-1, BioSan, Riga, Latvia), spun down (MPC-25) put on dry ice and stored at −80 °C. Whole genome amplification, PCR and library preparation were performed as above (Supplementary Data 16). Libraries were sequenced using MiSeq (Illumina).

Clonal evolution visualized by fish plots

Fish plots were generated using the “fishplot” package in R”⁷¹. The clusters were determined manually based on the VAFs and some VAFs were adjusted to draw the fishplots. If available, single-cell data were used for interpretation. If samples were missing close to or after the relapse in patients with multiple relapses, we have depicted the relapse clone as if it emerges and disappears 20 days before and/or after the relapse and 30-50 days before the single/last relapse for illustrative reasons. The VAFs for SNVs in regions with CNA were corrected.

Variant allele frequencies at the MRD time points

The VAF for a given timepoint was an average of the two highest VAFs and we required that the KMT2A-r was identified. However, given that patients often are cytopenic at early time points and that the number of leukemic cells is low/undetectable during remission, we rescued the sample despite the KMT2A-r was not identified (`n = 8) if >2 variants from the major diagnostic clone and/or relapse had a VAF above our detection level (>0.002). We required that no other mutations were present at the site with a VAF > 0.002 indicating artifacts.

CRISPR/Cas9 genome editing of PRPS2

The human REH (CRL-8286) B-ALL cell line was purchased from the American Type Culture Collection (ATCC) and authenticated by STR analysis. Cells were cultured in RPMI 1640 (Gibco, 11875093) supplemented with 10% fetal bovine serum (FBS, HyClone, SH30071.03) at 37 °C with 5% CO₂. PRPS2 knockout single clones were generated using CRISPR-Cas9 and were confirmed by Next-Generation Sequencing (See Supplementary Data 18 for sgRNA sequence).

Plasmid construction and lentivirus production

PRPS2 (NM_001039091) wildtype and PRPS2^P320L were synthesized and subcloned to MSCV-IRES-GFP using EcoRI cloning site using Genescript (Piscataway, NJ, USA). Human PRPS2 wildtype or mutant PRPS2 P320L were amplified by PCR and cloned into the cl20-Elongation Factor 1 alpha (EF-1α)-internal ribosome entry site (IRES)-GFP lentiviral plasmid using NEBuilder HiFi DNA Assembly Master Mix (NEB, E2621L)⁷². Purified cl20c-IRES-GFP empty vector, cl20c-Flag-PRPS2-WT-IRES-GFP, and cl20c-Flag-PRPS2-P320L-IRES-GFP were transfected with packaging vectors into Lenti-X 293 T cells (Takara, #632180)^73,74. After 48 h, the culture media containing lentiviral particles were collected to infect REH (CRL-8286, ATCC) parental and PRPS2 KO cell lines for another 48 h. The transduction efficiencies were determined by flow cytometry. Primers used for plasmid construction can be found in Supplementary Data 18.

Western blotting

After being washed once with cold PBS buffer, the cells were lysed by RIPA lysis buffer (Thermo Scientific, #89901). Protein concentrations were determined using BCA assay kit (Thermo Scientific, #23228). 30 μg protein for each sample was subjected to western blotting. Flag-tag antibody (Invitrogen, #PA1-984B-HRP, Lot No. WC316048, 1:1000 dilution) was used for the detection of overexpressed PRPS2. Vinculin level for each sample was detected with a vinculin antibody (CST, #13901, Lot No.7, 1:1000 dilution).

Drug sensitivity assessment

CellTiter-Glo (CTG) assay (Promega, #G7573) was used to test cell viability. REH-cells were seeded into a 384-well microplate (Thermo Scientific, #242765) with 2,000 cells/well. Gradient-diluted 6-MP was then added and incubated for 72 h until the CTG-assay was performed and tested by microreader (Agilent, Biotek, Synergy H4 hybrid reader). The assay was repeated twice.

Statistics

R 4.4.0 was used for statistical calculations. Spearman correlation test was used for comparison of signature analysis at diagnosis and relapse in AML, as the data was significantly different from a normal distribution, determined by Shapiro Wilk normality test. The comparison of the number of gained mutations across KMT2A-r was tested by Kruskal-Wallis. For all two group comparisons, a 2-sided Wilcoxon rank sum test was used. For Kaplan-Meier survival estimations, Survival 3.5.8 was used.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The sequencing data generated in this study is deposited in the European Genome-phenome Archive (EGA). The sequencing data are available under restricted access as it is considered personal data and falls under the General Data Protection Regulation (GDPR), and access can be obtained by upon request from the corresponding author (Anna Hagström-Andersson, anna.hagstrom@med.lu.se) through EGAS00001008197. The raw sequencing data are protected and are not available due to data privacy laws. Source data are provided with this paper and the sequencing data generated in this study are provided in the Supplementary Information/Source Data file. Source data are provided with this paper.

References

Maloney, K. W. et al. Outcome in Children With Standard-Risk B-Cell Acute Lymphoblastic Leukemia: Results of Children’s Oncology Group Trial AALL0331. J. Clin. Oncol. 38, 602–612 (2020).
Article CAS PubMed Google Scholar
Tierens, A. et al. Mitoxantrone Versus Liposomal Daunorubicin in Induction of Pediatric AML With Risk Stratification Based on Flow Cytometry Measurement of Residual Disease. J. Clin. Oncol. 42, 2174–2185 (2024).
Article CAS PubMed Google Scholar
Pieters, R. et al. Outcome of Infants Younger Than 1 Year With Acute Lymphoblastic Leukemia Treated With the Interfant-06 Protocol: Results From an International Phase III Randomized Study. J. Clin. Oncol. 37, 2246–2256 (2019).
Article CAS PubMed Google Scholar
Attarbaschi, A. et al. Outcomes of Childhood Noninfant Acute Lymphoblastic Leukemia With 11q23/KMT2A Rearrangements in a Modern Therapy Era: A Retrospective International Study. J. Clin. Oncol. 41, 1404–1422 (2023).
Article CAS PubMed Google Scholar
van der Sluis, I. M. et al. Blinatumomab Added to Chemotherapy in Infant Lymphoblastic Leukemia. N. Engl. J. Med 388, 1572–1581 (2023).
Article PubMed Google Scholar
Maude, S. L. et al. Chimeric antigen receptor T cells for sustained remissions in leukemia. N. Engl. J. Med 371, 1507–1517 (2014).
Article PubMed PubMed Central Google Scholar
Andersson, A. K. et al. The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias. Nat. Genet 47, 330–337 (2015).
Article CAS PubMed PubMed Central Google Scholar
Agraz-Doblas, A. et al. Unraveling the cellular origin and clinical prognostic markers of infant B-cell acute lymphoblastic leukemia using genome-wide analysis. Haematologica 104, 1176–1188 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, B. et al. Negative feedback-defective PRPS1 mutants drive thiopurine resistance in relapsed childhood ALL. Nat. Med 21, 563–571 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tzoneva, G. et al. Activating mutations in the NT5C2 nucleotidase gene drive chemotherapy resistance in relapsed ALL. Nat. Med 19, 368–371 (2013).
Article CAS PubMed PubMed Central Google Scholar
Li, B. et al. Therapy-induced mutations drive the genomic landscape of relapsed acute lymphoblastic leukemia. Blood 135, 41–55 (2020).
Article PubMed PubMed Central Google Scholar
Bolouri, H. et al. The molecular landscape of pediatric acute myeloid leukemia reveals recurrent structural alterations and age-specific mutational interactions. Nat. Med 24, 103–112 (2018).
Article CAS PubMed Google Scholar
Umeda, M. et al. Integrated Genomic Analysis Identifies UBTF Tandem Duplications as a Recurrent Lesion in Pediatric Acute Myeloid Leukemia. Blood Cancer Discov. 3, 194–207 (2022).
Article CAS PubMed PubMed Central Google Scholar
Song, L. et al. PRPS2 mutations drive acute lymphoblastic leukemia relapse through influencing PRPS1/2 hexamer stability. Blood Sci. 5, 39–50 (2023).
Article PubMed Google Scholar
Andreasson, P. et al. Deletions of CDKN1B and ETV6 in acute myeloid leukemia and myelodysplastic syndromes without cytogenetic evidence of 12p abnormalities. Genes Chromosomes Cancer 19, 77–83 (1997).
Article CAS PubMed Google Scholar
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. The genetic basis of early T-cell precursor acute lymphoblastic leukaemia. Nature 481, 157–163 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Turati, V. A. et al. Chemotherapy induces canalization of cell state in childhood B-cell precursor acute lymphoblastic leukemia. Nat. Cancer 2, 835–852 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hof, J. et al. Mutations and deletions of the TP53 gene predict nonresponse to treatment and poor outcome in first relapse of childhood acute lymphoblastic leukemia. J. Clin. Oncol. 29, 3185–3193 (2011).
Article PubMed Google Scholar
Stengel, A. et al. TP53 mutations occur in 15.7% of ALL and are associated with MYC-rearrangement, low hypodiploidy, and a poor prognosis. Blood 124, 251–258 (2014).
Article CAS PubMed Google Scholar
Stanulla, M., Cave, H. & Moorman, A. V. IKZF1 deletions in pediatric acute lymphoblastic leukemia: still a poor prognostic marker?. Blood 135, 252–260 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kim, R. et al. Genetic alterations and MRD refine risk assessment for KMT2A-rearranged B-cell precursor ALL in adults: a GRAALL study. Blood 142, 1806–1817 (2023).
Article CAS PubMed Google Scholar
Demir, S. et al. Therapeutic targeting of mutant p53 in pediatric acute lymphoblastic leukemia. Haematologica 105, 170–181 (2020).
Article CAS PubMed PubMed Central Google Scholar
Oshima, K. et al. Mutational and functional genetics mapping of chemotherapy resistance mechanisms in relapsed acute lymphoblastic leukemia. Nat. Cancer 1, 1113–1127 (2020).
Article CAS PubMed PubMed Central Google Scholar
Yang, F. et al. Chemotherapy and mismatch repair deficiency cooperate to fuel TP53 mutagenesis and ALL relapse. Nat. Cancer 2, 819–834 (2021).
Article CAS PubMed Google Scholar
Scheijen, B. et al. Tumor suppressors BTG1 and IKZF1 cooperate during mouse leukemia development and increase relapse risk in B-cell precursor acute lymphoblastic leukemia patients. Haematologica 102, 541–551 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rogers, J. H. et al. Modeling IKZF1 lesions in B-ALL reveals distinct chemosensitivity patterns and potential therapeutic vulnerabilities. Blood Adv. 5, 3876–3890 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vervoort, B. M. T. et al. IKZF1 gene deletions drive resistance to cytarabine in B-cell precursor acute lymphoblastic leukemia. Haematologica 109, 3904–3917 (2024).
Cox, W. P. J. et al. Histone deacetylase inhibition sensitizes p53-deficient B-cell precursor acute lymphoblastic leukemia to chemotherapy. Haematologica 109, 1755–1765 (2024).
CAS PubMed Google Scholar
Pilheden, M. et al. Duplex Sequencing Uncovers Recurrent Low-frequency Cancer-associated Mutations in Infant and Childhood KMT2A-rearranged Acute Leukemia. Hemasphere 6, e785 (2022).
Article CAS PubMed PubMed Central Google Scholar
Tzoneva, G. et al. Clonal evolution mechanisms in NT5C2 mutant-relapsed acute lymphoblastic leukaemia. Nature 553, 511–514 (2018).
Article CAS PubMed PubMed Central Google Scholar
Matsuo, H. et al. Recurrent CCND3 mutations in MLL-rearranged acute myeloid leukemia. Blood Adv. 2, 2879–2889 (2018).
Article CAS PubMed PubMed Central Google Scholar
Harrison, C. J. et al. Cytogenetics of childhood acute myeloid leukemia: United Kingdom Medical Research Council Treatment trials AML 10 and 12. J. Clin. Oncol. 28, 2674–2681 (2010).
Article PubMed Google Scholar
van der Ham, C. G. et al. Mutational mechanisms in multiply relapsed pediatric acute lymphoblastic leukemia. Leukemia 38, 2366–2375 (2024).
Article PubMed PubMed Central Google Scholar
Lambo, S. et al. A longitudinal single-cell atlas of treatment response in pediatric AML. Cancer Cell 41, 2117–2135 e12 (2023).
Article CAS PubMed Google Scholar
Gunnarsson, R. et al. Single base substitution mutational signatures in pediatric acute myeloid leukemia based on whole genome sequencing. Leukemia 35, 1485–1489 (2021).
Article CAS PubMed PubMed Central Google Scholar
Waanders, E. et al. Mutational landscape and patterns of clonal evolution in relapsed pediatric acute lymphoblastic leukemia. Blood Cancer Discov. 1, 96–111 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sosa, M. S., Bragado, P. & Aguirre-Ghiso, J. A. Mechanisms of disseminated cancer cell dormancy: an awakening field. Nat. Rev. Cancer 14, 611–622 (2014).
Article CAS PubMed PubMed Central Google Scholar
Conter, V. et al. Molecular response to treatment redefines all prognostic factors in children and adolescents with B-cell precursor acute lymphoblastic leukemia: results in 3184 patients of the AIEOP-BFM ALL 2000 study. Blood 115, 3206–3214 (2010).
Article CAS PubMed Google Scholar
Popov, A. et al. Prognostic value of minimal residual disease measured by flow-cytometry in two cohorts of infants with acute lymphoblastic leukemia treated according to either MLL-Baby or Interfant protocols. Leukemia 34, 3042–3046 (2020).
Article PubMed Google Scholar
Karol, S. E. et al. Clinical impact of minimal residual disease in blood and bone marrow of children with acute myeloid leukemia. Blood Adv. 7, 3651–3657 (2023).
Article PubMed PubMed Central Google Scholar
Afrin, S. et al. Targeted Next-Generation Sequencing for Detecting MLL Gene Fusions in Leukemia. Mol. Cancer Res 16, 279–285 (2018).
Article CAS PubMed Google Scholar
Delsing Malmberg, E. et al. Accurate and Sensitive Analysis of Minimal Residual Disease in Acute Myeloid Leukemia Using Deep Sequencing of Single Nucleotide Variations. J. Mol. Diagn. 21, 149–162 (2019).
Article CAS PubMed Google Scholar
Wood, B. et al. Measurable residual disease detection by high-throughput sequencing improves risk stratification for pediatric B-ALL. Blood 131, 1350–1359 (2018).
Article CAS PubMed PubMed Central Google Scholar
Abrahamsson, J. et al. Improved outcome after relapse in children with acute myeloid leukaemia. Br. J. Haematol. 136, 229–236 (2007).
Article PubMed Google Scholar
Wang, J. et al. CREST maps somatic structural variation in cancer genomes with base-pair resolution. Nat. Methods 8, 652–654 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chen, X. et al. CONSERTING: integrating copy-number analysis with structural-variation detection. Nat. Methods 12, 527–530 (2015).
Article CAS PubMed PubMed Central Google Scholar
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 43, 11 10 1-11 10 33 (2013).
Larson, D. E. et al. SomaticSniper: identification of somatic point mutations in whole genome sequencing data. Bioinformatics 28, 311–317 (2012).
Article MathSciNet CAS PubMed Google Scholar
Koboldt, D. C. et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res 22, 568–576 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fan, Y. et al. MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data. Genome Biol. 17, 178 (2016).
Article PubMed PubMed Central Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
Article CAS PubMed Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38, e164 (2010).
Article PubMed PubMed Central Google Scholar
Talevich, E., Shain, A. H., Botton, T. & Bastian, B. C. CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing. PLoS Comput Biol. 12, e1004873 (2016).
Article ADS PubMed PubMed Central Google Scholar
Cameron, D. L. et al. GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly. Genome Res 27, 2050–2060 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, X. et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32, 1220–1222 (2016).
Article CAS PubMed Google Scholar
Rausch, T. et al. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics 28, i333–i339 (2012).
Article CAS PubMed PubMed Central Google Scholar
Olsson, L. et al. The genetic landscape of paediatric de novo acute myeloid leukaemia as defined by single nucleotide polymorphism array and exon sequencing of 100 candidate genes. Br. J. Haematol. 174, 292–301 (2016).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Chong, Z. et al. novoBreak: local assembly for breakpoint detection in cancer genomes. Nat. Methods 14, 65–67 (2017).
Article CAS PubMed Google Scholar
Wala, J. A. et al. SvABA: genome-wide detection of structural variants and indels by local assembly. Genome Res 28, 581–591 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mayrhofer, M., DiLorenzo, S. & Isaksson, A. Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue. Genome Biol. 14, R24 (2013).
Article PubMed PubMed Central Google Scholar
Blokzijl, F., Janssen, R., van Boxtel, R. & Cuppen, E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med 10, 33 (2018).
Article PubMed PubMed Central Google Scholar
Shuai, S. et al. Combined burden and functional impact tests for cancer driver discovery using DriverPower. Nat. Commun. 11, 734 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Pounds, S. et al. A genomic random interval model for statistical analysis of genomic lesion data. Bioinformatics 29, 2088–2095 (2013).
Article CAS PubMed PubMed Central Google Scholar
Miller, C. A. et al. SciClone: inferring clonal architecture and tracking the spatial and temporal patterns of tumor evolution. PLoS Comput Biol. 10, e1003665 (2014).
Article PubMed PubMed Central Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365–386 (2000).
CAS PubMed Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907 (2012).
Rodrigues, O. R. & Monard, S. A rapid method to verify single-cell deposition setup for cell sorters. Cytom. A 89, 594–600 (2016).
Article CAS Google Scholar
Miller, C. A. et al. Visualizing tumor evolution with the fishplot package for R. BMC Genomics 17, 880 (2016).
Article PubMed PubMed Central Google Scholar
Li, Y. et al. PAX5 epigenetically orchestrates CD58 transcription and modulates blinatumomab response in acute lymphoblastic leukemia. Sci. Adv. 8, eadd6403 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hanawa, H. et al. Comparison of various envelope proteins for their ability to pseudotype lentiviral vectors and transduce primitive hematopoietic cells from human blood. Mol. Ther. 5, 242–251 (2002).
Article CAS PubMed Google Scholar
Li, Y. et al. Germline RUNX1 variation and predisposition to childhood acute lymphoblastic leukemia. J. Clin. Invest. 131, e147898 (2021).

Download references

Acknowledgements

This research was supported by The Swedish Childhood Cancer Fund (20121-0046, AKHA), The Swedish Cancer Society (20-1036, AKHA), The Swedish Research Council (2019-01446, AKHA), The Knut and Alice Wallenberg Foundation (2014-0098, AKHA), The Crafoord Foundation (2726-001, AKHA), The Nilsson-Ehle Donations, Ellen Bachrach Memorial Fund 2022-04, Governmental Funding of Clinical Research within the National Health Service, all AKHA. Sequencing was performed either by the SNP&SEQ Technology Platform, Uppsala, the National Genomics Infrastructure (NGI) Sweden and Science for Life Laboratory or at The Center for Translational Genomics, Lund University and Clinical Genomics Lund, SciLifeLab. The SNP&SEQ Platform is also supported by the Swedish Research Council and the Knut and Alice Wallenberg Foundation. We thank the Center for Advanced Genome Engineering (S. Miller, A. Loughran and T. Caera) for their technical support in performing experiments included in this study. We thank Elias Levy Itshak Salfati and Pratima Nallagatla for bioinformatics support.

Funding

Open access funding provided by Lund University.

Author information

Authors and Affiliations

Division of Clinical Genetics, Department of Laboratory Medicine, Lund University, Lund, Sweden
Louise Ahlgren, Mattias Pilheden, Helena Sturesson, Minjun Yang, Varsha Singh & Anna K. Hagström-Andersson
Department of Pathology, St. Jude Children’s Research Hospital, Memphis, TN, USA
Guangchun Song, Michael P. Walsh & Jing Ma
Department of Pharmacy and Pharmaceutical Sciences, St Jude Children’s Research Hospital, Memphis, TN, USA
Maud Maillard, Huanbin Zhao & Jun J. Yang
Center for Applied Bioinformatics, St Jude Children’s Research Hospital, Memphis, TN, USA
Zhongshan Cheng
Childhood Cancer Center, Skåne University Hospital, Lund, Sweden
Anders Castor & Cornelis Jan Pronk
Department of Clinical Immunology, National University Hospital, Rigshospitalet, Copenhagen, Denmark
Hanne Vibeke Marquart
Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Hanne Vibeke Marquart
Department of Paediatrics and Adolescent Medicine, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
Birgitte Lausen
Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
Pauline Schneider, Rob Pieters & Ronald W. Stam
Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
Gisela Barbany
Department of Oncology and Pathology, Karolinska Institutet, Stockholm, Sweden
Katja Pokrovskaja Tamm
Department of Pediatrics, Institution for Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Jonas Abrahamsson
Tampere Center for Child, Adolescent and Maternal Health Research, Faculty of Medicine and Health Technology, Tampere University, and Tays Cancer Center, Tampere University Hospital, Tampere, Finland
Olli Lohi
Region Västra Götaland, Sahlgrenska University Hospital, Department of Clinical Chemistry, Gothenburg, Sweden
Linda Fogelstrand
Department of Laboratory Medicine, Institute of Biomedicine, University of Gothenburg, Gothenburg, Sweden
Linda Fogelstrand
Josep Carreras Leukemia Research Institute and School of Medicine, University of Barcelona, Barcelona, Spain
Pablo Menendez
Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Pablo Menendez
Department of Biomedicine, School of Medicine, University of Barcelona, Barcelona, Spain
Pablo Menendez
Spanish Cancer Network (CIBERONC), ISCIII, Madrid, Spain
Pablo Menendez
Pediatric Cancer Centre Barcelona-Sant Joan de Deu Hospital (PCCB-SJD), Barcelona, Spain
Pablo Menendez
Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
Jinghui Zhang
Department of Experimental Medical Science, Medical Structural Biology, Lund University, Lund, Sweden
Karin Lindkvist-Petersson
Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
Tanja A. Gruber
Center for Translational Genomics, Lund University, Lund, Sweden
Anna K. Hagström-Andersson

Authors

Louise Ahlgren
View author publications
Search author on:PubMed Google Scholar
Mattias Pilheden
View author publications
Search author on:PubMed Google Scholar
Helena Sturesson
View author publications
Search author on:PubMed Google Scholar
Guangchun Song
View author publications
Search author on:PubMed Google Scholar
Michael P. Walsh
View author publications
Search author on:PubMed Google Scholar
Minjun Yang
View author publications
Search author on:PubMed Google Scholar
Maud Maillard
View author publications
Search author on:PubMed Google Scholar
Huanbin Zhao
View author publications
Search author on:PubMed Google Scholar
Zhongshan Cheng
View author publications
Search author on:PubMed Google Scholar
Varsha Singh
View author publications
Search author on:PubMed Google Scholar
Anders Castor
View author publications
Search author on:PubMed Google Scholar
Cornelis Jan Pronk
View author publications
Search author on:PubMed Google Scholar
Hanne Vibeke Marquart
View author publications
Search author on:PubMed Google Scholar
Birgitte Lausen
View author publications
Search author on:PubMed Google Scholar
Pauline Schneider
View author publications
Search author on:PubMed Google Scholar
Gisela Barbany
View author publications
Search author on:PubMed Google Scholar
Katja Pokrovskaja Tamm
View author publications
Search author on:PubMed Google Scholar
Jonas Abrahamsson
View author publications
Search author on:PubMed Google Scholar
Olli Lohi
View author publications
Search author on:PubMed Google Scholar
Linda Fogelstrand
View author publications
Search author on:PubMed Google Scholar
Pablo Menendez
View author publications
Search author on:PubMed Google Scholar
Rob Pieters
View author publications
Search author on:PubMed Google Scholar
Jinghui Zhang
View author publications
Search author on:PubMed Google Scholar
Karin Lindkvist-Petersson
View author publications
Search author on:PubMed Google Scholar
Jun J. Yang
View author publications
Search author on:PubMed Google Scholar
Tanja A. Gruber
View author publications
Search author on:PubMed Google Scholar
Ronald W. Stam
View author publications
Search author on:PubMed Google Scholar
Jing Ma
View author publications
Search author on:PubMed Google Scholar
Anna K. Hagström-Andersson
View author publications
Search author on:PubMed Google Scholar

Contributions

L.A. and A.K.H.A designed the study; L.A., H.S., and V.S. performed experiments; M.M., H. Z., and J.J.Y. performed functional assays, K.L.P. performed structural modeling of PRSP2; L.A., M.P., M.Y., M.P.W., G.S., J.M., Z.C., and J.Z. performed computational data analyses; L.A., M.P. and A.K.H.A analyzed sequencing data. A.C., C.J.P., P.S., G.B., K.P.T., J.A., L.F., H.V.M., B.L., R.P., T.A.G., R.W.S., P.M. and O.L. collected patient material and clinical data; L.A., M.P., and A.K.H.A wrote the manuscript, and all other authors performed critical reading of the manuscript.

Corresponding author

Correspondence to Anna K. Hagström-Andersson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Kara Davis, who co-reviewed with Chiara Pirillo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1-18

Reporting Summary

Transparent Peer review file

Source data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ahlgren, L., Pilheden, M., Sturesson, H. et al. The genomic landscape of relapsed infant and childhood KMT2A-rearranged acute leukemia. Nat Commun 16, 8964 (2025). https://doi.org/10.1038/s41467-025-64190-8

Download citation

Received: 07 June 2024
Accepted: 02 September 2025
Published: 08 October 2025
Version of record: 08 October 2025
DOI: https://doi.org/10.1038/s41467-025-64190-8