Machine learning in paediatric haematological malignancies: a systematic review of prognosis, toxicity and treatment response models

Gurumurthy, Gerard; Gurumurthy, Juditha; Gurumurthy, Samantha

doi:10.1038/s41390-024-03494-9

Download PDF

Systematic Review
Open access
Published: 31 August 2024

Machine learning in paediatric haematological malignancies: a systematic review of prognosis, toxicity and treatment response models

Gerard Gurumurthy¹,
Juditha Gurumurthy² &
Samantha Gurumurthy³

Pediatric Research volume 97, pages 524–531 (2025)Cite this article

3238 Accesses
6 Citations
Metrics details

Abstract

Background

Machine Learning (ML) has demonstrated potential in enhancing care in adult oncology. However, its application in paediatric haematological malignancies is still emerging, necessitating a comprehensive review of its capabilities and limitations in this area.

Methods

A literature search was conducted through Ovid. Studies included focused on ML models in paediatric patients with haematological malignancies. Studies were categorised into thematic groups for analysis.

Results

Twenty studies, primarily on leukaemia, were included in this review. Studies were organised into thematic categories such as prognoses, treatment responses and toxicity predictions. Prognostic studies showed AUC scores between 0.685 and 0.929, indicating moderate-high predictive accuracy. Treatment response studies demonstrated AUC scores between 0.840 and 0.875, reflecting moderate accuracy. Toxicity prediction studies reported high accuracy with AUC scores from 0.870 to 0.927. Only five studies (25%) performed external validation. Significant heterogeneity was noted in ML tasks, reporting formats, and effect measures across studies, highlighting a lack of standardised reporting and challenges in data comparability.

Conclusion

The clinical applicability of these ML models remains limited by the lack of external validation and methodological heterogeneity. Addressing these challenges through standardised reporting and rigorous external validation is needed to translate ML from a promising research tool into a reliable clinical practice component.

Impact

Key message: Machine Learning (ML) significantly enhances predictive models in paediatric haematological cancers, offering new avenues for personalised treatment strategies. Future research should focus on developing ML models that can integrate with real-time clinical workflows.
Addition to literature: Provides a comprehensive overview of current ML applications and trends. It identifies limitations to its applicability, including the limited diversity in datasets, which may affect the generalisability of ML models across different populations.
Impact: Encourages standardisation and external validation in ML studies, aiming to improve patient outcomes through precision medicine in paediatric haematological oncology.

Construction and validation of a risk prediction model for complications in patients with acute leukemia based on machine learning

Article Open access 19 November 2025

Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms

Article Open access 10 January 2023

Development and validation of a cuproptosis-related prognostic model for acute myeloid leukemia patients using machine learning with stacking

Article Open access 02 February 2024

Introduction

Machine Learning (ML), a subset of artificial intelligence, is capable of identifying complex patterns within large datasets. By leveraging advanced algorithms, ML can facilitate significant advancements in diagnostics, prognostics, and therapeutic decision-making. Despite its potential, the application of ML in healthcare remains largely limited to adult oncology, radiology, and pathology, where it has shown promise in enhancing diagnostic accuracy and treatment planning.^1,2,3,4 However, its utilisation in paediatric haematological malignancies is still in its infant stages, primarily due to the unique challenges and complexities associated with paediatric cancers.

Paediatric haematological cancers present an area where ML can be beneficially utilised. Children with haematological malignancies exhibit diverse biological behaviours and responses to treatment, necessitating highly individualised therapeutic approaches.⁵ The heterogeneity of these diseases, coupled with the varying responses to existing therapies, underscores the need for a nuanced approach that balances effective treatment with the minimisation of long-term adverse effects.^6,7,8 ML, with its ability to process and analyse vast amounts of data, offers the potential to develop more precise and personalised treatment strategies, thereby improving prognosis and reducing treatment-related toxicity in paediatric patients.

The European Union’s Beating Cancer Plan underscores the importance of integrating advanced technologies, including ML, into cancer care.⁹ This initiative aims to exploit the predictive/ classification power of ML to enhance cancer prevention, diagnosis, and treatment across Europe. In the context of paediatric haematological malignancies, the potential benefits of ML are particularly significant. The ability to predict disease progression, treatment response, and adverse effects with greater accuracy can transform clinical care, enabling more targeted and effective interventions. It is therefore necessary to address the current limitations of ML, including the need for diverse and representative datasets, standardised reporting, and rigorous external validation. This systematic review aims to provide a comprehensive overview of the current applications of ML in paediatric haematological malignancies, assessing its potential to enhance diagnostic accuracy, prognostic predictions, and treatment strategies.

Methods

This review was conducted in accordance with the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines and was registered with PROSPERO (CRD42024507811). A comprehensive systematic review of the literature was carried out in February 2024 using the OVID platform. The databases searched included AMED, EMBASE, MEDLINE, and Emcare. The detailed search strategy is provided in Supplementary Table S1.

Inclusion and exclusion criteria

To be included in the review, studies had to focus on the application of ML in paediatric haematological cancers, detailing the type of ML model and its methodology. Only original research articles were considered; reviews, case reports, and other non-original research articles were excluded. Additionally, only studies exclusively involving paediatric populations were included; mixed studies with both paediatric and adult cohorts were excluded. Articles were limited to those published in English.

Data extraction and synthesis

Analysis of records were conducted by two authors independently. Data extracted from each study included the specific type of haematological cancer investigated, the tasks performed by the ML program, the number of patients involved, the ML method employed, input and output variables, the method of cross-validation used, and any external validation performed. Studies were then grouped based on their primary objectives or outcomes related to paediatric haematological cancers, such as prognosis, treatment response, and toxicity models. This thematic grouping facilitated a narrative synthesis to highlight trends, patterns, and gaps in the current research. A minimum of three studies was required to synthesise a theme, ensuring sufficient data to capture the scope and trends of current research efforts.

Quality assessment

The quality of the studies was assessed using appropriate tools. For studies investigating prognostic ML models, the Quality in Prognosis Studies (QUIPS)¹⁰ checklist was utilised. For all other thematic groups, the Newcastle-Ottawa Scale (NOS)¹¹ was used to assess the quality, given the observational nature of the included studies.

Analysis

Due to the varying nature of the ML tasks, lack of uniform reporting formats, and diverse effect measures, formal meta-analyses were deemed unfeasible. Instead, heterogeneity was addressed qualitatively by describing differences in study populations, methodologies, outcomes, and effect measures.

Results

Searches conducted through the available databases in Ovid yielded a total of 711 results (Fig. 1). Of which, 20 studies that applied ML in paediatric haematological malignancies met the inclusion criteria for this review (Table 1).^{12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31}

Table 1 Characteristics of Included Studies Exploring Machine Learning in Paediatric Haematological Malignancies.

Full size table

The included studies primarily focused on leukaemia, with specific emphasis on acute lymphoblastic leukaemia (ALL) in 13 studies, acute myeloid leukaemia (AML) in four studies, and an unspecified subtype in 1 study. Additionally, two studies addressed lymphoma. The most commonly used ML methods and algorithms were Random Forest (RF, n = 8), Least Absolute Shrinkage and Selection Operator (LASSO, n = 6), Gradient Boosting Model (GBM, n = 4), and Support Vector Machines (SVM, n = 4). Note that multiple papers utilised more than one ML method in a single study.

Cross-validation techniques were employed in 16 (80%) of the studies, including methods such as 5- to 1000-fold cross-validation, leave-one-out cross-validation, training-versus-testing sets, and C-index calculations. External validation was performed in 5 (25%) of all included studies.

Prognosis and relapse/recurrence studies

Eight studies utilised ML to predict disease outcomes in paediatric haematological malignancies.^{12,13,14,15,16,17,18,19} All studies focused on leukaemia, with five addressing ALL, two on AML, and one on an unspecified subtype. All studies were assessed as “low risk” of bias using the Quality in Prognosis Studies (QUIPS) checklist, which evaluates study participation, attrition, prognostic factor measurement, outcome measurement, study confounding, and statistical analysis,¹⁰ indicating good overall study quality in this category.

Survival analyses employing genetic data from databases such as TARGET were the most common methodology in this group. For example, one study identified key long non-coding RNAs (LncRNAs) associated with AML prognosis using LASSO Cox analysis, reporting Area Under the Curve (AUC) values of 0.701, 0.704, and 0.696 for 1-, 3-, and 5-year survival, respectively.¹⁵ AUC values below 0.50 indicate poor predictability, values between 0.51 and 0.70 indicate relatively poor accuracy, values between 0.71 and 0.90 indicate moderate accuracy, and values above 0.90 indicate high accuracy and strong discrimination capability.³² Notably, these findings were externally validated using comparative data from The Cancer Genome Atlas (TCGA), although with a lower concordance to the developed model. Overall, four (50%) of the studies in this category were externally validated. For instance, one study using the RF algorithm with 10-fold Monte Carlo cross-validation to predict relapse in ALL achieved an AUC of 0.901. The results were externally validated against an independent test set of 84 patients, demonstrating the robustness and potential clinical applicability of the predictive model. The use of external validation suggests a strong reinforcement of the predictive models’ robustness and applicability in clinical settings. This process, such as using data from TCGA, underscores the potential of these models to generalise across different datasets, enhancing their reliability for clinical prognosis and treatment decision-making in paediatric leukaemia cases.

The studies showed a range of AUC scores from 0.685 to 0.929, indicating a wide variation in model performance. This heterogeneity could be attributed to differences in study design, including varying numbers of patients (range 156–1693) and primary endpoints (e.g., 3-year overall survival vs. 5-year overall survival). Seven (88%) of the studies used AUC as a primary measure of predictive performance. All studies used either LASSO (n = 4) or RF (n = 3) methods. When grouped by ML method, LASSO models had AUC scores ranging from 0.685 to 0.898, indicating low to moderate accuracy, while RF models had AUC scores ranging from 0.803 to 0.929, indicating moderate to high accuracy. These results suggest that RF techniques may offer marginally superior predictive performance compared to LASSO.

Despite the promise shown by these models, limitations include the use of genetic data from publicly available databases and a lack of relevant paediatric cohort validation. One group of authors highlighted the need for future research to employ more prospective paediatric cohorts due to the limitations associated with using public databases.¹⁵

In summary, these studies highlight the significant potential of ML methods, particularly RF and LASSO, in predicting disease outcomes in paediatric leukaemia. The variation in AUC scores underscores the importance of strategic ML method selection, reflecting its role in study outcome heterogeneity. These findings highlight the need for a nuanced approach in selecting ML techniques, considering not only AUC scores but also factors like model interpretability and computational demands, to enhance predictive precision in leukaemia prognosis.

Treatment response studies

Five studies investigated the use of ML to predict treatment response in paediatric haematological malignancies, including three studies on ALL and two on AML.^{20,21,22,23,24} Four of these studies focused on classification tasks. All studies scored 6 or more on the Newcastle-Ottawa Scale (NOS), indicating a generally high standard of methodological quality and reliability in their findings.

In one study, a ten-gene DNA-damage response gene expression signature (CalDDR-GEx10 score) was used to predict responses to gemtuzumab ozogamicin (GO) in paediatric AML patients. The input variables included gene expression levels of 18 genes in DNA-damage response pathways. Patients with high CalDDR-GEx10 scores had lower complete remission (CR) rates and worse event-free survival when treated with GO. This score specifically predicted responses to calicheamicin-induced DNA damage, rather than general chemotherapy effects, with a sensitivity of 72.7%, specificity of 63.6%, and a Positive Predictive Value (PPV) of 61.1%.²⁰ Another study employed ML techniques, including k-nearest neighbours (K-NN), SVM, and RF, to RNA sequencing data to predict CR in paediatric AML patients post-induction therapy.²² The best result, achieved using a K-NN model with 50 genes, yielded an AUC of 0.812. Both studies were able to predict CR based on genetic data through the utilisation of ML and were cross-validated, highlighting the potential of ML and gene expression signatures in personalised medicine for cancer treatment.

Three (60%) of the studies in this category used AUC as a measure of their models’ ability to predict treatment response, with scores ranging from 0.840 to 0.875, indicating moderate accuracy. Despite the use of different ML algorithms (GBM, K-NN, and Decision Tree), the studies showed similar patient sizes (range 241–473) and endpoints, contributing to low heterogeneity in the evaluation of treatment response prediction. This consistency suggests a reliable evaluation of treatment response prediction across these studies.

However, none of these studies achieved a high accuracy AUC model ( > 0.900), indicating that while the models were moderately effective, they did not reach the threshold of high accuracy. Additionally, none of the studies conducted external validation, which limits the clinical utility of these models. Prospective studies with external validation are needed to assess the impact of these ML models on treatment decision-making and patient outcomes. Despite these limitations, the findings support the potential of ML to enhance personalised medicine in this field.

Treatment toxicity studies

ML was used to predict adverse treatment effects in five studies.^{25,26,27,28,29} Three studies focused on ALL and two on lymphoma. The Newcastle-Ottawa Scale (NOS) was used to assess the quality of these studies, with all scoring six or more, indicating high methodological quality.

One study explored the relationship between genetic variations and treatment-related adverse effects (TRAEs) in paediatric patients with ALL undergoing methotrexate therapy. It found a significant association between the SLC19A1 (c.80 G > A) genotype and increased TRAEs, with an odds ratio (OR) of 5.71 (p < 0.01). Multinomial logistic regression and multifactor dimensionality reduction analysis supported this association, confirming the genotype’s strong correlation with TRAEs.²⁶ Another study also focused on methotrexate therapy, using ML to predict neutropenia and fever associated with high-dose methotrexate treatment in paediatric B-ALL. The best model, using a combined RF with Adaptive Synthetic (ADASYN) resampling, achieved an AUC of 0.870 to 0.927, sensitivity of 0.916–0.935, and specificity of 0.920–0.924.²⁷

In another study, CT images were used to predict late TRAEs. A deep learning model demonstrated high concordance with manual human analysis, evidenced by Dice scores greater than 0.950 and a K-statistic of 1.00. Notably, once trained, the model segmented body composition from CT datasets in under a second, highlighting the potential of ML models to rapidly and accurately process extensive datasets. Validated against external manual analysis, this model shows promise for clinical application due to its capability to deliver rapid and reliable results.²⁸

The studies varied widely in their statistical analyses, making it difficult to comment on heterogeneity. Only two studies used AUC as a measure of effect. These AUC values were 0.870 (moderate accuracy) and 0.927 (high accuracy), suggesting strong predictive capabilities of ML models in this context.

The primary limitation of these studies is the lack of uniform reporting of effect measures, which hampers the ability to review heterogeneity and draw robust conclusions. Additionally, the sample sizes in these studies (range 20 to 200) were smaller compared to other categories, limiting the statistical power to detect significant associations. Moreover, translating these findings into clinical practice requires validation in larger, multi-centre studies to confirm their utility in predicting treatment-related toxicities. Only one of these studies included external validation, underscoring the need for further validation efforts.

Others: disease susceptibility & diagnosis studies

This review identified two studies focused on developing predictive models for disease susceptibility and diagnostics in paediatric haematological malignancies.^30,31 These studies provide insights into the early application of ML in identifying risk factors and diagnostic markers.

One study employed several ML algorithms, including Classification and Regression Tree (CART), RF, GBM, and C5.0 decision tree, to identify key attributes influencing ALL susceptibility.³⁰ Platelet count was identified as a crucial predictor, and the CART algorithm demonstrated a high model accuracy of 99.8%. However, this study lacked external validation, which limits the generalisability of the findings and highlights the need for further investigation in more varied and larger cohorts.

Similarly, the second study within this group also utilised ML models for disease susceptibility but did not perform external validation. The lack of validation is a significant limitation as it prevents the confirmation of the models’ applicability in different clinical settings. Despite this, the preliminary findings suggest that ML can identify important predictive factors for disease susceptibility.

Overall, while the limited number of studies prevents a comprehensive thematic analysis, these findings indicate the potential of ML in enhancing early disease detection and risk assessment in paediatric haematological cancers. The absence of external validation across both studies underscores the need for further research to ensure the reliability and practical utility of these ML models.

Discussion

The review reveals a promising trend of ML models achieving moderate to high accuracy across the examined thematic categories. ML methods such as RF and LASSO have emerged as effective tools in paediatric haematological malignancies, as reflected in their prevalence across the studies reviewed. These studies demonstrate a strong emphasis on predictive tasks, highlighting a growing interest in using ML for prognosis and treatment outcome prediction. Most research thus far lies in prognosis models, with further research warranted in diagnosis and treatment toxicity prediction models. An adequate number of studies exist in treatment response studies. It is crucial, however, to assess the real-world applicability of these findings through external validation, considering the diverse methodologies and sample sizes across studies.^33,34

The lack of external validation in many studies is a significant limitation that prevents the replication and generalisation of ML models across different datasets. The heterogeneity of the data collected, including variations in patient populations, data sources, and ML methodologies, complicates the replication of these models. To address this issue, future studies should focus on standardising data collection methods and reporting metrics. The need for standardised reporting guidelines, such as the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD),³⁵ is highlighted in a similar review of the literature.³⁶ Moreover, the limited clinical deployment of ML algorithms, with many studies showing limited clinical applicability, is a common criticism.^37,38,39

This review also illustrates the infancy of ML application in this field, marked by the limited number of studies included in most thematic categories identified. The initial literature search yielded over 20 abstract reports that, despite being excluded due to not meeting inclusion criteria, indicate a growing application of ML in paediatric haematology with vast use across the field. For example, a study using Prediction Analysis of Microarrays (PAM) to identify paediatric patients with B-ALL with a Ph-like signature for better clinical intervention employed ML on gene expression profiles from 811 patients, leading to a 15-gene classifier that showed high sensitivity (93.0%) and specificity (89.7%) in tests.⁴⁰ The classifier was also able to identify genomic lesions linked to Ph-like ALL, associated with poor clinical outcomes. The findings suggest that integrating this classifier in clinical practice could help identify patients for targeted therapy, potentially improving treatment outcomes. The numerous abstracts noted in the literature search highlight the rapidly growing application of ML in paediatric haematological cancers. As these datasets grow, they offer new opportunities for applying novel ML approaches, potentially transforming the field.

The review highlights the potential of ML to enhance patient care by providing clinicians and health professionals with data-driven insights that can inform diagnostic and treatment decisions. While promising, the integration of ML in clinical practice should support, not replace, healthcare providers. For instance, ML algorithms have achieved 97.0% accuracy in identifying leukaemia from peripheral blood smears, thereby supporting clinical investigations.⁴¹ Furthermore, the integration of ML-enhanced technologies such as the Countess 3 Automated Cell Counter in bone marrow transplant labs exemplifies a shift toward more precise and efficient diagnostic processes, showing notable improvements over traditional manual methods.^42,43 This trend is supported by a previous reporting of methodology that calls for strong evaluation frameworks to measure the actual impact of ML on patient outcomes, thus ensuring its role as a complement to, not a replacement for, clinical decision-making.⁴⁴

Comparing findings between paediatric and adult/mixed studies reveals key insights. In adult/mixed cohorts, ML applications show significant improvements with AUC ranges of 0.71 – 0.93 for prognosis/relapse prediction,^45,46 0.85 – 0.97 for treatment response,^22,47,48 and 0.59 – 0.90 in toxicity predictions.^49,50 These studies have a superior AUC for prognosis/ relapse and treatment response predictions as compared to the paediatric cohort. The superior AUC in adult models highlights their robustness, likely due to larger sample sizes and more extensive datasets. Adult studies may also benefit from standardised methodologies and larger, diverse cohorts, contributing to increased generalisability. Conversely, paediatric studies face challenges such as smaller sample sizes and heterogeneous designs, leading to broader AUC ranges and reduced generalisability.

The scope of this review is narrowed by the predominance of studies focused on leukaemia, specifically ALL and AML, with only two studies extending to non-leukemic haematological malignancies. This lack of diversity within the spectrum of paediatric haematological cancers limits our capacity to generalise the findings of ML across the broader field. Consequently, while our review suggests substantial advancements in the ML-driven management of leukaemia, the translatability of these insights to other haematological conditions remains to be ascertained. With only a small number of studies employing external validations, we are unable to comment on the feasibility of implementing these ML algorithms in the current clinical setting. This underscores an imperative for future research to encompass a wider range of haematological disorders, thus enhancing the robustness and clinical relevance of ML prognostic, diagnostic, and treatment response models in paediatric haematology.

Addressing these challenges of methodological heterogeneity and limited clinical deployment is crucial for the implementation of ML in paediatric malignancies.⁵¹ The expanding datasets in this domain offer an opportunity for applying novel ML approaches. However, increased standardisation in study designs and reporting standards, like the TRIPOD guidelines mentioned above, is essential to achieve this. Future research should focus on prospective studies and fostering interdisciplinary collaboration to develop and implement clinically relevant ML tools. Moreover, integrating ML with clinical workflows and validating these models in diverse, real-world settings will be vital in ensuring their practical utility and improving outcomes for children with cancer.

Conclusion

This systematic review highlights the growing role of ML in paediatric haematological malignancies, demonstrating its potential to significantly enhance diagnostic accuracy, prognostic predictions, and treatment strategies. Despite moderate to high accuracy achieved by ML models, the clinical applicability remains constrained due to the lack of external validation and methodological heterogeneity. Addressing these challenges through larger, diverse datasets, standardised reporting, and robust external validation is crucial for translating ML from a promising research tool into a reliable component of clinical practice. This advancement could lead to more precise and personalised treatment approaches, ultimately improving outcomes for children with cancer.

References

Bertsimas, D. et al. Machine learning in oncology: methods, applications, and challenges. JCO Clin. Cancer Inf. 4, 885–894 (2020).
Google Scholar
Nardini, C. Machine learning in oncology: a review. Ecancermedicalscience 14, 1065 (2020). Published 2020 Jun 30.
PubMed PubMed Central Google Scholar
Nagy, M. et al. Machine learning in oncology: what should clinicians know? JCO Clin. Cancer Inf. 4, 799–810 (2020).
Google Scholar
Hong, J. et al. System for high-intensity evaluation during radiation therapy (SHIELD-RT): A prospective randomized study of machine learning-directed clinical evaluations during radiation and chemoradiation. J. Clin. Oncol. 38, 3652–3661 (2020).
PubMed Google Scholar
Widemann, B. Advances, challenges and progress in pediatric hematology and oncology. Curr. Opin. Pediatr. 35, 39–40 (2023).
PubMed Google Scholar
Rosenquist Ret al. Novel precision medicine approaches and treatment strategies in hematological malignancies. J. Intern. Med. 294, 413–436 (2023).
CAS PubMed Google Scholar
Duncavage, E. et al. Genomic profiling for clinical decision making in myeloid neoplasms and acute leukemia. Blood 140, 2228–2247 (2022).
CAS PubMed PubMed Central Google Scholar
Khoury, J. et al. The 5th edition of the World Health Organization classification of haematolymphoid tumours: myeloid and histiocytic/dendritic neoplasms. Leukemia 36, 1703–1719 (2022).
PubMed PubMed Central Google Scholar
European Commission. Europe’s Beating Cancer Plan 2023. 2024. https://health.ec.europa.eu/system/files/2022-02/eu_cancer-plan_en_0.pdf.
Cochrane Method. QUIPS tool. 2024. https://methods.cochrane.org/sites/methods.cochrane.org.prognosis/files/uploads/QUIPS%20tool.pdf.
Ottawa Hospital Research Institute. The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses. 2024. https://www.ohri.ca/programs/clinical_epidemiology/oxford.asp.
He, X. et al. A gene signature comprising seven pyroptosis-related genes predicts prognosis in pediatric patients with acute myeloid leukemia. Acta Haematol. 145, 627–641 (2022).
CAS PubMed Google Scholar
He, Y. et al. A nomogram for predicting event-free survival in childhood acute lymphoblastic leukemia: a multicenter retrospective study. Front. Oncol. 12, 854798 (2022).
CAS PubMed PubMed Central Google Scholar
Cui, Y. et al. Bayesian inference for survival prediction of childhood Leukemia. Comput. Biol. Med. 156, 106713 (2023).
PubMed Google Scholar
Zheng, G. et al. Comprehensive analysis of N6-methyladenosine-related long noncoding RNA prognosis of acute myeloid leukemia and immune cell infiltration. Front. Genet. 13, 888173 (2022).
CAS PubMed PubMed Central Google Scholar
Bohannan, Z., Coffman, F. & Mitrofanova, A. Random survival forest model identifies novel biomarkers of event-free survival in high-risk pediatric acute lymphoblastic leukemia. Comput. Struct. Biotechnol. J. 20, 583–597 (2022).
CAS PubMed PubMed Central Google Scholar
Gao, X. & Liu, W. The establishment and evaluation of a new model for the prediction of Children B-ALL based on TARGET: A SQUIRE-compliant study. Med. (Baltim.) 99, e20115 (2020).
Google Scholar
Lin, C. et al. Integrating RNA-seq and scRNA-seq to explore the biological significance of NAD+ metabolism-related genes in the initial diagnosis and relapse of childhood B-cell acute lymphoblastic leukemia. Front. Immunol. 13, 1043111 (2022).
CAS PubMed PubMed Central Google Scholar
Pan, L. et al. Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia. Sci. Rep. 7, 7402 (2017).
PubMed PubMed Central Google Scholar
Gbadamosi, M. et al. A ten-gene DNA-damage response pathway gene expression signature predicts gemtuzumab ozogamicin response in pediatric AML patients treated on COGAAML0531 and AAML03P1 trials. Leukemia 36, 2022–2031 (2022).
CAS PubMed PubMed Central Google Scholar
Pedreira, C. et al. New decision support tool for treatment intensity choice in childhood acute lymphoblastic leukemia. IEEE Trans. Inf. Technol. Biomed. 13, 284–290 (2009).
PubMed Google Scholar
Gal, O., Auslander, N., Fan, Y. & Meerzaman, D. Predicting complete remission of acute myeloid leukemia: machine learning applied to gene expression. Cancer Inf. 18, 1176935119835544 (2019).
Google Scholar
Kashef, A., Khatibi, T. & Mehrvar, A. Prediction of cranial radiotherapy treatment in pediatric acute lymphoblastic leukemia patients using machine learning: a case study at MAHAK Hospital. Asian Pac. J. Cancer Prev. 21, 3211–3219 (2020).
PubMed PubMed Central Google Scholar
Kashef, A., Khatibi, T. & Mehrvar, A. Treatment outcome classification of pediatric acute lymphoblastic leukemia patients with clinical and medical data using machine learning: a case study at MAHAK Hospital. Inform. Med. Unlocked 20, 100399 (2020).
Google Scholar
Al-Fahad, R. et al. Early imaging based predictive modeling of cognitive performance following therapy for childhood ALL. IEEE Access 7, 146662–146674 (2019).
PubMed PubMed Central Google Scholar
Ramalingam, R. et al. Evaluation of cytogenetic and molecular markers with MTX-mediated toxicity in pediatric acute lymphoblastic leukemia patients. Cancer Chemother. Pharmacol. 89, 393–400 (2022).
CAS PubMed Google Scholar
Zhan, M. et al. Machine learning to predict high-dose methotrexate-related neutropenia and fever in children with B-cell acute lymphoblastic leukemia. Leuk. Lymphoma 62, 2502–2513 (2021).
CAS PubMed Google Scholar
Tram, N. et al. Deep learning of image-derived measures of body composition in pediatric, adolescent, and young adult lymphoma: association with late treatment effects. Eur. Radiol. 33, 6599–6607 (2023).
PubMed Google Scholar
Theruvath, A. et al. Validation of deep learning-based augmentation for reduced 18F-FDG Dose for PET/MRI in children and young adults with lymphoma. Radiol. Artif. Intell. 3, e200232 (2021).
PubMed PubMed Central Google Scholar
Mahmood, N. et al. Identification of significant risks in pediatric acute lymphoblastic leukemia (ALL) through machine learning (ML) approach. Med. Biol. Eng. Comput. 58, 2631–2640 (2020).
PubMed Google Scholar
Kulis, J. et al. Machine learning based analysis of relations between antigen expression and genetic aberrations in childhood B-cell precursor acute lymphoblastic leukaemia. J. Clin. Med. 11, 2281 (2022).
CAS PubMed PubMed Central Google Scholar
Mandrekar, J. Receiver operating characteristic curve in diagnostic test assessment. J. Thorac. Oncol. 5, 1315–1316 (2010).
PubMed Google Scholar
Stuart, E., Bradshaw, C. & Leaf, P. Assessing the generalizability of randomized trial results to target populations. Prev. Sci. 16, 475–485 (2015).
PubMed PubMed Central Google Scholar
Khorsan, R. & Crawford, C. How to assess the external validity and model validity of therapeutic trials: a conceptual approach to systematic review methodology. Evid. Based Complement Altern. Med. 2014, 694804 (2014).
Google Scholar
Collins, G. & Moons, K. Reporting of artificial intelligence prediction models. Lancet 393, 1577–1579 (2019).
PubMed Google Scholar
Ramesh, S. et al. Applications of artificial intelligence in pediatric oncology: a systematic review. JCO Clin. Cancer Inform. 5, 1208–1219 (2021).
PubMed Google Scholar
Kelly, C., Karthikesalingam, A., Suleyman, M., Corrado, G. & King, D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 17, 195 (2019).
PubMed PubMed Central Google Scholar
Ghassemi, M. et al. A review of challenges and opportunities in machine learning for health. AMIA Jt. Summits Transl. Sci. Proc. 2020, 191–200 (2020).
PubMed PubMed Central Google Scholar
Khan, B. et al. Drawbacks of Artificial Intelligence and Their Potential Solutions in the Healthcare Sector. Biomed. Mater Devices. 1–8 https://doi.org/10.1007/s44174-023-00063-2 (2023).
Harvey, R. et al. Development and validation of a highly sensitive and specific gene expression classifier to prospectively screen and identify B-precursor acute lymphoblastic leukemia (ALL) patients with a Philadelphia chromosome-like (‘Ph-like’ or ‘Bcr-Abl1-like’) signature for therapeutic targeting and clinical intervention. Blood 122, 826 (2013).
Google Scholar
Ghaderzadeh, M. et al. Machine learning in detection and classification of leukemia using smear blood images: a systematic review. Sci. Prog. 2021, 9933481:1–14 (2021).
Lavitt, F., Rijlaarsdam, D., van der Linden, D., Weglarz-Tomczak, E. & Tomczak, J. Deep learning and transfer learning for automatic cell counting in microscope images of human cancer cell lines. Appl. Sci. 11, 4912 (2021).
CAS Google Scholar
Lee, S.-J., Chen, P.-Y. & Lin, J.-W. Complete blood cell detection and counting based on deep neural networks. Appl. Sci. 12, 8140 (2022).
CAS Google Scholar
Verma, A. et al. Grand rounds in methodology: key considerations for implementing machine learning solutions in quality improvement initiatives. BMJ Qual. Saf. 33, 121–131 (2024).
PubMed Google Scholar
Karami, K., Akbari, M., Moradi, M. T., Soleymani, B. & Fallahi, H. Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. PLoS One 16, e0254976 (2021).
CAS PubMed PubMed Central Google Scholar
Eckardt, J. et al. Prediction of complete remission and survival in acute myeloid leukemia using supervised machine learning. Haematologica 108, 690–704 (2023). Published 2023 Mar 1.
CAS PubMed Google Scholar
Tong, Y. et al. Prediction of lymphoma response to CAR T cells by deep learning-based image analysis. PLoS One 18, e0282573 (2023).
CAS PubMed PubMed Central Google Scholar
Elhadary, M. et al. Applications of machine learning in chronic myeloid leukemia. Diagnostics 13, 1330 (2023).
PubMed PubMed Central Google Scholar
Jian, C. et al. Predicting delayed methotrexate elimination in pediatric acute lymphoblastic leukemia patients: an innovative web-based machine learning tool developed through a multicenter, retrospective analysis. BMC Med Inf. Decis. Mak. 23, 148 (2023).
Google Scholar
Arai, Y. et al. Using a machine learning algorithm to predict acute graft-versus-host disease following allogeneic transplantation. Blood Adv. 3, 3626–3634 (2019).
PubMed PubMed Central Google Scholar
Tozzi, A. et al. Gaps and opportunities of artificial intelligence applications for pediatric oncology in european research: a systematic review of reviews and a bibliometric analysis. Front. Oncol. 12, 905770 (2022).
PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

School of Medicine, University of Manchester, Manchester, UK
Gerard Gurumurthy
School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
Juditha Gurumurthy
Department of Infectious Diseases & Immunology, Imperial College London, London, UK
Samantha Gurumurthy

Authors

Gerard Gurumurthy
View author publications
Search author on:PubMed Google Scholar
Juditha Gurumurthy
View author publications
Search author on:PubMed Google Scholar
Samantha Gurumurthy
View author publications
Search author on:PubMed Google Scholar

Contributions

GG, JG and SG were involved in the literature search and subsequent data extraction from the included studies. The initial draft of the manuscript was prepared by GG and JG, with SG providing further review and editing. All authors have given final approval for the version to be published.

Corresponding author

Correspondence to Gerard Gurumurthy.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Search Strategy

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gurumurthy, G., Gurumurthy, J. & Gurumurthy, S. Machine learning in paediatric haematological malignancies: a systematic review of prognosis, toxicity and treatment response models. Pediatr Res 97, 524–531 (2025). https://doi.org/10.1038/s41390-024-03494-9

Download citation

Received: 17 February 2024
Revised: 22 June 2024
Accepted: 05 August 2024
Published: 31 August 2024
Version of record: 31 August 2024
Issue date: February 2025
DOI: https://doi.org/10.1038/s41390-024-03494-9

This article is cited by

Pediatrics 4.0: the Transformative Impacts of the Latest Industrial Revolution on Pediatrics
- Derşan Onur
- Çağla Özbakır
Health Care Analysis (2025)

Abstract

Background

Methods

Results

Conclusion

Impact

Similar content being viewed by others

Construction and validation of a risk prediction model for complications in patients with acute leukemia based on machine learning

Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms

Development and validation of a cuproptosis-related prognostic model for acute myeloid leukemia patients using machine learning with stacking

Introduction

Methods

Inclusion and exclusion criteria

Data extraction and synthesis

Quality assessment

Analysis

Results

Prognosis and relapse/recurrence studies

Treatment response studies

Treatment toxicity studies

Others: disease susceptibility & diagnosis studies

Discussion

Conclusion

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Search Strategy

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Pediatrics 4.0: the Transformative Impacts of the Latest Industrial Revolution on Pediatrics

Search

Quick links