Myelodysplastic neoplasms (MDS) encompass clonal myeloid malignancies characterized by ineffective hematopoiesis, cytopenia, myelodysplasia, and recurrent genetic events. Accurate cytomorphologic evaluation of the bone marrow remains crucial for initial diagnosis, response assessment, and detection of disease transformation to acute myeloid leukemia (AML). While counting myeloblasts is rather straightforward, signs of dysplasia are more subtle, and their accurate identification requires experienced investigators. Even so, detection of dysplasia is often challenging, time- and cost-intensive, prone to inter-observer variability (even among seasoned morphologists)1,2, and shows discrepancies between site and central review3. Deep learning (DL), especially convolutional neural networks (CNNs), excels in image classification tasks and has recently been applied to bone marrow morphology assessment4,5,6,7,8,9. In this study, we used an end-to-end DL system to accurately differentiate between MDS, AML, and bone marrow donor samples based on regions of interest in bone marrow smears, without the need for manual labeling of cells or dysplastic morphologies.

We extended our previously described DL pipeline4,5 to delineate MDS (n = 463), AML (n = 1301), and donors (n = 236). Based on case-level diagnoses, pre-treatment regions of interest of bone marrow smears from initial diagnosis were labeled with either “MDS”, “AML”, or “donor”. Importantly, no cell-level manual labeling was performed. We trained multiple models for binary classification, i.e., MDS vs. AML and MDS vs. donors, using six DL architectures with 5-fold cross-validation for every combination. MDS classification performance was externally validated. Baseline characteristics of the MDS patient cohort are shown in Table S1. For the distinction between MDS and donors, Densenet-201 achieved the highest classification performance, with an accuracy of 0.98 and a corresponding ROCAUC (area-under-the-curve of the receiver-operating-characteristic) of 0.97 (Table S2; Fig. 1A). Delineating MDS from AML, the best results were obtained using the Squeezenet architecture, with an accuracy of 0.98 and a ROCAUC of 0.99 (Table S2; Fig. 1B). In our AML cohort, 156 patients had AML with myelodysplasia-related changes (AML-MRC). Our classifier remained highly accurate even when accounting for MRC status, with accuracies of 0.95 and 0.98 for delineating AML-MRC and AML without MRC from MDS, respectively (Table S3). The same was true for AML with or without mutated NPM1, with accuracies of 0.99 and 0.97 for delineation from MDS, respectively (Table S4). Training each model (MDS vs. donors and MDS vs. AML) on the high-performance computing (HPC) system required 20 hours. In external validation, our models achieved an accuracy of 0.99 with a corresponding ROCAUC of 0.98 in distinguishing external MDS from donors (Table S5, Fig. 1C). Delineating external MDS samples from AML, an accuracy of 0.92 was achieved with a ROCAUC of 0.98 (Table S5, Fig. 1D).
In subgroup analyses, we trained models to delineate MDS with increased blasts (MDS-IB1/2) from MDS without increased blasts, achieving a ROCAUC of 0.87 (Table S6), and from AML, achieving a ROCAUC of 0.85 (Table S6). Distinguishing MDS-IB1 from MDS-IB2 showed a ROCAUC of 0.80 (Table S6).

Fig. 1: Performance of deep learning models for binary classifications delineating MDS, AML, and donors.

The receiver-operating characteristic (ROC) with the corresponding area-under-the-curve (AUC) is shown for the best-performing models for each classification task. Internal validation (A, B): Only test set results are reported. For MDS vs. donors, the best results were achieved with Densenet-201 (A). For MDS vs. AML, the best results were achieved with Squeezenet (B). Internal cross-validation was performed with an 80:20 split. Individual run performance (Fold 1-5; graphs in light blue, orange, green, red, and purple) as well as aggregate macro average performance (graph in dark blue) are reported with standard deviation and 95% confidence intervals. External validation (C, D): Individual run performance regarding MDS vs. donors (C) and MDS vs. AML (D) are reported.

To highlight image areas associated with correct class predictions and thereby identify the morphological cues the networks used to delineate MDS, AML, and donors, we used occlusion sensitivity maps (OSM). We found OSM to be cell-specific, indicating that the networks focus on cells rather than background or smudge, particularly on granulopoiesis and erythropoiesis as well as megakaryocytes (Fig. 2). High importance was found for defined signs of dysplasia, including altered nuclear morphology such as chromatin clumping, abnormal segmentation, or double nuclei. However, at times, high importance for correct class predictions was also assigned to cells we deemed inconspicuous for dysplasia per conventional definitions10; in these cells, the networks likewise focused mainly on the nucleus, sometimes including the perinuclear zone. This may indicate more intricate and subtle morphological alterations that are not quantifiable by human observers. Conversely, other signs of dysplasia, such as hypogranulation, were disregarded by our models. We observed that correct classifications were often made with confidence scores close to 1.0, potentially indicating that the models were confident enough in their decisions to assign a class without evaluating all apparent signs of dysplasia (defined or not), or that some signs of dysplasia were simply too rare in the training set to be learned by the models.

Fig. 2: Occlusion Sensitivity Mapping (OSM) highlights image areas with high importance for correct class predictions, enabling output interpretation.

OSM iteratively blocks image areas from being evaluated by the deep learning network. If an image area is highly important for classification, the network’s performance drops substantially in that iteration. Importance is standardized between 0 and 1, where values between 0.5 and 1.0 denote high importance, i.e., a sharp drop in classification performance when that image area is blocked (red, see color bar). A standard field of view of bone marrow smears from MDS patients is shown in (A, C, and E). The corresponding OSMs are displayed in (B, D, and F), respectively. First, in a proof-of-concept fashion, the networks focus on cells and specifically on nuclei; background, noise, or smudge are not considered important for classification. Second, high importance is given to erythropoietic and granulopoietic cells as well as megakaryocytes.

Using end-to-end DL, we developed a software framework to distinguish between MDS, AML, and donors with very high accuracy based on bone marrow smears from 2000 individual patients and bone marrow donors. Importantly, we demonstrated that information abstraction, even in MDS with its often subtle morphologies, is feasible using end-to-end learning, in contrast to recent studies in hematology that primarily rely on the generation of cell-level labels6,7,8,9. With the latter approach, a bottom-up system has to be devised, in which thousands of labels are first required to build a robust classifier, and individual cell-level predictions then have to be aggregated into a diagnosis-level prediction. Apart from being time-consuming and costly, the generation of cell-level labels, i.e., the ground truth on which many current classifiers in hematology are based, is flawed due to substantial classification biases2. This bottleneck and pitfall of cell-level labeling can essentially be bypassed by an end-to-end approach using robust region-of-interest-level labels. Similar approaches have yielded favorable results in generating differential counts11 as well as in predicting NPM1 and FLT3-ITD mutation status12.

With respect to explainability, DL is often referred to as a ‘black box’. Using OSM not only enables an internal proof-of-concept but also provides additional information to the human observer, as novel features important for prediction, which would otherwise elude the human eye, can be investigated. Interestingly, our classifiers focused on nuclei not only in dysplastic cells, but also in cells that we did not deem morphologically suspicious of dysplasia. Potentially, this alludes to digital biomarkers in MDS distinct from classical signs of dysplasia. Future work will focus on correlating saliency maps with genetic alterations and/or gene expression in MDS. While certain molecular alterations have already been linked to specific morphologies, such as mutated SF3B1 in MDS with ring sideroblasts, CNNs can potentially be used to identify novel gene-morphology links13.

Our study is limited by several factors. As is the case for most recent studies of computer vision in (hemato-)pathology, our analysis is based on retrospective data. While external validation confirmed high classification accuracy, prospective validation is still warranted. While manual selection of regions of interest mirrors clinical routine workflows, it introduces the necessity of expert input, and thus manual labor, into the workflow and may introduce bias, as classification performance may drop if suboptimal image areas are provided. In our study, we differentiated only between AML, MDS, and donors in a binary fashion. Still, some dysplastic morphologies can also be present to a certain degree in non-malignant disorders such as congenital syndromes, nutritional deficiencies, infectious disease, and drug- or toxin-mediated bone marrow damage. To increase routine applicability, future work will therefore also focus on acquiring image data from reactive and non-neoplastic specimens exhibiting signs of dysplasia in order to make our classifier more versatile and applicable in clinical routine. To this end, multi-class predictions in a single classifier become a viable alternative to binary classifications as more diagnostic classes are added, ensuring that a single model can practically inform diagnostic decision-making rather than requiring multiple models to be applied to the same slide. Further, the updated WHO14 and ICC15 classifications confirm or introduce several genetically defined subtypes of AML and MDS, MDS with increased blasts, as well as (in the case of the ICC) AML/MDS overlap syndrome. Given the rarity of some of these subtypes on the one hand and, in computer vision terms, the relative scarcity of corresponding training data in our sample on the other, training computer vision models to delineate these subtypes was not currently feasible, but is planned for future multicenter studies.
While several pathology foundation models have been introduced recently16, neither have they been trained on large corpora of bone marrow smear images nor have they been systematically evaluated for bone marrow morphology or MDS classification tasks, providing another avenue for further studies.

In summary, we developed a DL framework trained on patient and donor samples, achieving high accuracies in our internal test set and external validation in distinguishing between MDS, AML, and donors.

Methods

Data sets

We identified 463 MDS patients who had previously been diagnosed and treated at the University Hospital Dresden, Germany. The first control group comprised 1301 AML patients who had been diagnosed and treated under the auspices of the multicenter German Study Alliance Leukemia (SAL) within the following previously reported multicenter trials: AML96 [NCT00180115], AML2003 [NCT00180102], AML60+ [NCT00180167], and SORAML [NCT00893373]. Patients were eligible upon diagnosis of MDS or AML, age ≥18 years, and available biomaterial at initial diagnosis, including bone marrow smears. The second control group consisted of 236 bone marrow samples from healthy donors who underwent allogeneic bone marrow donation at our center. An additional external validation cohort of 50 MDS patients was obtained from the Munich Leukemia Laboratory (MLL), Munich, Germany, and evaluated in conjunction with held-out test sets of AML and donor samples from our internal cohort. Prior to analysis, written informed consent was obtained from all patients and donors in accordance with the revised Declaration of Helsinki17. All studies were approved by the Institutional Review Board of the TUD Dresden University of Technology (EK 98032010 and EK 289112008).

Image digitization

Pre-treatment bone marrow smears from initial diagnosis were used within this study. MDS, AML, and donor bone marrow smears were stained from anticoagulated bone marrow using the May-Gruenwald-Giemsa method10. Disease class labels were derived from case-level diagnostics, including cytomorphology, histology, cytogenetics, and molecular genetics, previously documented for each case during routine diagnostics or as part of the respective clinical trial. Using a Pannoramic SCAN II (3DHISTECH), we obtained high-resolution whole-slide images with a 20× objective and a 1.6× C-mount adapter, yielding a resolution of 0.20353 µm/pixel. For every AML patient and bone marrow donor, we selected a single region viewed at 50× digital magnification in SlideViewer (3DHISTECH) and exported the image. We assumed that subtle signs of dysplasia would not be fully captured in a single field of view. Therefore, for each MDS patient, we selected four regions of interest from the whole-slide image during training and testing on our internal data, viewed each at 50× digital magnification, and exported them as images for analysis. For external validation, 10 regions of interest were selected per MDS slide to accommodate the relatively smaller sample size of the validation cohort. Evaluation of bone marrow smears and potential regions of interest was performed by board-certified hematologists, mirroring clinical routine in selecting image areas according to stain quality, cellularity, even cell distribution, avoidance of overlapping or clumped cells, presence of bone marrow spicules (rather than peripheral blood), absence of artifacts, and digital image quality.

Deep learning

End-to-end prediction on regions of interest from bone marrow slides

We extended our previously described DL pipeline4,5 for binary predictions on regions of interest of bone marrow smears to delineate MDS, AML, and donors. Based on case-level diagnoses, images were labeled with either “MDS”, “AML”, or “donor”. Importantly, no cell-level manual labeling was performed. The pipeline was adapted to evaluate cases in a binary fashion, i.e., MDS vs. AML and MDS vs. donors. Imbalanced training data can bias a classifier towards the predominant class. Considering the imbalances between the data sets (n = 463 MDS samples with 4 images per patient, resulting in 1852 MDS images in total; n = 1301 AML samples with 1 image per patient; n = 236 donor samples with 1 image per donor), we used image augmentation techniques, such as random sized cropping, color shifting, and linear transformations, to balance the data sets for each binary classification task. For all binary classifications, a 5-fold internal cross-validation was used, i.e., a train-test split of 80:20. Cases used for model training were strictly separated from cases used for testing. In DL, an optimal model cannot be determined a priori, but rather has to be evaluated for the specific use case, data set, and model architecture. Hence, we evaluated six recently introduced DL architecture families for computer vision: ResNet-18/34/50/101/15218, ResNeXt-50_32x4d/101_32x8d19, Wide-ResNet-50/10120, DenseNet-121/161/169/20121, ShuffleNet v2_x0_5/v2_x1_022, and SqueezeNet v1.123. All DL models were pre-trained on ImageNet data24. The final architecture for each model was determined using automated hyperparameter optimization with the Optuna framework25. DL models were implemented in Python using the PyTorch framework. Computations were performed on the high-performance computing (HPC) cluster of the TUD Dresden University of Technology.

Performance evaluation

Recall (syn.: sensitivity), precision (syn.: positive predictive value), and accuracy were used to evaluate classification performance. Recall is defined as the fraction of true positives among all relevant events (actual positives), and precision as the fraction of true positives among all positive predictions. Further, the area-under-the-curve (AUC) was determined for the receiver-operating-characteristic (ROC). All metrics are reported for each binary classification for the internal test sets as well as for the external validation cohort with 95% confidence intervals.
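These metrics can be illustrated on toy predictions with scikit-learn; the labels and scores below are invented for illustration and are not study data.

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, roc_auc_score)

y_true = [1, 1, 1, 0, 0, 0, 1, 0]                    # 1 = MDS, 0 = donor (toy labels)
y_score = [0.9, 0.8, 0.4, 0.2, 0.1, 0.3, 0.7, 0.6]   # model confidence for class 1
y_pred = [int(s >= 0.5) for s in y_score]            # threshold scores at 0.5

recall = recall_score(y_true, y_pred)        # TP / (TP + FN), i.e., sensitivity
precision = precision_score(y_true, y_pred)  # TP / (TP + FP), i.e., PPV
accuracy = accuracy_score(y_true, y_pred)    # fraction of correct predictions
auc = roc_auc_score(y_true, y_score)         # area under the ROC curve
```

Note that the ROC AUC is computed from the continuous scores, whereas recall, precision, and accuracy require thresholded class predictions.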

Explainability of classifications via occlusion sensitivity maps

To identify the morphologies that the neural networks used for class predictions, methods of model explainability can be employed. In general, explainability of computer vision models refers to techniques that visualize image areas of high importance for correct classifications, thereby confirming that a model has learned relevant features and enabling domain experts to identify digital biomarkers. In our study, to highlight image areas to which the network assigned high importance and thereby identify the morphological cues used to delineate MDS, AML, and donors, we used occlusion sensitivity maps (OSM)26. In OSM, image areas are iteratively blocked from the view of the CNN, and classification performance is measured. If a blocked image area is highly relevant for accurate classification, model performance drops accordingly. This process is repeated for the entire image, highlighting image areas crucial for accurate predictions so that the morphologies prompting the CNN classifier to predict a label can be evaluated and interpreted. The importance of image areas was scaled between 0 and 1. For visualizations, a threshold of 0.5 was used, and only image areas with importance scores of 0.5 to 1.0 are highlighted and referred to as areas of high importance. Areas of high or low importance for class predictions on region-of-interest images were subsequently assessed by at least three independent board-certified hematologists per image to identify relevant morphologies highlighted or missed by OSM.
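The core of the occlusion procedure can be sketched as follows, assuming a scoring function that stands in for the trained CNN's class probability. The patch size, stride, and fill value are hypothetical choices for illustration, not the study's settings.

```python
import numpy as np

def occlusion_map(image, score_fn, patch=4, stride=4, fill=0.5):
    """Slide an occluding patch over a 2D image and map the score drop.

    score_fn stands in for the trained CNN's class probability; here it is
    a toy callable, not the study's model.
    """
    h, w = image.shape
    base = score_fn(image)                       # score on the unoccluded image
    heat = np.zeros_like(image, dtype=float)
    for y in range(0, h - patch + 1, stride):
        for x in range(0, w - patch + 1, stride):
            occluded = image.copy()
            occluded[y:y + patch, x:x + patch] = fill   # block this area
            drop = max(base - score_fn(occluded), 0.0)  # performance drop
            heat[y:y + patch, x:x + patch] = drop
    if heat.max() > 0:
        heat /= heat.max()                       # scale importance to [0, 1]
    return heat
```

Thresholding the returned map at 0.5, as in the study's visualizations, would then highlight only areas whose occlusion causes a substantial drop in the class score.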