Abstract
Fabry disease (FD) is a rare lysosomal storage disorder caused by mutations in the GLA gene, resulting in globotriaosylceramide accumulation. Kidney involvement (Fabry nephropathy) significantly contributes to morbidity and mortality. Diagnosis can be difficult, especially in females or late-onset variants. Renal biopsy remains essential, but interpretation requires expert pathologists. Digital pathology and artificial intelligence (AI) offer promising solutions to support diagnosis. The study analyzed Whole-slide images from renal biopsies of Fabry nephropathy patients to develop and validate a “foamy podocytes” screening AI tool. Two computational tasks were performed: glomerular-level classification, and podocyte-level segmentation. Performance was evaluated using standard metrics. A novel ZEBRA score (fpA/tgA%) was developed to quantify disease burden, and correlations with histological scores and clinical parameters were assessed. EfficientNetB2 achieved the highest classification accuracy (79%) in identifying foamy podocytes. SegFormerB4 had the best segmentation performance (Dice = 0.46, IoU = 0.37). The ZEBRA score effectively distinguished Fabry nephropathy from controls (p < 0.001) and showed good correlation with manual scoring (rs = 0.66–0.71). The AI-assisted ZEBRA pipeline highlights high-risk Fabry nephropathy features to support nephropathologists as a screening tool.
Similar content being viewed by others
Introduction
Fabry disease (FD) is a multisystem lysosomal storage disorder caused by mutations in the GLA gene on the X chromosome, leading to a deficiency of α-galactosidase A (AGAL). This enzymatic defect results in the accumulation of globotriaosylceramide (Gb3) in various organs, including the heart, nervous system, skin, and kidneys1. Kidney involvement (Fabry nephropathy, FN) is a major contributor to morbidity and mortality2,3 with progressive proteinuria that may lead to end-stage kidney disease (ESKD) if not promptly recognized and treated4,5. However, FN is often clinically silent in early stages, especially in late-onset or atypical variants and in female patients due to X-chromosome inactivation (lyonization), making early diagnosis challenging6. In this context, renal biopsy remains crucial for detecting early Gb3 deposits via light and electron microscopy, even before other systemic signs emerge7,8,9. Histological evaluation also supports treatment decisions8, monitoring response9,10,11 and predicting disease progression9,12,13. Currently, diagnosis relies on the expertise of pathologists in identifying podocyte cytoplasmic vacuolization—FN’s hallmark—which may be subtle or focal, particularly in atypical or female cases9,14, representing a potential challenge for non experienced nephropathologists. Although vacuolated podocytes often provide the only morphological hint of FN on biopsy, their presence is not pathognomonic and must be interpreted with caution, as similar appearances can be observed in other lysosomal storage disorders or in drug-induced phospholipidosis15. Confirmatory electron microscopy, identifying characteristic lamellated inclusions (e.g., myelin and zebra bodies)16, is not always available, increasing the risk of underdiagnosis. As kidney involvement may precede other symptoms17, renal biopsy can be the first clue to FD, prompting genetic and biochemical testing. Digital pathology may represent the ideal solution to overcome these challenges, since the conversion of physical slides into whole slide images (WSIs) enable the creation of hub-spoke centers with high expertise in the evaluation of these rare diseases18,19, favoring both the preservation of high quality standards and the development of artificial intelligence (AI) and computational tools to assist the pathologists in the evaluation of such cases20. Although recent computational nephropathology efforts have successfully applied AI to segment renal tissue (e.g., glomeruli, tubules, interstitium)21 and classify glomerular lesions (e.g., sclerosis, hypercellularity)22, these models often focus on broad morphological classes and not on rare and disease-specific features such as podocyte vacuolization. Here, a computational pipeline for the automatic detection of vacuolated podocytes is developed and validated as a scalable, automated risk-alert tool that sensitively screens for and quantifies the specific morphological hallmarks of FN.
Methods
Cases
A multicenter analysis of formalin-fixed and paraffin-embedded renal biopsy and genetically-confirmed FNs was conducted across different Italian institutions (Table 1), spanning a time period of 15 years (2010–2024). For each case (n = 37) and control (n = 40), a minimum of one 2–3 μm cut hematoxylin and eosin (H&E) or periodic acid-Schiff (PAS) stained slide was obtained, including both staining types when available. Cases were included if they underwent kidney biopsy according to standard nephrological indications, regardless of whether FD was initially suspected. Indications for biopsy included persistent significant proteinuria (> 0.5–1 g/day), hematuria, unexplained progressive decrease in renal function, or other clinical or laboratory signs suggesting underlying renal disease. Controls were randomly selected from cases with an alternative final diagnosis and included only glomeruli evaluated as devoid of the lesion of interest upon histopathological review; matching by center was applied whenever possible, and matching by time period was ensured by selecting slides from the same year. For all cases, electron microscopy was performed confirming the presence of myelin bodies in FN cases and their absence in controls. The slides were scanned using different whole-slide imaging systems at varying magnifications. Demographic and clinical data were collected, including sex, age, serum creatinine levels (mg/dL), estimated glomerular filtration rate (eGFR) following the CKD-EPI formula (ml/min/1.73 m2), proteinuria (g/24 h), GLA mutation type, associated amino acid change, and FD clinical phenotype. Pathological features were evaluated by a renal pathologist (VL) for each biopsy, documenting: the total number of glomeruli, the number of globally sclerotic glomeruli, the number of glomeruli with segmental sclerosis, the mean podocyte vacuolization score based on the International Study Group of FN (ISGFN) scheme9, and the estimated percentage of interstitial fibrosis and tubular atrophy (IFTA%). This study complies with the Declaration of Helsinki and was performed according to ethics committee approval (PNRR-MR1-2022–12375735, n. 16681, 03/16/23).
Dataset creation and model development
In the initial phase, non-globally-sclerotic glomerular regions and individual foamy podocytes were manually annotated using QuPath v0.5.123 by a pathology resident (MC) on a pathology-dedicated medical monitor (BARCO MDPC-8127, Courtrai, Belgium). The annotations were subsequently reviewed and corrected by an expert nephropathologists (FP) with over 15 years of experience in renal biopsy interpretation. Based on these annotations, 512 × 512 pixel tiles were extracted at a magnification of 0.5 μm/px and categorized according to the presence or absence of foamy podocytes. Two complementary experiments were conducted to automate the detection and analysis of foamy podocytes within glomerular regions, using independently trained models:
-
a classification model designed as a computationally lightweight, easily integrable screening tool compatible with existing frameworks (WSInfer and QuPath);
-
a separate segmentation model developed for detailed, feature-level quantification to provide explainability and support granular disease assessment (Fig. 1).
Schematic representation of the computational pathology pipeline for the detection of foamy podocytes. After case selection and subdivision in training/validation and test sets, biopsies were annotated at both glomerular level (A: giving a label of foamy or not foamy, green and red in the figure) and at podocyte level (B: delineating the podocyte with foamy cytoplasm). Based on these annotations, cases were used to build a classification and segmentation model, for A and B annotations respectively. The obtained algorithms were applied on an external test set to evaluate their performances on either the classification of positive/negative glomeruli of the biopsy (glomerular detail) or the outlining of foamy podocytes within glomeruli (podocyte detail). The obtained pipeline (ZEbra Bodies Recognition by Artificial intelligence, ZEBRA) is available as a free QuPath-WSInfer extension (classification system) as and as a segmentation tool freely available as a GitHub repository.
We additionally tested the model on a rare case of crystalline podocytopathy, still characterized by a foamy-like appearance, to check whether the model is reliable when stressed with ambiguous cases.
1. Classification experiment (glomerular detail):
A series of classification models were trained to detect the presence/absence of foamy podocytes at the glomerular level. These models included ResNet18, EfficientNetB2, DenseNet121, Swin-T, and a custom architecture that utilized UNI2 (a foundational model developed by the Mahmood Lab at Harvard)23. Training was performed using 5-fold cross-validation at the case level, ensuring that all tiles from a single case were consistently assigned to either the training or validation set in each fold, with evaluations performed on an independent test set. The test set did not include any slides from the internal cohort. Each model was trained with and without data augmentation (Supplementary Methods). Performance on an external test set was evaluated at both the tile and case levels, using standard classification metrics: accuracy, precision, recall, F1 score, receiver operating characteristic-area under the curve (AUC-ROC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV). The 95% confidence intervals for metrics were estimated using a bootstrap approach with 1,000 resamples, sampling with replacement from the predictions and recalculating each metric to derive the empirical percentile-based intervals.
2. Segmentation experiment (podocyte detail):
Several segmentation models were employed, including U-Net, DeepLabV3+, and SegFormerB4. The same 5-fold case-level cross-validation scheme was applied, with evaluations performed on an independent test set. Model performance was assessed at both the tile and case levels, using the Dice coefficient and intersection over union (IoU) metrics to measure concordance with manually annotated masks. The Dice coefficient quantifies the overlap between predicted and reference regions relative to their combined size, while IoU measures the overlap relative to the union of the predicted and reference regions. Although calculated differently, these metrics are closely related and provide complementary information on segmentation accuracy. In addition, a separate model was also trained specifically to segment the glomerulus, enabling accurate delineation of the glomerular area and supporting the subsequent quantification of foamy podocyte proportions. For each glomerulus, the proportion of the area occupied by foamy podocytes (fpA) on the total glomerular area (tgA) was calculated (fpA/tgA%) and correlated with the ISGFN scoring system9.
Statistical analysis
Continuous variables were summarized as medians and interquartile ranges (IQR), while categorical variables were expressed as counts and percentages. Differences in continuous and categorical distributions were evaluated using the Mann-Whitney U test and χ² test, respectively, with statistical significance defined as P < 0.05. To evaluate the agreement between the continuous values derived from the manual mean podocyte vacuolization score (MPVS, as determined by ISGFN9 and the outputs of the automated segmentation models estimating the glomerular area occupied by foamy podocytes, the Spearman correlation coefficient (rs) was used. To evaluate the discriminative performance of the ZEBRA score between FN and controls, a receiver operating characteristic (ROC) curve analysis was performed. The AUC was calculated to quantify overall performance. The optimal cutoff value was determined using Youden’s J statistic, and the corresponding sensitivity and specificity were reported to provide guidance for potential clinical interpretation. Linear regression (R2) was used to assess the correlation of the MPVS and the computational score with biochemical values and ANCOVA test for statistical significance. All statistical analyses were performed using Excel 2016 (Microsoft) and Python v.3.12 with the libraries pandas v.2.2.3 and scikit-learn v.1.4.2.
Results
Cases
The clinicopathological features of the cohort are presented in Table 2. Controls were composed of 12 IgA nephropathy, 6 Minimal change disease and 6 arterionephrosclerosis, 5 minimal histological abnormalities, 4 membranous nephropathy and 4 focal and segmental glomerulosclerosis and 3 miscellaneous (thrombotic microangiopathy, fibrillary glomerulonephritis and proliferative glomerulonephritis with monoclonal immunoglobulin deposits) cases. The distribution of GLA mutations among Fabry patients revealed a predominance of the c.644 A > G and c.1066 C > T variants, with the majority of patients exhibiting the late-onset (LO) phenotype (73%). FN demonstrated higher eGFR (98 ml/min/1.73m2 vs. 69 ml/min/1.73m2 p < 0.001), lower levels of proteinuria (median 0.2 g/day vs. 1.28 g/day, p < 0.001) and milder IFTA (p = 0.01) compared to controls. Additionally, FNs showed a median podocyte vacuolization score of 1.18 (IQR 0.89–2.05). The dataset comprised 1,075 glomeruli from FN cases and 689 from controls; vacuolized podocytes were identified in 74% (796/1,075) of the FN glomeruli, resulting in a final distribution of 796 positive and 968 negative glomeruli.
Classification task (glomerular detail)
The performance metrics of all models on the training, validation, and test sets are presented in Table 3. The application of augmentation techniques enhanced the generalizability of the models, as evidenced by only a minor reduction in performance across the training, validation, and test sets (Table 3). EfficientNetB2 models achieved the highest performance in glomerular classification at the tile level on the external test set, with an accuracy, precision, recall, F1-score, sensitivity and AUC-ROC of 0.79, a PPV of 0.78, specificity of 0.83, and NPV of 0.74 (Fig. 2). This model successfully identified all FNs as positive at case level, although it misclassified at least one glomerulus in 10 out of 16 controls in the external test set, resulting in a perfect NPV and sensitivity (1), a PPV of 0.57 and a specificity of 0.38 at case level. These findings suggest the potential role of this classifier as a screening tool to detect cases that may benefit from a secondary review by pathologists.
The confusion matrix shows the exact number of correctly classified and misclassified glomeruli for the best model. For illustration, two representative examples are provided for each prediction category (correctly classified and misclassified), with zoomed-in views highlighting the foamy podocytes.
Segmentation task (podocyte detail)
The performance metrics of all models on the training, validation, and test sets are presented in the Supplementary Results. Among the different models tested (Table 3), the SegFormer B4 achieved the highest performances in foamy podocyte segmentation (Dice = 0.46 and IoU = 0.37 without augmentation), with a tile-level sensitivity of 0.95 and a PPV of 0.91, where the tile-level assessment specifically refers to detecting the presence or absence of foamy podocytes based on the segmentation output, further stressing the role of this AI tool as a screening assistant (Fig. 3).
Examples of cases correctly classified or misclassified by the segmentation model. In green, the segmented glomerular area and in orange the vacuolized podocytes are shown. TN=True Negative, FN = False Negative, TP = True Positive, FP = False Positive. The reported values refer only to glomeruli that were correctly segmented by the model, with a sample considered positive when at least one pixel was predicted as a vacuolized podocyte.
To streamline the inference workflow, a custom script was developed to automatically generate region annotations at the appropriate scale to be manually positioned around glomeruli (ZEbra Bodies Recognition by Artificial intelligence, ZEBRA). The glomerulus-level classification model was exported in a format compatible with the WSInfer extension for QuPath24, enabling user-friendly, one-click inference on annotated glomerular regions, and uploaded in the following Github repository, where the technical functioning is also explained in the documentation:
https://github.com/Gizmopath/ZEBRA-ZEbra-Bodies-Recognition-by-Artificial-intelligence.
All code used during training and inference is included in the same repository, including the customized QuPath scripts for tile extraction and glomerular region annotation. The script for inference and quantification of foamy podocytes relative to total glomerular area is likewise available in the GitHub repository. However, unlike the glomerulus-level classification model, this script is provided as a standalone tool and is not directly compatible with integration into QuPath/WSInfer.
A computational histology score for Fabry nephropathy: the ZEBRA score
The obtained ZEBRA model was used to calculate the proportion of the area occupied by foamy podocytes (fpA) on the total glomerular area (tgA). The obtained average ZEBRA score (ZS = fpA/tgA %) further confirmed the capability of the algorithm to discriminate FNs from controls (Fig. 4), both at a case level (mean ZS of 0.49 ± 0.14 vs. 0.10 ± 0.8 in positive vs. negative cases, p < 0.001) and at a glomerular level (mean ZS of 0.5 ± 2.4 vs. 0.10 ± 0.13 in positive vs. negative cases, p < 0.001), with a clear separation and no overlap at the case level. ROC curve analysis of the ZEBRA score demonstrated good discrimination between FN and controls, with an AUC of 0.93. A preliminary cutoff of 0.19 (mean ZS per case) was identified, corresponding to a sensitivity of 0.91 and a specificity of 0.81. The comparison of ZS and MPVS for the positive cases demonstrated strong correlation between the two scores in both H&E (rs=0.66) and PAS (rs=0.71) stained slides, however, case-by-case analysis revealed substantial discrepancies between the two scores, likely due to the limited segmentation performance (Fig. 5A). Despite the subtle vacuolization potentially caused by X-chromosome inactivation in females, the model demonstrated non-inferior performance in females vs. males, with even higher correlation between MPVS and ZS in the former (rs = 0.74 for H&E, 0.75 for PAS) than in the latter group (rs = 0.56 for H&E, 0.68 for PAS). To assess the reliability and clinical relevance of the ZS even in cases with subtle clinical presentations, the analysis of this parameter in the subset of FN cases with eGFR > 90 mL/min/1.73 m² and proteinuria < 0.5 g/24 h showed a mean ZS of 0.22 ± 0.15, as compared to the null value of the control group with same characteristics, demonstrating the relevance even in this clinical subset. Linear regression analysis (Fig. 5B–E) indicated a modestly stronger correlation of ZS compared with MPVS with baseline clinical measures (eGFR-ZS R2 = 0.05 vs. 0.002 MPVS and proteinuria ZS R2 = 0.09 vs. 0.05 MPVS, respectively), although not reaching statistical significance (p = 0.88 and 0.26, respectively). Finally, to test the reliability of the algorithm we stressed the ZEBRA pipeline on a rare case of crystalline podocytopathy. Both the classification (EfficientNetB2) and the segmentation models (SegformerB4) detected positive glomeruli (5 out of 27) and/or podocytes (in 8 out of 27 glomeruli), respectively, but the final mean ZS was overall < 0.01, way below the proposed cutoff of 0.19, indicating a correct classification as non-FN case for this crystalline podocytopathy.
Comparative box plots of ZEBRA Scores between glomeruli from Fabry Nephropathy samples and glomeruli from samples with other renal diseases (top), and aggregated ZEBRA Scores per case (bottom). Statistically significant differences are observed in both comparisons, with a clear separation and no overlap at the case level.
Comparison and correlations between the mean podocyte vacuolization score (MPVS), the ZEBRA Score (ZS), and clinical data. Panel (A) displays all positive cases arranged in descending order of their mean glomerular MPVS, alongside the corresponding ZS values for those same cases. Scatter plots with linear regression lines are shown for MPVS vs. eGFR (B), ZS vs. eGFR (C), MPVS vs. proteinuria (D), and ZS vs. proteinuria (E).
Discussion
The integration of artificial intelligence (AI) into nephropathology has garnered significant attention, offering promising avenues for enhancing diagnostic precision and reproducibility in renal biopsy interpretation20. AI applications have been explored for tasks such as glomerular classification, segmentation, and quantification of histological features, aiming to augment traditional pathology workflow25. In the context of FN the diagnostic challenge is represented by the insidious clinical presentation that can be totally unspecific, especially in female patients and those with late-onset variants, which may reflect on relatively subtle modifications on renal biopsies (e.g. podocyte vacuolization) that can be overlooked by inexperienced pathologists thereby causing delays in treatment. Our study introduces a novel AI-driven pipeline, ZEBRA (Zebra bodies Evidenced on Bioptic Renal samples using Artificial intelligence), designed to automatically detect and quantify vacuolated podocytes in digitized renal biopsy slides to assist pathologists providing a first screening of H&E/PAS stained slides to highlight possible areas of glomerular accumulation of Gb3 within podocytes in the form of cytoplasmic vacuolization. Being quick and feasible, the developed tool may be applied at scale to routine biopsy series to uncover incidental FNs, which, based on our unpublished experience, may account for up to 1–2% of all renal biopsies. Moreover, it can be directly implemented on routine histology stains available in all the pathology labs, instead of needing epoxy-embedded toluidine blue-stained semi-thin section slides. Moreover, even if electron microscopy provides useful details that complement the light microscopy features of FN and recent reports demonstrate the applicability of AI algorithms on EM figures to assist renal pathologists in some evaluations (e.g. foot process effacement)26,27, we wanted to create an easily applicable tool to check widely available H&E/PAS slides with screening purpose to redirect the case to electron microscopy or genetic analysis.
The ZEBRA algorithm consists of two components: a glomerular-level classification module and a segmentation module that precisely delineates individual affected podocytes at the microscopic level. The glomerular detail of ZEBRA profits from the most recent and effective neural networks available for AI-based image processing and shows a perfect detection of all cases of FN in the test set while misdiagnosing some control glomeruli as affected, suggesting a good role for this tool as a screening assistant. This classification model has been built as a fully integratable extension for QuPath28 and WSInfer24 as a free resource to allow easy implementation for the daily work and prospective application of the tool to further validate its clinical utility. However, foamy podocytes are not exclusive to FD and may appear in other conditions, underscoring the need to interpret AI-based findings within the broader clinical and pathological context. Moreover, although the segmentation of ZEBRA achieved only a modest Dice score, reflecting the inherent complexity and variability of vacuolated podocyte patterns, it showed optimal tile-level sensitivity (0.95) and PPV (0.91) relying on state-of-the-art transformer models (SegFormerB4) and the careful annotations by expert renal pathologists as solid ground truth, providing the substrate for the elaboration of a computational disease burden quantification with the ZEBRA score. While the MPVS is useful to assess podocyte involvement, the manual assessment method can still have some limitations such as interobserver variability and inconsistent predictive capability. Our ZEBRA score aims to provide a reproducible and quantifiable measure of podocyte vacuolization, potentially serving as a biomarker for diagnostic and predictive purposes. While the correlations with baseline eGFR and 24-hour proteinuria were generally modest for both ZS and MPVS without statistically significant differences, a slight improvement was observed using ZS. In this experiment, the use of standard and widely available techniques, together with the implementation of key methodological strategies, including diverse data augmentation techniques and the use of multiple whole-slide scanner systems, enhanced the algorithm’s robustness and generalizability across a wide range of variability sources. These approaches directly address one of the major current challenges in translating AI algorithms into routine clinical practice: the risk of reduced performance when applied to external datasets29. Currently as a feasibility study due to limited sample size, the ZEBRA score correlates with baseline clinical parameters, whilst its predictive value concerning long-term outcomes remains to be elucidated. While controls were matched for center/time, they exhibited worse baseline renal function, which is a result of the retrospective selection of biopsy-proven non-Fabry glomerular diseases. Future prospective studies with larger cohorts and longitudinal follow-up are essential to validate the prognostic utility of the ZEBRA score, enhance its generalizability by expansion to multi-center external datasets with multi-annotator consensus, and its integration into clinical practice.
Conclusion
This study introduces and validates a novel digital pathology pipeline for FN, culminating in the ZEBRA score, a new histological measure of podocyte involvement. The findings offer initial insights into its potential applications, emphasizing that the AI system provides a comprehensive analysis of image data, alerting human operators to possible oversights and helping prevent missed cases, in line with its design as a high-sensitivity screening tool. Future studies are warranted to assess the prognostic implications of the ZEBRA score and its utility in guiding therapeutic decisions in FD.
Data availability
Authors agree to make data and materials supporting the results or analyses presented in their paper available upon reasonable request. The trained models were exported in a format compatible with the WSInfer extension for QuPath, enabling user-friendly, one-click inference on annotated glomerular regions, and uploaded in the following Github repository: [https://github.com/Gizmopath/ZEBRA-ZEbra-Bodies-Recognition-by-Artificial-intelligence](https:/github.com/Gizmopath/ZEBRA-ZEbra-Bodies-Recognition-by-Artificial-intelligence).
References
Germain, D. P. Fabry disease. Orphanet J. Rare Dis. 5, 30 (2010).
Feriozzi, S. & Rozenfeld, P. Pathology and pathogenic pathways in Fabry nephropathy. Clin. Exp. Nephrol. 25, 925–934 (2021).
Pisani, A. et al. The kidney in fabry’s disease. Clin. Genet. 86, 301–309 (2014).
Branton, M. H. et al. Natural history of Fabry renal disease: influence of alpha-galactosidase A activity and genetic mutations on clinical course. Medicine 81, 122–138 (2002).
Scionti, F. et al. Genetic variants associated with Fabry disease progression despite enzyme replacement therapy. Oncotarget 8, 107558–107564 (2017).
Riccio, E., Sabbatini, M., Capuano, I. & Pisani, A. Early biomarkers of Fabry nephropathy: A review of the literature. Nephron 143, 274–281 (2019).
Tøndel, C. et al. Foot process effacement is an early marker of nephropathy in young classic Fabry patients without albuminuria. Nephron 129, 16–21 (2015).
Avarappattu, J., Gaspert, A., Spartà, G. & Rohrbach, M. Impact of kidney biopsy on deciding when to initiate enzyme replacement therapy in children with Fabry disease. Pediatr. Nephrol. 39, 131–140 (2024).
Fogo, A. B. et al. Scoring system for renal pathology in Fabry disease: report of the international study group of Fabry nephropathy (ISGFN). Nephrol. Dial Transpl. 25, 2168–2177 (2010).
Kim, I. Y., Lee, H. J. & Cheon, C. K. Fabry nephropathy before and after enzyme replacement therapy: important role of renal biopsy in patients with Fabry disease. Kidney Res. Clin. Pract. 40, 611–619 (2021).
Najafian, B. et al. One year of enzyme replacement therapy reduces globotriaosylceramide inclusions in podocytes in male adult patients with Fabry disease. PLoS One. 11, e0152812 (2016).
Tøndel, C. et al. Agalsidase benefits renal histology in young patients with Fabry disease. J. Am. Soc. Nephrol. 24, 137–148 (2013).
Rusu, E. E. et al. The Impact of Kidney Biopsy for Fabry Nephropathy Evaluation on Patients’ Management and Long-Term Outcomes: Experience of a Single Center. Biomedicines 10, (2022).
Valbuena, C. et al. Kidney biopsy findings in heterozygous Fabry disease females with early nephropathy. Virchows Arch. 453, 329–338 (2008).
Choung, H. Y. G., Jean-Gilles, J. & Goldman, B. Myeloid bodies is not an uncommon ultrastructural finding. Ultrastruct Pathol. 46, 130–138 (2022).
Colpart, P. & Félix, S. Fabry nephropathy. Arch. Pathol. Lab. Med. 141, 1127–1131 (2017).
Capuano, I., Buonanno, P., Riccio, E., Crocetto, F. & Pisani, A. Parapelvic cysts: an imaging marker of kidney disease potentially leading to the diagnosis of treatable rare genetic disorders? A narrative review of the literature. J. Nephrol. 35, 2035–2046 (2022).
L’Imperio, V. et al. Improvements in digital pathology equipment for renal biopsies: updating the standard model. J. Nephrol. 37, 221–229 (2024).
L’Imperio, V. et al. Digital pathology for the routine diagnosis of renal diseases: a standard model. J. Nephrol. 34, 681–688 (2021).
Cazzaniga, G. et al. Time for a full digital approach in nephropathology: a systematic review of current artificial intelligence applications and future directions. J. Nephrol. 37, 65–76 (2024).
Xiong, Z. et al. Advances in kidney biopsy lesion assessment through dense instance segmentation. Artif. Intell. Med. 164, 103111 (2025).
Weis, C. A. et al. Assessment of glomerular morphological patterns by deep learning algorithms. J. Nephrol. 35, 417–427 (2022).
Chen, R. J. et al. Towards a general-purpose foundation model for computational pathology. Nat. Med. 30, 850–862 (2024).
Kaczmarzyk, J. R. et al. Open and reusable deep learning for pathology with WSInfer and QuPath. NPJ Precis Oncol. 8, 9 (2024).
Hermsen, M. et al. Deep Learning-Based histopathologic assessment of kidney tissue. J. Am. Soc. Nephrol. 30, 1968–1979 (2019).
Ma, P. et al. AI-based system for analysis of electron microscope images in glomerular disease. JAMA Netw. Open. 8, e2534985 (2025).
Smerkous, D. et al. Development of an automated Estimation of foot process width using deep learning in kidney biopsies from patients with Fabry, minimal change, and diabetic kidney diseases. Kidney Int. 105, 165–176 (2024).
Bankhead, P. et al. QuPath: open source software for digital pathology image analysis. Sci. Rep. 7, 16878 (2017).
Asadi-Aghbolaghi, M. et al. Learning generalizable AI models for multi-center histopathology image classification. NPJ Precis Oncol. 8, 151 (2024).
Acknowledgements
We would like to acknowledge Google for supporting this computational pathology research. We would also like to acknowledge the Italian Ministry of the University MUR Dipartimenti di Eccellenza 2023-2027 (IMPACT MEDICINE, l. 232/2016, art. 1, commi 314-337).
Funding
The work has been funded by the European Union - Next Generation EU - NRRP M6C2 - Investment 2.1 Enhancement and strengthening of biomedical research in the NHS (DIPLOMAT - PNRR-MR1-2022-12375735).
Author information
Authors and Affiliations
Contributions
GioCa and VL defined the study design; MC provided the first annotations for the algorithm development; RB performed statistical analysis; GaCa helped in the scanning procedure; GioCa provided the statistical and computational analysis of the data; LN, GP, AE, VL, FaPa contributed with specialized renal pathology perspective; IC, GV, RM, FePi, LC, ED, MA, AP, SV, GG contributed with enrollment of cases; AE, GG and FP provided the funding acquisition and administrative support. All authors were involved in writing the paper and had final approval of the submitted and published versions.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical consideration
This study, conducted as part of the project ‘DIPLOMAT: DIgital PLatform for OMics and Artificial intelligence in Transplant and native rare renal diseases’ (Grant no. PNRR-MR1-2022-12375735), complies with the principles of the Declaration of Helsinki and all relevant ethical regulations. All participant data were fully anonymized. Ethical approval was granted by the main coordinating center (Integrated University Hospital of Verona, protocol no. 16681, March 16, 2023), with integrated amendments approved by the Ethics Committee 2 of Catania (protocol no. 15/CEL) and the Ethics Committee of Monza/University of Milano-Bicocca (UNIMIB, file no. 4269).
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cazzaniga, G., Carbone, M., Barretta, R. et al. Zebra bodies recognition by artificial intelligence (ZEBRA): a computational tool for Fabry nephropathy. Sci Rep 16, 5072 (2026). https://doi.org/10.1038/s41598-026-35466-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-026-35466-w







