Abstract
The European association of Urology (EAU) suggests a prognostic stratification of Upper Tract Urothelial Cancer (UTUC) based on high and low risk patients, with Radical nephroureterectomy (RNU) and bladder cuff resection being the gold standard for the treatment of non-metastatic High risk UTUC. However, no consensus on post-operative patient management or tools that predict who would benefit the most from a close follow-up rather than adjuvant chemotherapy regimen exist. in Machine Learning (ML) is gaining interest in Urology providing models for prognostic prediction purpose; It’s role in UTUC has not yet been investigated. We aim to develop and validate multiple supervised ML models based on patient- and tumor- related features to predict prognosis in patients with preoperative Histological or Imaging proved UTUC treated with RNU within a multiethnic large cohort. Data from an international multicenter large cohort of histologically proven UTUC patients from Asia and Europe treated with RNU were retrospectively collected. Twenty different ML-supervised predictive models were first trained and then external validate with two separate set. Nomograms were constructed based on 8 independent prognostic factors (age, gender, grading, pT, pN, presence of Carcinoma in Situ (CIS), multifocality and Lymphovascular invasion(LVI)) to predict 6 Outcomes (Overall Survival (OS), Cancer Specific Survival (CSS) and Disease Free Survival (DFS) at 3 and 5 year). Performances were compared using Area-under-curve (AUC) of Receiver-Operating Characteristics (ROC). A total of 3129 patients were enrolled: 637 Asian Patients (training cohort) and 2492 European patients (validation cohort). Upon training assessment, LR models achieved the best results, being the best model for prediction of 4/6 outcomes, with the best result in CSS both at 3 and 5 years (AUC: 0.85, 0.84, 0.81 for CSS-3y, CSS-5y and DFS-3y respectively). Upon external validation, LR(CSL) models achieve the best results, being the number 1 model for prediction of 3/6 outcomes (AUC: 0.84, 0.79, 0.77 for CSS-3y, OS-3y and OS-5y respectively). ML is a promising technology in the field of UTUC. Our model achieve favorable results in terms of prediction of prognosis after RNU, especially in terms of CSS at 3 and 5 years, moreover is the first model of prognosis taking into account the differences in epidemiology existing between European and Asian patients. Further clinical validation and verification of its reliability for the case selection of adjuvant therapy are needed to assess its use in clinical practice linked to clinical decision making. ML is an advancing technology in the field of medicine and urology, which can also be applied to the definition of the prognosis of patients with UTUC undergoing RNU. Our study represents the first experience investigating this potential.
Introduction
Urothelial carcinoma of the upper urinary tract (UTUC) is a rare disease, accounting for about 5–10% of all urothelial carcinomas1.The European association of Urology (EAU) suggests a prognostic stratification of UTUC based on high and low risk patients, with Radical nephroureterectomy (RNU) and bladder cuff resection being the gold standard for the treatment of non-metastatic High risk UTUC2. However, no consensus on post-operative patient’s management or risk stratification exist. The POUT trial, a phase III prospective randomized trial, aiming to evaluate the benefit of adjuvant chemo after RNU vs. surveillance in patients with pT2–T4 pN0–N3 M0 or pTany N1–3 M0 disease, in his preliminary subgroup analysis, demonstrated large variability in the benefits of patients undergoing adjuvant chemotherapy, underlining the need for a stratification strategy after RNU, especially in advanced disease setting3.
Several Prognostic nomograms, based on pre-operative and post-operative factors have been described4,5,6,7,8,9,10. However, to date none of them is currently used nor recommended by guidelines, with only the Yates et Al’s model (a nomogram to predict CSS post RNU) externally validated, using 200 bootstrap resamples11. Moreover, most of this nomograms didn’t take into account the existing differences in Patients with Asian ethnicity, who seem to present with more advanced and higher-grade diseases compared to other ethnicities12. This could led to models that are not Globally generalizable, as trained and tested only in a single-ethnicity population.
Over the past years, Machine learning (ML)-assisted models have been proposed as a supplement or alternative for standard statistical techniques, opening up the possibility of creating non-linear predictive models and with the ability to improve automatically13. In ML, through the process of supervised machine learning, it is possible to build a model using computer algorithms by making them learn the relationship between input variables (characteristics) and outputs (labels). In a first phase, the algorithms analyze the relationship between input and output thanks to a training dataset. Then, the algorithms are applied to a second set of data known as a validation set, to assess how well the predictive model is able to test inputs to predict outputs14.
Various ML techniques have been used in the field of Urology and especially in urothelial carcinomas. However, most of them were used in the lower urinary tract setting while the possible application of artificial intelligence within the field of the UTUC still remain unexplored, especially as a tool for prognosis prediction.
We aim to develop and compare training and validation performances of multiple supervised ML models based on patient- and tumor- related features to predict oncological outcomes [overall survival (OS), Cancer-specific Survival (CSS) and Disease-free survival (DFS)] in patients with preoperative Histological or Imaging proved UTUC treated with RNU within a large cohort of multi-ethnic patients.
Matherial and methods
Data sources
Data from an international multicenter large cohort of patients with preoperative Histological or Imaging proved UTUC treated with RNU between December 2001 and August 2020 were retrospectively collected in two dedicated database: the training cohort, consisting of Asian patients and the validation cohort, entailing of European Patients. Baselines as well as tumor related characteristics of patients were collected.
The common inclusion criteria were: patients undergoing RNU, with preoperative Histological or imaging proven UTUC, for which patient- and tumor-related data and oncological outcomes data were available.
Features and outcomes of interest
We used a total of 8 features as input sourced among the listed patient and tumor related factors on EAU UTUC guidelines: age, gender, grading (according to World Health Organization (WHO) 1973 classification for patienst enrolled before 2004 and to the WHO 2004 classification for patients enrolled after 2004), pT, pN, presence/absence of Carcinoma In Situ (CIS), multifocality and presence/absence of Lymphovascular invasion.
The outcomes of interest were: overall survival (OS), cancer-specific survival (CSS) and disease-free survival (DFS) as defined as both local or intravescical recurrence both at 3 and 5 years from index, defined as date of RNU. Patients were followed up as appropriate, following the principles of EAU-Guidelines in all the Centers involved in the analysis.
ML-supervised models
In this study, 20 predictive models were built using supervised learning algorithms, including logistic regression (LR), decision tree (DT) and its ensemble learning variants, support vector machines (SVM), k-nearest neighbours (KNN), and hard-voting ensemble of these algorithms. LR performs binary classification by modelling the relationship between input features and outcome with sigmoid function15. DT predicts an outcome by traversing a tree-like flowchart structure for a given set of input features16. Random forest (RF) is an ensemble of DTs built using different subsets of the dataset to reduce overfitting and noise17. Gradient boosting is class of sophisticated ensemble DT algorithms, where individual trees are built and summed sequentially such that the prediction error is minimized during model fitting. Different variants were adopted in this study, including the standard gradient boost (gboost) from the free scikit-learn machine learning library (https://scikit-learn.org/), the eXtreme Gradient Boosting (XGBoost) that iteratively combines multiple weaker base predictors18, light gradient boosting machine (lightGBM) by Microsoft Corporation which employs histogram-based DTs grown in a leaf-wise manner19, and categorical boosting (CatBoost)20 which deals with categorical features. SVM performs classification in higher dimensional feature space where a hyperplane is identified to separate distinct classes21. Two linear SVMs were adopted, including support vector classification (SVC), and its variant linearSVC which is more flexible and runs faster. KNN is a classical algorithm which perform classification based on a similarity measure. In hard-voting ensemble learning, denoted here as ensemble learning, the votes for the outcomes from the above algorithms are summed and the predicted class is the one with most votes.
In this study, class sensitive learning (CSL) is applicable to 9 algorithms – XGBoost, lightGBM, CatBoost, SVC, linearSVC, DT, LR, RF and ensemble learning – to counteract class imbalance with the minority class weighted higher22, thereby yielding 18 models of the original and the CSL versions. Adding Gboost and KNN, a total of 20 models were built.
Statistical analysis
Continuous variables were described as median and interquartile ranges, categorical variable as number and percentages as appropriate. We evaluated and compared the performance of each prediction model using Area-under-curve (AUC) of receiver-operating characteristics (ROC) for the training and validation.
Ethics approval
The study was performed in accordance with relevant guidelines and regulations. All experimental protocols were approved by a The Joint Chinese University-New Territories East Cluster Clinical Research Ethics Committee of Hong Kong, SAR. Informed consent was obtained from all participants involved in the study.
Results
Baseline characteristics
Overall, 3129 patients fulfilled the inclusion criteria and were therefore enrolled. The training set, consisting of data from 637 patients undergoing RNU from Asia and the validation set, consisting of 2492 patients from Europe.
Overall, median age was 68 years (61–76), 1959 (62,3%) of patients were male. The proportion of tumor located in the pelvis was similar among the two groups (69.7% (444 patients) in the Asian cohort vs. 64.7% (1613 patients) in European cohort). The detailed baseline characteristics of the training and validation cohorts were listed in Table 1.
Training
The results of each model in terms of AUC for the prediction of each outcome upon training are presented in Supplementary Table 1, the best five models are highlighted for each of the outcome. Overall, LR models seems to achieve the best results, being the number 1 model for prediction of 4/6 outcomes (AUC: 0.85, 0.84, 0.81, 0.77 for CSS-3y, CSS-5y, DFS-3y and OS-5y respectively) and number 2 on the other 2/6 outcomes (OS-3y and DFS-5y).
Regarding OS, the models show results slightly lower than 0,8 in AUC: the best model is SVC for OS-3y [AUC: 0.79 (95% CI 0.7142–0.8630)] and LR for OS-5y [ AUC: 0.77(0 0.7088–0.8398)].
Better results seem to be obtained in predicting the DFS, slightly overcome the threshold of AUC of 0.8, both at 3- and 5-years: the best DFS-3y model is LR [AUC of 0.81 (95% CI 0.7386–0.8816)] while at DFC − 5y prevails LR (CSL) [AUC of 0.80 (95% CI 0.7335–0.8751)].
The outcome showing the overall most promising results in all the trained models is the CSS, with a peak of AUC reaching 0,85: in this case the LR model provides the best results at both CSS-3y [AUC: 0.85 (95% CI 0.7839–0 0.9151)] and CSS-5y [AUC: 0.84 (95% CI 0.7680–0.9070)].
External validation
The results of each model in terms of AUC for the prediction of each outcome upon validation are presented in Supplementary Table 2, the best five models are highlighted for each of the outcome. Overall, upon validation LR(CSL) models achieved the best results, being the number 1 model for prediction of 3/6 outcomes (AUC: 0.84, 0.79, 0.77 for CSS-3y, OS-3y and OS-5y respectively), followed by LinearSVC(CSL) (AUC: 0.82 and 0.82 for DFS-3y and DFS-5y, respectively).
The comparison in AUC of the top 5 models are available in Fig. 1 for prediction of outcomes at 3year and Fig. 2 for 5year, the top 5 model’s AUC details upon training and validation are listed Table 2.
Comparison of the best 5 models upon training and their results upon validation at 3 years. From left to right from top to bottom: OS training and validation at 3 years, CSS training and validation at 3 years, DFS training and validation at 3 years. OS: Overall Survival; CSS: Cancer Specific Survival; DFS: Disease Free Suvival.
Comparison of the best 5 models upon training and their results upon validation at 5 years. From left to right, from top to bottom: OS training and validation at 5 years, CSS training and validation at 5 years, DFS training and validation at 5 years. OS: Overall Survival; CSS: Cancer Specific Survival; DFS: Disease Free Suvival.
Regarding OS, overall the models upon external validation show results slightly higher as compared to training: the best model is LR (CSL) for OS 3y [AUC: 0.79] and OS 5y [ AUC: 0.77].
Better results are obtained in predicting the DFS, slightly overcome the threshold of AUC of 0.8, both at 3- and 5-years: the best DFS-3y model is SVC [AUC of 0.82] while at DFC − 5y prevails LinearSVC(CSL) [AUC of 0.82].
Similar to training, the outcome with the most significant results is the CSS, with a peak of AUC reaching 0,84: the best CSS-3y is LR model [AUC: 0.84], while for CSS-5y best performance is by SVC [AUC: 0.83].
Discussion
In our study, the use of various ML-supervised models has shown good prediction value of oncological outcomes of UTUC after RNU. Using readily available clinical parameters, ML-supervised models could provide an accurate prediction of prognosis, potentially implementing pTNM staging alone as a guide for postoperative treatment. Among the various experiments, the LR ML-supervised model obtains the best results in predicting CSS at both 3 and 5 years, with a maximum AUC reached of 0.85 and 0.84 respectively upon training, while LR(SVC) is more reliable upon validation, with best results in CSS 3-year. Although we acknowledge that our is not the first attempt to propose a prediction models after RNU, the existing models are not fully comparable: (1) to date, we used one of the largest cohort of patients (n = 3129) for UTUC’s prognosis prediction; (2) we intentionally included two set of patients with different ethnicity; (3) we explored the applicability of ML-supervised models in UTUC field; (4) we performed a complete external validation, in fact the only model external validate was the Yates et Al’s model, using 200 bootstrap resamples.
Adjuvant therapies are invasive and burdened by toxicity, especially in the setting of single-kidney patients who might not even need it if better prediction tools exist. Numerous efforts have been made to generate predictive models of UTUC postoperative prognosis. Despite this, there is a lack of validation which makes these models still not reliable in clinical practice and none is yet recommended with strong evidence by current European guidelines. The POUT trial3, which is currently interested in the validation of adjuvant chemotherapy after RNU for UTUC, uses only pTNM staging data for selection porpoise, including pT2–T4 pN0–N3 M0 or pT any N1–3 M0; however his preliminary subgroup analysis demonstrated large variability in the benefits of patients undergoing adjuvant chemotherapy, underlining the need for a better stratification strategy after RNU, taking into account additional features.
Several prognosis prediction nomograms have been proposed, these tools exceed AJCC/TNM staging for prognosis of survival in internal validation. Among these, two studies4,6 include UTUC patients undergoing surgery regardless of RNU or other conservative surgery; Ku et Al11 limited to an external validation study, while instead Krabbe et al.7used different outcomes from ours study as per Relapse free survival, are therefore not comparable. Overall, four models5,8,9,10 are comparable, however, one of them use the old WHO 1973 grading system8. Those nomograms variously used 7 different independent prognostic factors (Age, pT, LVI, Location, CIS, Architecture, pN), with Cha’s model being the more comprehensive (7 features) followed by Seisen (6 features) and Roupret (5 features). All of the models assessed 5y-CSS; Cha’s model additionally assessed 2y-CSS; none exceeds the trade-off of 0.81 in terms of AUC for the prediction of CSS, neither in the training nor in the internal validation set. This support the hypothesis that the ML could implement existing models.
Furthermore, our study represents the first attempt to generate a model that can be reliable in more than one single ethnicity: most of this nomograms didn’t take into account the existing differences in Asian patients, who seem to present more advanced and higher-grade diseases compared to other ethnicities12. This could be explained with differences in genetic and epigenetic factors such as environmental and occupational exposures, lifestyle choices as well as socioeconomic factors23. Aiming to move towards a race-conscious medicine, keeping in mind that as suggested by Cardena et al.24 clinical research should be used to examine structural barrier, we decided use two set of patients with different ethnicity, rather than using race as a proxy for biology. Our models are therefor tested to both European and Asian patients and can be reliable regardless the origin of the patient.
Various machine learning techniques have been used in the field of Urology, most of them within the lower urinary tract setting: (1) regarding radiomics, AI have been implemented, capable of distinguishing between bladder tumor and normal bladder at multi parametric magnetic resonance imaging (mpMRI)25 or determining the stage of bladder cancer at Computed Tomography (CT)26; (2) in terms of prognosis, the only experience derives from Lam et al. and Wang et al., who used clinicopathological evidence to create and test a significant number of AI algorithms to estimate the 5-year survival after radical cystectomy27,28. To date, this is the first experience investigating the possible application of ML-supervised algorithms to the UTUC and in particular to predict prognosis after RNU.
Lastly, or model may help clinicians in stratifying patients with UTUC, addressing the challenge of understanding clinical aggressiveness based on baseline characteristics of this specific tumors. Not only it can be used to increment follow-up strategies in patients with high risk of recurrence, but also can be used to stratify potential candidate to adjuvant and subsequential therapies.
This study has several limitations. First, its nature as a multicenter study may have introduced inconsistencies in surgical skills, type of bladder cuff performed, use or not of intra- or perioperative mitomycin, neoadjuvant use of chemotherapy and pathological diagnoses. Second, since the cohort straddles 2004, the use of two different pathological gradings may have influenced the algorithms. Furthermore, 2 patients on training and 15 patients on validation cohort had a pT0 diagnosis at final histopathological specimen: even if this may reflect real world data, on the other hand the prognosis for those patients is by definition excellent. Moreover, there is a non-negligible difference in gender representation among the two cohorts: thus, due to the different underneath biology, may influence response to treatments and prognosis. Lastly, the lack of centralized pathological revisions of imaging and specimen could introduce a bias.
Conclusions
ML is a promising technology in the field of UTUC. Our model achieve favorable results in terms of prediction of prognosis after RNU, especially in terms of CSS at 3 and 5 years, moreover is the first model of prognosis taking into account the differences in epidemiology existing between European and Asian patients. Further clinical validation and verification of its reliability for the case selection of adjuvant therapy are needed to assess its use in clinical practice linked to clinical decision making.
Data availability
Data are available for bona fide researchers who request it from the authors. Please contact the corresponding author for related requests.
References
Siegel, R. L., Miller, K. D., Fuchs, H. E., Jemal, A. & Cancer Statistics CA Cancer J. Clin. 71 (1), 7–33. (2021). (2021). https://doi.org/10.3322/CAAC.21654
‘Upper Urinary Tract Urothelial Cell Carcinoma - INTRODUCTION - Uroweb’. Accessed: Feb. 28, 2023. [Online]. Available: https://uroweb.org/guidelines/upper-urinary-tract-urothelial-cell-carcinoma
Birtle, A. et al. Adjuvant chemotherapy in upper tract urothelial carcinoma (the POUT trial): a phase 3, open-label, randomised controlled trial. Lancet 395(10232), 1268–1277. https://doi.org/10.1016/S0140-6736(20)30415-3 (Apr. 2020).
Yoshida, T. et al. Development and external validation of a preoperative nomogram for predicting pathological locally advanced disease of clinically localized upper urinary tract carcinoma. Cancer Med. 9 (11), 3733–3741 (2020). https://doi.org/10.1002/CAM4.2988
Rouprêt, M. et al. Prediction of cancer specific survival after radical nephroureterectomy for upper tract urothelial carcinoma: development of an optimized postoperative nomogram using decision curve analysis. J. Urol. 189 (5), 1662–1669 (2013). https://doi.org/10.1016/J.JURO.2012.10.057
Zhang, G. L. & Zhou, W. ‘A Model for the Prediction of Survival in Patients With Upper Tract Urothelial Carcinoma After Surgery’, Dose Response, 17(4), (2019). https://doi.org/10.1177/1559325819882872
Krabbe, L. M. et al. Mar., ‘Postoperative Nomogram for Relapse-Free Survival in Patients with High Grade Upper Tract Urothelial Carcinoma’, J Urol, 197(3 Pt 1), 580–589, (2017). https://doi.org/10.1016/J.JURO.2016.09.078
Cha, E. K. et al. Apr., ‘Predicting clinical outcomes after radical nephroureterectomy for upper tract urothelial carcinoma’. Eur. Urol. 61 (4), 818–825 (2012). https://doi.org/10.1016/J.EURURO.2012.01.021
Yates, D. R. et al. Mar., ‘Cancer-specific survival after radical nephroureterectomy for upper urinary tract urothelial carcinoma: proposal and multi-institutional validation of a post-operative nomogram’, Br J Cancer, 106(6), 1083–1088, (2012). https://doi.org/10.1038/BJC.2012.64
Seisen, T. et al. Nov., ‘Postoperative nomogram to predict cancer-specific survival after radical nephroureterectomy in patients with localised and/or locally advanced upper tract urothelial carcinoma without metastasis’. BJU Int. 114 (5), 733–740 (2014). https://doi.org/10.1111/BJU.12631
Ku, J. H. et al. External validation of an online nomogram in patients undergoing radical nephroureterectomy for upper urinary tract urothelial carcinoma. Br. J. Cancer. 109 (5), 1130–1136 (2013). https://doi.org/10.1038/BJC.2013.462
Matsumoto, K. et al. Racial differences in the outcome of patients with urothelial carcinoma of the upper urinary tract: an international study’, BJU Int, 108(8b), E304–E309, (2011). https://doi.org/10.1111/j.1464-410X.2011.10188.x
Sargent, D. J. ‘Comparison of artificial neural networks with other statistical approaches’, Cancer, 91(S8) 1636–1642, (2001). https://doi.org/10.1002/1097-0142(20010415)91:8+%3C1636::AID-CNCR1176%3E3.0.CO;2-D
Rajkomar, A., Dean, J. & Kohane, I. Machine Learning in Medicine, N Engl J Med, 380(14), 1347–1358, (2019). https://doi.org/10.1056/NEJMRA1814259
Wright, R. E. ‘Logistic regression.’, in Reading and Understanding Multivariate Statistics., Washington, DC, US: American Psychological Association, 217–244. (1995).
Safavian, S. R. & Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man. Cybern. 21 (3), 660–674. https://doi.org/10.1109/21.97458 (1991).
Breiman, L. ‘Random forests’, Mach Learn, 45(1) 5–32, (2001). https://doi.org/10.1023/A:1010933404324/METRICS
Chen, T. & Guestrin, C. ‘XGBoost: A Scalable Tree Boosting System’, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, https://doi.org/10.1145/2939672
Ke, G. et al. ‘LightGBM: A Highly Efficient Gradient Boosting Decision Tree’, Accessed: Feb. 28, 2023. [Online]. Available: https://github.com/Microsoft/LightGBM
Dorogush, A. V., Ershov, V. & Gulin, A. ‘CatBoost: gradient boosting with categorical features support’, Oct. (2018). https://doi.org/10.48550/arxiv.1810.11363
Cortes, C., Vapnik, V. & Saitta, L. Support-vector networks, Mach. Learn. 20(3): 273–297, (1995). https://doi.org/10.1007/BF00994018
Thai-Nghe, N., Gantner, Z. & Schmidt-Thieme, L. ‘Cost-Sensitive Learning Methods for Imbalanced Data’, Accessed: Feb. 28, 2023. [Online]. Available: http://www.cs.waikato.ac.nz/ml/weka/
Soria, F. et al. Epidemiology, diagnosis, preoperative evaluation and prognostic assessment of upper-tract urothelial carcinoma (UTUC). World J. Urol. 35 (3), 379–387 (2017). https://doi.org/10.1007/s00345-016-1928-x
Cerdeña, J. P., Plaisime, M. V. & Tsai, J. From race-based to race-conscious medicine: how anti-racist uprisings call Us to act. Lancet 396, 1125–1128. https://doi.org/10.1016/S0140-6736(20)32076-6 (Oct. 2020).
Xu, X. et al. Three-dimensional texture features from intensity and high-order derivative maps for the discrimination between bladder tumors and wall tissues via MRI. Int. J. Comput. Assist. Radiol. Surg. 12 (4), 645–656 (2017). https://doi.org/10.1007/S11548-017-1522-8
Garapati, S. S. et al. Urinary bladder cancer staging in CT urography using machine learning. Med. Phys. 44 (11), 5814 (2017). https://doi.org/10.1002/MP.12510
Wang, G., Lam, K. M., Deng, Z. & Choi, K. S. Prediction of mortality after radical cystectomy for bladder cancer by machine learning techniques. Comput. Biol. Med. 63, 124–132 . (2015). https://doi.org/10.1016/J.COMPBIOMED.2015.05.015
Lam, K. M., He, X. J. & Choi, K. S. ‘Using artificial neural network to predict mortality of radical cystectomy for bladder cancer’, Proceedings of International Conference on Smart Computing, SMARTCOMP 2014, 201–207, (2014). https://doi.org/10.1109/SMARTCOMP.2014.7043859
Author information
Authors and Affiliations
Contributions
RN contributed acquisition of data, analysis and interpretation of data, drafting of manuscript, critical revision of the manuscript for important intellectual content and statistical analysis; NH contributed acquisition of data, analysis and interpretation of data; statistical analysios: HYL contributed acquisition of data; HLK contributed acquisition of data; WJW contributed acquisition of data; EL contributed acquisition of data; PS contributed acquisition of data; CHMW contributed acquisition of data, critical revision of manuscript; ICHK contributed critical revision of manuscript; CHL contributed analysis and interpretation of data; GG contributed acquisition of data; NV contributed acquisition of data; PG contributed acquisition of data; NCFA contributed acquisition of data, supervision; CCL contributed acquisition of data; WML contributed acquisition of data ; HCY contributed acquisition of data; RC contributed acquisition of data; SS contributed acquisition of data; MG contributed acquisition of data; SS contributed acquisition of data, TC contributed study concept and design, acquisition of data, Analysis and interpretation of data, and Statistical analysis; JYCT contributed study concept and design, Analysis and interpretation of data, statistical analysis, supervision.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nicoletti, R., Ho, N., Lee, HY. et al. Training and external validation of machine learning supervised prognostic models of upper tract urothelial cancer (UTUC) after nephroureterectomy. Sci Rep 16, 2847 (2026). https://doi.org/10.1038/s41598-025-29043-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-29043-w

