Training and external validation of machine learning supervised prognostic models of upper tract urothelial cancer (UTUC) after nephroureterectomy

Nicoletti, Rossella; Ho, Nick; Lee, Hsiang-Ying; Wu, Wen-Jeng; Laukhtina, Ekaterina; Spatafora, Pietro; Wong, Chris Ho-Ming; Ko, Ivan Ching-Ho; Leung, Chi-Ho; Giannarini, Gianluca; Vasdev, Nikhil; Gontero, Paolo; Ng, Chi-Fai; Li, Ching-Chia; Li, Wei-Ming; Ke, Hung-Lung; Yeh, Hsin‑Chih; Campi, Riccardo; Serni, Sergio; Gacci, Mauro; Shariat, Shahrokh; Choi, Thomas; Teoh, Jeremy Yuen-Chun

doi:10.1038/s41598-025-29043-w

Article
Open access
Published: 22 January 2026

Rossella Nicoletti^1,2,
Nick Ho³,
Hsiang-Ying Lee^4,5,
Wen-Jeng Wu^4,5,
Ekaterina Laukhtina⁶,
Pietro Spatafora²,
Chris Ho-Ming Wong¹,
Ivan Ching-Ho Ko¹,
Chi-Ho Leung¹,
Gianluca Giannarini⁷,
Nikhil Vasdev^8,9,
Paolo Gontero¹⁰,
Chi-Fai Ng¹,
Ching-Chia Li^4,5,11,
Wei-Ming Li^4,5,11,
Hung-Lung Ke^4,5,11,
Hsin‑Chih Yeh^4,5,
Riccardo Campi²,
Sergio Serni²,
Mauro Gacci²,
Shahrokh Shariat^6,12,13,14,
Thomas Choi³ &
…
Jeremy Yuen-Chun Teoh^1,6

Scientific Reports volume 16, Article number: 2847 (2026)

Abstract

The European Association of Urology (EAU) suggests a prognostic stratification of upper tract urothelial cancer (UTUC) into high- and low-risk patients, with radical nephroureterectomy (RNU) with bladder cuff resection being the gold standard for the treatment of non-metastatic high-risk UTUC. However, there is no consensus on post-operative patient management, nor are there tools that predict who would benefit most from close follow-up rather than an adjuvant chemotherapy regimen. Machine learning (ML) is gaining interest in urology as a source of models for prognostic prediction; its role in UTUC has not yet been investigated. We aimed to develop and validate multiple supervised ML models based on patient- and tumor-related features to predict prognosis in patients with preoperative histologically or imaging-proven UTUC treated with RNU within a large multiethnic cohort. Data from a large international multicenter cohort of patients with histologically proven UTUC from Asia and Europe treated with RNU were retrospectively collected. Twenty different supervised ML predictive models were first trained and then externally validated using two separate datasets. Nomograms were constructed based on 8 independent prognostic factors (age, gender, grading, pT, pN, presence of carcinoma in situ (CIS), multifocality and lymphovascular invasion (LVI)) to predict 6 outcomes (overall survival (OS), cancer-specific survival (CSS) and disease-free survival (DFS) at 3 and 5 years). Performances were compared using the area under the curve (AUC) of the receiver operating characteristic (ROC). A total of 3129 patients were enrolled: 637 Asian patients (training cohort) and 2492 European patients (validation cohort). Upon training assessment, logistic regression (LR) models achieved the best results, being the best model for prediction of 4/6 outcomes, with the best results for CSS at both 3 and 5 years (AUC: 0.85, 0.84 and 0.81 for CSS-3y, CSS-5y and DFS-3y, respectively). Upon external validation, LR models with class sensitive learning, LR(CSL), achieved the best results, ranking first for prediction of 3/6 outcomes (AUC: 0.84, 0.79 and 0.77 for CSS-3y, OS-3y and OS-5y, respectively). ML is a promising technology in the field of UTUC. Our models achieved favorable results in predicting prognosis after RNU, especially for CSS at 3 and 5 years; moreover, this is the first prognostic model to take into account the epidemiological differences between European and Asian patients. Further clinical validation, and verification of its reliability for selecting candidates for adjuvant therapy, are needed before it can be used in clinical decision making. ML is an advancing technology in medicine and urology that can also be applied to defining the prognosis of patients with UTUC undergoing RNU. Our study represents the first experience investigating this potential.

Introduction

Urothelial carcinoma of the upper urinary tract (UTUC) is a rare disease, accounting for about 5–10% of all urothelial carcinomas¹. The European Association of Urology (EAU) suggests a prognostic stratification of UTUC into high- and low-risk patients, with radical nephroureterectomy (RNU) with bladder cuff resection being the gold standard for the treatment of non-metastatic high-risk UTUC². However, there is no consensus on post-operative patient management or risk stratification. The POUT trial, a phase III prospective randomized trial evaluating the benefit of adjuvant chemotherapy after RNU vs. surveillance in patients with pT2–T4 pN0–N3 M0 or pT any N1–3 M0 disease, demonstrated in its preliminary subgroup analysis large variability in the benefit for patients undergoing adjuvant chemotherapy, underlining the need for a stratification strategy after RNU, especially in the advanced disease setting³.

Several prognostic nomograms based on pre-operative and post-operative factors have been described^{4,5,6,7,8,9,10}. However, to date none of them is used or recommended by guidelines, and only the model of Yates et al. (a nomogram to predict CSS after RNU) has been externally validated, using 200 bootstrap resamples¹¹. Moreover, most of these nomograms did not take into account the existing differences in patients of Asian ethnicity, who seem to present with more advanced and higher-grade disease compared to other ethnicities¹². This could lead to models that are not globally generalizable, as they were trained and tested only in a single-ethnicity population.

Over the past years, machine learning (ML)-assisted models have been proposed as a supplement or alternative to standard statistical techniques, making it possible to create non-linear predictive models that can improve automatically¹³. In supervised machine learning, a model is built by having computer algorithms learn the relationship between input variables (characteristics) and outputs (labels). In a first phase, the algorithms analyze the relationship between input and output using a training dataset. Then, the algorithms are applied to a second set of data, known as a validation set, to assess how well the predictive model can use new inputs to predict outputs¹⁴.
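The two phases described above can be illustrated with a minimal sketch in plain Python. It is not the authors' code: the toy data, the function names (`fit_logistic`, `predict_proba`) and the hyperparameters are assumptions chosen only to show a model being fitted on a training set and then applied to a separate validation set.

```python
import math

def sigmoid(z):
    """Logistic function mapping any real number to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, lr=0.1, epochs=5000):
    """Training phase: fit weights and bias by batch gradient descent on the log-loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_proba(X, w, b):
    """Validation phase: apply the fitted model to inputs it has not seen."""
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) for xi in X]

# Hypothetical toy data: one input feature, binary label.
X_train = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
y_train = [0, 0, 0, 1, 1, 1]
X_valid = [[0.5], [4.5]]
y_valid = [0, 1]

w, b = fit_logistic(X_train, y_train)
valid_probs = predict_proba(X_valid, w, b)
valid_pred = [int(p >= 0.5) for p in valid_probs]
```

Comparing `valid_pred` with `y_valid` is the simplest form of the validation step; a threshold-free metric such as the AUC would be computed from `valid_probs` instead.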

Various ML techniques have been used in urology, especially in urothelial carcinoma. However, most of them were applied in the lower urinary tract setting, while the possible application of artificial intelligence to UTUC remains unexplored, especially as a tool for prognosis prediction.

We aimed to develop multiple supervised ML models based on patient- and tumor-related features and to compare their training and validation performance in predicting oncological outcomes [overall survival (OS), cancer-specific survival (CSS) and disease-free survival (DFS)] in patients with preoperative histologically or imaging-proven UTUC treated with RNU within a large multi-ethnic cohort.

Material and methods

Data sources

Data from a large international multicenter cohort of patients with preoperative histologically or imaging-proven UTUC treated with RNU between December 2001 and August 2020 were retrospectively collected in two dedicated databases: the training cohort, consisting of Asian patients, and the validation cohort, consisting of European patients. Baseline and tumor-related characteristics of patients were collected.

The common inclusion criteria were: patients undergoing RNU, with preoperative histologically or imaging-proven UTUC, for whom patient- and tumor-related data and oncological outcome data were available.

Features and outcomes of interest

We used a total of 8 features as input, selected from the patient- and tumor-related factors listed in the EAU UTUC guidelines: age, gender, grading (according to the World Health Organization (WHO) 1973 classification for patients enrolled before 2004 and the WHO 2004 classification for patients enrolled after 2004), pT, pN, presence/absence of carcinoma in situ (CIS), multifocality and presence/absence of lymphovascular invasion (LVI).

The outcomes of interest were overall survival (OS), cancer-specific survival (CSS) and disease-free survival (DFS), with recurrence defined as either local or intravesical recurrence. Each outcome was assessed at 3 and 5 years from the index date, defined as the date of RNU. Patients were followed up as appropriate, following the principles of the EAU guidelines, in all the centers involved in the analysis.
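Turning a time-to-event outcome into a binary label at a fixed horizon can be sketched as follows. This is an illustration under stated assumptions, not the study's procedure: the text does not say how patients with follow-up shorter than the horizon were handled, so excluding them (returning `None`) and coding the event as the positive class are assumptions, as is the function name `binary_outcome`.

```python
from typing import Optional

def binary_outcome(event: bool, months: float, horizon_months: float) -> Optional[int]:
    """Label a patient for a fixed-horizon outcome (e.g. 36 or 60 months after RNU).

    Returns 1 if the event occurred within the horizon, 0 if the patient was
    still event-free at the horizon, and None if follow-up ended before the
    horizon without an event (status at the horizon is unknown).
    """
    if event and months <= horizon_months:
        return 1
    if months >= horizon_months:
        return 0
    return None

# Hypothetical patients: (event observed, months from RNU to event or last follow-up)
patients = [(True, 20.0), (True, 50.0), (False, 40.0), (False, 10.0)]
labels_3y = [binary_outcome(e, m, 36.0) for e, m in patients]
labels_5y = [binary_outcome(e, m, 60.0) for e, m in patients]
```

In this sketch a patient whose event occurs at 50 months counts as event-free for the 3-year label but as an event for the 5-year label.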

ML-supervised models

In this study, 20 predictive models were built using supervised learning algorithms, including logistic regression (LR), decision tree (DT) and its ensemble learning variants, support vector machines (SVM), k-nearest neighbours (KNN), and a hard-voting ensemble of these algorithms. LR performs binary classification by modelling the relationship between input features and outcome with a sigmoid function¹⁵. DT predicts an outcome by traversing a tree-like flowchart structure for a given set of input features¹⁶. Random forest (RF) is an ensemble of DTs built using different subsets of the dataset to reduce overfitting and noise¹⁷. Gradient boosting is a class of sophisticated ensemble DT algorithms in which individual trees are built and summed sequentially such that the prediction error is minimized during model fitting. Different variants were adopted in this study, including the standard gradient boost (gboost) from the free scikit-learn machine learning library (https://scikit-learn.org/), eXtreme Gradient Boosting (XGBoost), which iteratively combines multiple weaker base predictors¹⁸, light gradient boosting machine (lightGBM) by Microsoft Corporation, which employs histogram-based DTs grown in a leaf-wise manner¹⁹, and categorical boosting (CatBoost)²⁰, which deals with categorical features. SVM performs classification in a higher-dimensional feature space where a hyperplane is identified to separate distinct classes²¹. Two linear SVMs were adopted: support vector classification (SVC) and its variant linearSVC, which is more flexible and runs faster. KNN is a classical algorithm which performs classification based on a similarity measure. In hard-voting ensemble learning, denoted here as ensemble learning, the votes for the outcomes from the above algorithms are summed and the predicted class is the one with the most votes.
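A hard-voting ensemble of the kind described above can be sketched with scikit-learn, the library named in the text. This is a minimal illustration, not the authors' pipeline: the synthetic data from `make_classification`, the choice of four base learners and all hyperparameters are assumptions, and the boosting libraries are left out to keep the example dependent on scikit-learn alone.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for an 8-feature binary outcome dataset (not study data).
X, y = make_classification(n_samples=300, n_features=8,
                           weights=[0.8, 0.2], random_state=0)
X_train, y_train = X[:200], y[:200]
X_valid, y_valid = X[200:], y[200:]

# Base learners; each casts one vote per patient.
base_learners = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("dt", DecisionTreeClassifier(random_state=0)),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
    ("knn", KNeighborsClassifier()),
]

# voting="hard" sums the class votes and predicts the majority class.
ensemble = VotingClassifier(estimators=base_learners, voting="hard")
ensemble.fit(X_train, y_train)
pred = ensemble.predict(X_valid)
```

With an even number of voters a tie is possible; scikit-learn documents that ties are resolved by the ascending sort order of the class labels, so the number of base learners is a design choice worth making deliberately.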

In this study, class sensitive learning (CSL) was applicable to 9 algorithms (XGBoost, lightGBM, CatBoost, SVC, linearSVC, DT, LR, RF and ensemble learning) to counteract class imbalance by weighting the minority class higher²², thereby yielding 18 models (the original and the CSL versions). Adding Gboost and KNN, a total of 20 models were built.
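The model count and the idea of weighting the minority class higher can be made concrete with a short sketch. The weighting formula shown is the "balanced" heuristic used by scikit-learn (`n_samples / (n_classes * class_count)`); whether the study used this exact scheme is not stated, so it is an assumption, as are the helper name `balanced_class_weights` and the example class sizes.

```python
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count),
    so the rarer class receives the larger weight."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * c) for cls, c in counts.items()}

# Algorithms for which a CSL variant was built, plus the two without one.
csl_capable = ["XGBoost", "lightGBM", "CatBoost", "SVC", "linearSVC",
               "DT", "LR", "RF", "ensemble"]
no_csl = ["Gboost", "KNN"]
model_names = csl_capable + [f"{name}(CSL)" for name in csl_capable] + no_csl

# Hypothetical imbalanced outcome: 80 patients without the event, 20 with it.
weights = balanced_class_weights([0] * 80 + [1] * 20)
```

Here the minority class gets weight 2.5 and the majority class 0.625, and `len(model_names)` reproduces the 9 + 9 + 2 = 20 models described in the text.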

Statistical analysis

Continuous variables were described as medians and interquartile ranges, and categorical variables as numbers and percentages, as appropriate. We evaluated and compared the performance of each prediction model using the area under the curve (AUC) of the receiver operating characteristic (ROC) for the training and validation sets.
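For readers less familiar with the metric, the AUC equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The sketch below computes it that way in plain Python and adds a percentile bootstrap confidence interval. The bootstrap is an assumption for illustration only: the text reports 95% CIs but does not state how they were obtained, and the function names and toy scores are hypothetical.

```python
import random

def roc_auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for s, y in zip(scores, y_true) if y == 1]
    neg = [s for s, y in zip(scores, y_true) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def bootstrap_ci(y_true, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (assumed method, not from the paper)."""
    rng = random.Random(seed)
    idx = range(len(y_true))
    aucs = []
    while len(aucs) < n_boot:
        sample = [rng.choice(idx) for _ in idx]
        ys = [y_true[i] for i in sample]
        if len(set(ys)) < 2:  # skip resamples that contain a single class
            continue
        aucs.append(roc_auc(ys, [scores[i] for i in sample]))
    aucs.sort()
    lower = aucs[int((alpha / 2) * n_boot)]
    upper = aucs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Toy example: four patients, two with the event.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
auc = roc_auc(y_true, scores)  # 3 of 4 positive/negative pairs are ordered correctly
```

Calling `bootstrap_ci(y_true, scores)` returns a `(lower, upper)` pair; with only four cases the interval is very wide, which is why such intervals are only informative on cohorts of realistic size.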

Ethics approval

The study was performed in accordance with relevant guidelines and regulations. All experimental protocols were approved by the Joint Chinese University-New Territories East Cluster Clinical Research Ethics Committee of Hong Kong SAR. Informed consent was obtained from all participants involved in the study.

Results

Baseline characteristics

Overall, 3129 patients fulfilled the inclusion criteria and were therefore enrolled. The training set consisted of data from 637 patients from Asia undergoing RNU, and the validation set consisted of 2492 patients from Europe.

Overall, median age was 68 years (IQR 61–76), and 1959 (62.3%) patients were male. The proportion of tumors located in the pelvis was similar between the two groups (69.7% (444 patients) in the Asian cohort vs. 64.7% (1613 patients) in the European cohort). The detailed baseline characteristics of the training and validation cohorts are listed in Table 1.

Table 1 Baseline characteristics of patients enrolled in training (Asian patients) and validation (European patients) cohort.

Training

The results of each model in terms of AUC for the prediction of each outcome upon training are presented in Supplementary Table 1; the best five models are highlighted for each outcome. Overall, LR models seem to achieve the best results, ranking first for prediction of 4/6 outcomes (AUC: 0.85, 0.84, 0.81 and 0.77 for CSS-3y, CSS-5y, DFS-3y and OS-5y, respectively) and second for the other 2/6 outcomes (OS-3y and DFS-5y).

Regarding OS, the models show AUCs slightly lower than 0.8: the best model is SVC for OS-3y [AUC: 0.79 (95% CI 0.7142–0.8630)] and LR for OS-5y [AUC: 0.77 (95% CI 0.7088–0.8398)].

Better results seem to be obtained in predicting DFS, slightly exceeding the AUC threshold of 0.8 at both 3 and 5 years: the best DFS-3y model is LR [AUC of 0.81 (95% CI 0.7386–0.8816)], while for DFS-5y LR(CSL) prevails [AUC of 0.80 (95% CI 0.7335–0.8751)].

The outcome showing the most promising results overall across the trained models is CSS, with a peak AUC of 0.85: in this case the LR model provides the best results for both CSS-3y [AUC: 0.85 (95% CI 0.7839–0.9151)] and CSS-5y [AUC: 0.84 (95% CI 0.7680–0.9070)].
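The "4/6 outcomes" summary can be checked by tallying the best training model per outcome. The short sketch below uses only the model names and AUCs reported in this section; the variable names are illustrative.

```python
from collections import Counter

# Best training model and AUC per outcome, as reported in this section.
best_training_model = {
    "OS-3y": ("SVC", 0.79),
    "OS-5y": ("LR", 0.77),
    "DFS-3y": ("LR", 0.81),
    "DFS-5y": ("LR(CSL)", 0.80),
    "CSS-3y": ("LR", 0.85),
    "CSS-5y": ("LR", 0.84),
}

# How often each model ranks first across the six outcomes.
first_place_counts = Counter(model for model, _ in best_training_model.values())

# Outcome with the highest best-model AUC.
top_outcome = max(best_training_model, key=lambda k: best_training_model[k][1])
```

The tally gives LR four first places, with SVC and LR(CSL) one each, and identifies CSS-3y as the outcome with the highest training AUC.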

External validation

The results of each model in terms of AUC for the prediction of each outcome upon validation are presented in Supplementary Table 2; the best five models are highlighted for each outcome. Overall, upon validation LR(CSL) models achieved the best results, ranking first for prediction of 3/6 outcomes (AUC: 0.84, 0.79 and 0.77 for CSS-3y, OS-3y and OS-5y, respectively), followed by LinearSVC(CSL) (AUC: 0.82 and 0.82 for DFS-3y and DFS-5y, respectively).

The comparison of the AUCs of the top 5 models is shown in Fig. 1 for prediction of outcomes at 3 years and in Fig. 2 for 5 years; the AUC details of the top 5 models upon training and validation are listed in Table 2.

Table 2 Top 5 models for prediction of each of the six outcomes, in terms of AUC upon training and Validation.

Regarding OS, the models upon external validation overall show results slightly higher than in training: the best model is LR(CSL) for both OS-3y [AUC: 0.79] and OS-5y [AUC: 0.77].

Better results are obtained in predicting DFS, slightly exceeding the AUC threshold of 0.8 at both 3 and 5 years: the best DFS-3y model is SVC [AUC of 0.82], while for DFS-5y LinearSVC(CSL) prevails [AUC of 0.82].

Similar to training, the outcome with the best results is CSS, with a peak AUC of 0.84: the best CSS-3y model is LR [AUC: 0.84], while for CSS-5y the best performance is by SVC [AUC: 0.83].

Discussion

In our study, various supervised ML models showed good predictive value for oncological outcomes of UTUC after RNU. Using readily available clinical parameters, supervised ML models could provide an accurate prediction of prognosis, potentially improving on pTNM staging alone as a guide for postoperative treatment. Among the models tested, the LR model obtained the best results in predicting CSS at both 3 and 5 years, with maximum AUCs of 0.85 and 0.84, respectively, upon training, while LR(CSL) was more reliable upon validation, with its best result for CSS at 3 years. Although we acknowledge that ours is not the first attempt to propose a prediction model after RNU, the existing models are not fully comparable: (1) to date, we used one of the largest cohorts of patients (n = 3129) for UTUC prognosis prediction; (2) we intentionally included two sets of patients of different ethnicity; (3) we explored the applicability of supervised ML models in the UTUC field; (4) we performed a complete external validation, whereas the only previously externally validated model was that of Yates et al., validated using 200 bootstrap resamples.

Adjuvant therapies are invasive and burdened by toxicity, especially in single-kidney patients, who might not even need them if better prediction tools existed. Numerous efforts have been made to generate predictive models of UTUC postoperative prognosis. Despite this, a lack of validation means these models are still not reliable in clinical practice, and none is yet recommended with strong evidence by current European guidelines. The POUT trial³, which addresses the validation of adjuvant chemotherapy after RNU for UTUC, uses only pTNM staging data for patient selection, including pT2–T4 pN0–N3 M0 or pT any N1–3 M0; however, its preliminary subgroup analysis demonstrated large variability in the benefit for patients undergoing adjuvant chemotherapy, underlining the need for a better stratification strategy after RNU that takes additional features into account.

Several prognosis prediction nomograms have been proposed; these tools outperform AJCC/TNM staging for survival prognosis in internal validation. Among these, two studies^4,6 include UTUC patients undergoing surgery regardless of whether it was RNU or conservative surgery; Ku et al.¹¹ was limited to an external validation study; and Krabbe et al.⁷ used a different outcome from ours, relapse-free survival. These studies are therefore not comparable. Overall, four models^5,8,9,10 are comparable; however, one of them uses the old WHO 1973 grading system⁸. These nomograms variously used 7 different independent prognostic factors (age, pT, LVI, location, CIS, architecture, pN), with Cha's model being the most comprehensive (7 features), followed by Seisen's (6 features) and Rouprêt's (5 features). All of the models assessed 5y-CSS; Cha's model additionally assessed 2y-CSS; none exceeded an AUC of 0.81 for the prediction of CSS, in either the training or the internal validation set. This supports the hypothesis that ML could improve on existing models.

Furthermore, our study represents the first attempt to generate a model that can be reliable in more than one ethnicity: most of the existing nomograms did not take into account the differences in Asian patients, who seem to present with more advanced and higher-grade disease compared to other ethnicities¹². This could be explained by differences in genetic and epigenetic factors such as environmental and occupational exposures, lifestyle choices and socioeconomic factors²³. Aiming to move towards race-conscious medicine, and keeping in mind that, as suggested by Cerdeña et al.²⁴, clinical research should be used to examine structural barriers, we decided to use two sets of patients of different ethnicity rather than using race as a proxy for biology. Our models have therefore been tested on both European and Asian patients and can be reliable regardless of the patient's origin.

Various machine learning techniques have been used in urology, most of them in the lower urinary tract setting: (1) regarding radiomics, AI tools have been implemented that are capable of distinguishing between bladder tumor and normal bladder on multiparametric magnetic resonance imaging (mpMRI)²⁵ or of determining the stage of bladder cancer on computed tomography (CT)²⁶; (2) in terms of prognosis, the only experience derives from Lam et al. and Wang et al., who used clinicopathological data to create and test a considerable number of AI algorithms to estimate 5-year survival after radical cystectomy^27,28. To date, this is the first experience investigating the possible application of supervised ML algorithms to UTUC, and in particular to the prediction of prognosis after RNU.

Lastly, our model may help clinicians in stratifying patients with UTUC, addressing the challenge of understanding the clinical aggressiveness of these specific tumors based on baseline characteristics. Not only can it be used to intensify follow-up strategies in patients at high risk of recurrence, but it can also be used to stratify potential candidates for adjuvant and subsequent therapies.

This study has several limitations. First, its multicenter nature may have introduced inconsistencies in surgical skills, type of bladder cuff excision performed, use of intra- or perioperative mitomycin, use of neoadjuvant chemotherapy and pathological diagnoses. Second, since the cohort straddles 2004, the use of two different pathological grading systems may have influenced the algorithms. Furthermore, 2 patients in the training cohort and 15 patients in the validation cohort had a pT0 diagnosis on the final histopathological specimen: although this may reflect real-world data, the prognosis for those patients is by definition excellent. Moreover, there is a non-negligible difference in gender representation between the two cohorts, which, owing to differences in underlying biology, may influence response to treatment and prognosis. Lastly, the lack of centralized review of imaging and pathological specimens could introduce bias.

Conclusions

ML is a promising technology in the field of UTUC. Our models achieved favorable results in predicting prognosis after RNU, especially for CSS at 3 and 5 years; moreover, this is the first prognostic model to take into account the epidemiological differences between European and Asian patients. Further clinical validation, and verification of its reliability for selecting candidates for adjuvant therapy, are needed before it can be used in clinical decision making.

Data availability

Data are available for bona fide researchers who request it from the authors. Please contact the corresponding author for related requests.

References

1. Siegel, R. L., Miller, K. D., Fuchs, H. E. & Jemal, A. Cancer statistics, 2021. CA Cancer J. Clin. 71 (1), 7–33 (2021). https://doi.org/10.3322/CAAC.21654
2. Upper Urinary Tract Urothelial Cell Carcinoma - Introduction - Uroweb. Accessed: Feb. 28, 2023. [Online]. Available: https://uroweb.org/guidelines/upper-urinary-tract-urothelial-cell-carcinoma
3. Birtle, A. et al. Adjuvant chemotherapy in upper tract urothelial carcinoma (the POUT trial): a phase 3, open-label, randomised controlled trial. Lancet 395 (10232), 1268–1277 (2020). https://doi.org/10.1016/S0140-6736(20)30415-3
4. Yoshida, T. et al. Development and external validation of a preoperative nomogram for predicting pathological locally advanced disease of clinically localized upper urinary tract carcinoma. Cancer Med. 9 (11), 3733–3741 (2020). https://doi.org/10.1002/CAM4.2988
5. Rouprêt, M. et al. Prediction of cancer specific survival after radical nephroureterectomy for upper tract urothelial carcinoma: development of an optimized postoperative nomogram using decision curve analysis. J. Urol. 189 (5), 1662–1669 (2013). https://doi.org/10.1016/J.JURO.2012.10.057
6. Zhang, G. L. & Zhou, W. A model for the prediction of survival in patients with upper tract urothelial carcinoma after surgery. Dose Response 17 (4) (2019). https://doi.org/10.1177/1559325819882872
7. Krabbe, L. M. et al. Postoperative nomogram for relapse-free survival in patients with high grade upper tract urothelial carcinoma. J. Urol. 197 (3 Pt 1), 580–589 (2017). https://doi.org/10.1016/J.JURO.2016.09.078
8. Cha, E. K. et al. Predicting clinical outcomes after radical nephroureterectomy for upper tract urothelial carcinoma. Eur. Urol. 61 (4), 818–825 (2012). https://doi.org/10.1016/J.EURURO.2012.01.021
9. Yates, D. R. et al. Cancer-specific survival after radical nephroureterectomy for upper urinary tract urothelial carcinoma: proposal and multi-institutional validation of a post-operative nomogram. Br. J. Cancer 106 (6), 1083–1088 (2012). https://doi.org/10.1038/BJC.2012.64
10. Seisen, T. et al. Postoperative nomogram to predict cancer-specific survival after radical nephroureterectomy in patients with localised and/or locally advanced upper tract urothelial carcinoma without metastasis. BJU Int. 114 (5), 733–740 (2014). https://doi.org/10.1111/BJU.12631
11. Ku, J. H. et al. External validation of an online nomogram in patients undergoing radical nephroureterectomy for upper urinary tract urothelial carcinoma. Br. J. Cancer 109 (5), 1130–1136 (2013). https://doi.org/10.1038/BJC.2013.462
12. Matsumoto, K. et al. Racial differences in the outcome of patients with urothelial carcinoma of the upper urinary tract: an international study. BJU Int. 108 (8b), E304–E309 (2011). https://doi.org/10.1111/j.1464-410X.2011.10188.x
13. Sargent, D. J. Comparison of artificial neural networks with other statistical approaches. Cancer 91 (S8), 1636–1642 (2001). https://doi.org/10.1002/1097-0142(20010415)91:8+%3C1636::AID-CNCR1176%3E3.0.CO;2-D
14. Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380 (14), 1347–1358 (2019). https://doi.org/10.1056/NEJMRA1814259
15. Wright, R. E. Logistic regression. In Reading and Understanding Multivariate Statistics, 217–244 (American Psychological Association, Washington, DC, US, 1995).
16. Safavian, S. R. & Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man. Cybern. 21 (3), 660–674 (1991). https://doi.org/10.1109/21.97458
17. Breiman, L. Random forests. Mach. Learn. 45 (1), 5–32 (2001). https://doi.org/10.1023/A:1010933404324
18. Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. https://doi.org/10.1145/2939672
19. Ke, G. et al. LightGBM: a highly efficient gradient boosting decision tree. Accessed: Feb. 28, 2023. [Online]. Available: https://github.com/Microsoft/LightGBM
20. Dorogush, A. V., Ershov, V. & Gulin, A. CatBoost: gradient boosting with categorical features support (2018). https://doi.org/10.48550/arxiv.1810.11363
21. Cortes, C., Vapnik, V. & Saitta, L. Support-vector networks. Mach. Learn. 20 (3), 273–297 (1995). https://doi.org/10.1007/BF00994018
22. Thai-Nghe, N., Gantner, Z. & Schmidt-Thieme, L. Cost-sensitive learning methods for imbalanced data. Accessed: Feb. 28, 2023. [Online]. Available: http://www.cs.waikato.ac.nz/ml/weka/
23. Soria, F. et al. Epidemiology, diagnosis, preoperative evaluation and prognostic assessment of upper-tract urothelial carcinoma (UTUC). World J. Urol. 35 (3), 379–387 (2017). https://doi.org/10.1007/s00345-016-1928-x
24. Cerdeña, J. P., Plaisime, M. V. & Tsai, J. From race-based to race-conscious medicine: how anti-racist uprisings call us to act. Lancet 396, 1125–1128 (2020). https://doi.org/10.1016/S0140-6736(20)32076-6
25. Xu, X. et al. Three-dimensional texture features from intensity and high-order derivative maps for the discrimination between bladder tumors and wall tissues via MRI. Int. J. Comput. Assist. Radiol. Surg. 12 (4), 645–656 (2017). https://doi.org/10.1007/S11548-017-1522-8
26. Garapati, S. S. et al. Urinary bladder cancer staging in CT urography using machine learning. Med. Phys. 44 (11), 5814 (2017). https://doi.org/10.1002/MP.12510
27. Wang, G., Lam, K. M., Deng, Z. & Choi, K. S. Prediction of mortality after radical cystectomy for bladder cancer by machine learning techniques. Comput. Biol. Med. 63, 124–132 (2015). https://doi.org/10.1016/J.COMPBIOMED.2015.05.015
28. Lam, K. M., He, X. J. & Choi, K. S. Using artificial neural network to predict mortality of radical cystectomy for bladder cancer. Proceedings of International Conference on Smart Computing, SMARTCOMP 2014, 201–207 (2014). https://doi.org/10.1109/SMARTCOMP.2014.7043859

Author information

Authors and Affiliations

S.H. Ho Urology Centre, Department of Surgery, Faculty of Medicine, The Chinese University of Hong Kong, Sha Tin, Hong Kong
Rossella Nicoletti, Chris Ho-Ming Wong, Ivan Ching-Ho Ko, Chi-Ho Leung, Chi-Fai Ng & Jeremy Yuen-Chun Teoh
Department of Experimental and Clinical Biomedical Science, University of Florence, Florence, Italy
Rossella Nicoletti, Pietro Spatafora, Riccardo Campi, Sergio Serni & Mauro Gacci
The Hong Kong Polytechnic University, Hung Hom, Hong Kong
Nick Ho & Thomas Choi
Kaohsiung Medical University Hospital, Kaohsiung, Taiwan
Hsiang-Ying Lee, Wen-Jeng Wu, Ching-Chia Li, Wei-Ming Li, Hung-Lung Ke & Hsin‑Chih Yeh
Department of Urology, School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
Hsiang-Ying Lee, Wen-Jeng Wu, Ching-Chia Li, Wei-Ming Li, Hung-Lung Ke & Hsin‑Chih Yeh
Department of Urology, Medical University of Vienna, Vienna, Austria
Ekaterina Laukhtina, Shahrokh Shariat & Jeremy Yuen-Chun Teoh
Unit of Urology, Santa Maria della Misericordia Academic Medical Center, Udine, Italy
Gianluca Giannarini
Department of Urology, Hertfordshire and Bedfordshire Urological Cancer Centre, Lister Hospital, Stevenage, UK
Nikhil Vasdev
School of Life and Medical Sciences, University of Hertfordshire, Hatfield, UK
Nikhil Vasdev
AOU Città della Salute e della Scienza di Torino, Torino School of Medicine, Torino, Italy
Paolo Gontero
Department of Urology, Kaohsiung Medical University Gangshan Hospital, Kaohsiung, Taiwan
Ching-Chia Li, Wei-Ming Li & Hung-Lung Ke
Department of Urology, Weill Cornell Medical College, New York, USA
Shahrokh Shariat
Department of Urology, University of Texas Southwestern, Dallas, USA
Shahrokh Shariat
Hourani Center of Applied Scientific Research, Al-Ahliyya Amman University, Salt, Jordan
Shahrokh Shariat

Contributions

RN contributed acquisition of data, analysis and interpretation of data, drafting of manuscript, critical revision of the manuscript for important intellectual content and statistical analysis; NH contributed acquisition of data, analysis and interpretation of data, statistical analysis; HYL contributed acquisition of data; HLK contributed acquisition of data; WJW contributed acquisition of data; EL contributed acquisition of data; PS contributed acquisition of data; CHMW contributed acquisition of data, critical revision of manuscript; ICHK contributed critical revision of manuscript; CHL contributed analysis and interpretation of data; GG contributed acquisition of data; NV contributed acquisition of data; PG contributed acquisition of data; NCFA contributed acquisition of data, supervision; CCL contributed acquisition of data; WML contributed acquisition of data; HCY contributed acquisition of data; RC contributed acquisition of data; SS contributed acquisition of data; MG contributed acquisition of data; SS contributed acquisition of data; TC contributed study concept and design, acquisition of data, analysis and interpretation of data, and statistical analysis; JYCT contributed study concept and design, analysis and interpretation of data, statistical analysis, supervision.

Corresponding author

Correspondence to Jeremy Yuen-Chun Teoh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download XLSX )

Supplementary Material 2 (download XLSX )

Supplementary Material 3 (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Nicoletti, R., Ho, N., Lee, HY. et al. Training and external validation of machine learning supervised prognostic models of upper tract urothelial cancer (UTUC) after nephroureterectomy. Sci Rep 16, 2847 (2026). https://doi.org/10.1038/s41598-025-29043-w

Received: 17 April 2024
Accepted: 13 November 2025
Published: 22 January 2026
Version of record: 22 January 2026
DOI: https://doi.org/10.1038/s41598-025-29043-w