Fig. 2: Performance of final models.

A–C External validation using CUIMC data. Per-Class ROC Curves for each model: Tumor size (T14), Regional lymph node involvement (N03), and Distant metastasis (M01). D Best-performing models applied to TCGA held-out and CUIMC pathology reports. Models were selected based on TCGA internal validation set performance.