Fig. 2: Clinical Transformer performance and impact of pretraining.

a Kaplan-Meier (KM) curves of the Clinical Transformer applied to the Chowell et al.3 dataset, using the median survival score to stratify patients into high and low populations. b KM curves of Chowell et al.3 random forest model used in the testing dataset (using Chowell et al. optimal pan-cancer cutoffâ=â0.238). c KM curves for evaluating TMB score in the Chowell et al.3 dataset (cutoffâ=â10 mutations per megabase [mut/mb]). P-values for HR in (aâc) reported from a Wald statistical test. d Learning curves of the Clinical Transformer (C-index vs. training epochs), evaluated on the 10 testing splits from the Samstein et al.33 pan-cancer dataset, with and without using GENIE data for model pretraining. e Learning curves of the Clinical Transformer, evaluated on the 10 testing splits from the MYSTIC dataset, with and without using GENIE data for model pretraining. f Learning curves of the Clinical Transformer, evaluated on the 10 testing splits from the MYSTIC dataset, with and without using the Chowell et al. 3 dataset for model pretraining. Source data are provided in the SourceData file.