Table 8 Disaggregated wall-clock timings (mean±SD over 5 runs) and peak memory. All numbers are per 5-fold CV on 5046 notes (\(\approx\) 4037 train + 1009 validation). Inference latency measured as mean time to predict one held-out note.

From: An empirical evaluation of dimensionality reduction and class balancing for medical text classification

Pipeline stage

PCA+SMOTE

UMAP+SMOTE

t-SNE+SMOTE

No reduction

(i) DR fit (s)

3 ± 1

71 ± 5

95 ± 7

(ii) Sampler fit (s)

2 ± 1

2 ± 1

2 ± 1

1 ± 0

(iii) Model fit (s)

37 ± 2

37 ± 2

37 ± 2

72 ± 4

Total training (s)

42 ± 3

110 ± 7

134 ± 9

73 ± 5

(iv) Inference (ms/note)

0.9 ± 0.1

0.9 ± 0.1

0.9 ± 0.1

1.8 ± 0.2

(v) Peak memory (GB)

4.1 ± 0.2

4.5 ± 0.3

4.6 ± 0.3

7.2 ± 0.4