Table 8 Unified performance comparison of transformer models for routing and urgency estimation.

Model	Accuracy (%)	MAE	Latency (ms)	GPU Req.
BERT-base (fine-tuned)	94.1	0.052	128	Yes
RoBERTa-base (fine-tuned)	94.8	0.048	141	Yes
DeBERTa-v3-base (fine-tuned)	95.2	0.061	158	Yes
LLaMA-3 8B (zero-shot)	93.7	0.059	420	Yes
MobileBERT (zero-shot, Proposed)	92.4	0.041	19	No

Quick links

Search