Table 5 Performance metrics of different models

Model	Accuracy	PPV	Sensitivity	F1 score
XLNet³⁰	58.76%	54.38%	58.76%	56.07%
ALBERT³¹	69.07%	47.71%	69.07%	56.44%
DistilBERT³²	58.76%	51.85%	58.76%	54.44%
RoBERTa³³	57.73%	58.12%	57.73%	57.92%
BERT³⁴	63.92%	60.86%	63.92%	61.87%
Mistral - 7b	70.29%	70.29%	100.00%	82.55%
Qwen2 - 7b	70.29%	70.29%	100.00%	82.55%
Gemini 2.0 Flash-based	63.47%	72.34%	77.97%	75.05%
GPT 4o - based	67.05%	70.76%	90.51%	79.43%
GPT 3.5 - based	69.14%	71.52%	93.22%	80.94%

Quick links

Search