Table 5 Combined comparison of fine-tuning and inference-stage methods.
From: Accurate discharge summary generation using fine tuned large language models with self evaluation
| Model & Method | Fine-Tuning | Inference Optimization | ROUGE-L | BERTScore | Accuracy | Completeness |
|---|---|---|---|---|---|---|
| Qwen2-7B Baseline | ✘ | few-shot | 0.391 | 0.866 | 4.0 | 3.9 |
| Qwen2-7B + DoRA | ✔ | few-shot | 0.451 | 0.923 | 4.5 | 4.6 |
| Qwen2-7B + Self-Evaluation | ✘ | ✔ | 0.451 | 0.923 | 4.7 | 4.9 |
| Qwen2-7B + DoRA + Self-Evaluation | ✔ | ✔ | 0.486 | 0.941 | 4.8 | 4.9 |