Fig. 4

Confusion matrix designed using a BERTScore threshold value of 0.7 to represent the proportion of cases in which the LLM achieved a highly reliable radiology report.

Confusion matrix designed using a BERTScore threshold value of 0.7 to represent the proportion of cases in which the LLM achieved a highly reliable radiology report.