Fig. 3
From: A dataset for evaluating clinical research claims in large language models

Performance and stratified analysis of the top discriminative and generative models.