Extended Data Fig. 4: Comparing model outputs on open-ended question answering, example 2.
From: A multimodal generative AI copilot for human pathology

An example question in PathQABench-Public regarding glioblastoma for which the responses by all models were considered to be of roughly comparable quality by expert pathologists for all producing a reasonable and reasonably accurate response to the query, though with some variation between them. Scale bar is 200 µm.