In an evaluation involving 125 standardized patient cases, open-source DeepSeek large language models are shown to perform at least on par with state-of-the-art proprietary large language models in diagnosis and treatment recommendation tasks.
- Sarah Sandmann
- Stefan Hegselmann
- Julian Varghese