Table 1 An overview of the considered LLMs and their properties

Model	Base	Parameters	Training dataset	Downloadable
Llama 2 Chat³²	Llama 2 (ref. ³²)	70B	Public data^a	✓
OASST³³	Llama 2 (ref. ³²)	70B	Public data^a, https://huggingface.co/OpenAssistant/llama2-70b-oasst-sft-v10/, open-source data	✓
WizardLM³⁴	Llama 2 (ref. ³²)	70B	Public data^a, Evol-Instruct generated³⁴	✓
Clinical Camel¹⁹	Llama 2 (ref. ³²)	70B	Public data^a, https://sharegpt.com/; ShareGPT; PubMed articles (before 2021)¹⁹, MedQA¹³	✓
Meditron³⁵	Llama 2 (ref. ³²)	70B	Public data^a, https://huggingface.co/datasets/epfl-llm/guidelines/; clinical guidelines, public PubMed abstracts³⁵, public PubMed papers³⁵, RedPajama⁵⁸	✓
Chat-GPT⁵⁹	GPT3.5 (ref. ⁶⁰)	???	User conversations^b, Common Crawl⁶¹, WebText2 (ref. ⁶²), Books1 (ref. ⁶³), Books2 (ref. ⁶³), Wikipedia	✗
GPT-4 (ref. ⁶⁴)	???	???	???	✗
Med-PaLM⁹	Flan-PaLM⁶⁵	540B	Webpages^b, Wikipedia^b, social media^b, GitHub^b, news articles^b, books^b, 473 instruction fine-tuning datasets⁶⁵, HealthSearchQA⁹, MedicationQA⁶⁶, LiveQA⁶⁷	✗
Med-PaLM 2 (ref. ⁸)	PaLM 2 (ref. ⁶⁸)	340B	Web Documents^b, books^b, code^b, mathematics^b, conversational data^b, MedQA¹³, HealthSearchQA⁹, MedicationQA⁶⁶, LiveQA⁶⁷	✗

Due to the data usage agreement of MIMIC-IV, only open-access models that can be downloaded can be used with the data; thus, only LLMs based on Llama 2 were used in this study. ??? indicates no information has been made public.
^aMeta defines ‘public data’ as a ‘mix of data from publicly available sources’.
^bNo further information provided.

Quick links

Search