Ji et al. assess the responses of four major large language models in the context of cardiovascular disease prevention queries in both English and Chinese. The large language model chatbots exhibit significant disparities in performance across different models and languages, with ChatGPT-4 outperforming the others in English.
- Hongwei Ji
- Xiaofei Wang
- Yih-Chung Tham