Sim et al. compare how different prompting strategies guide ChatGPT-4o and Llama-3.1 in classifying how pain and fatigue affect functioning in childhood cancer survivors. They show that prompts that leverage background knowledge and stepwise reasoning improve accuracy and reveal distinct precision–sensitivity trade-offs among models.
- Jin-ah Sim
- Madeline R. Horan
- I-Chan Huang