Fig. 2: Top 10 consistency.
From: Prompt engineering in consistency and reliability with the evidence-based guideline for LLMs

The vertical axis shows each combination of model and prompt. For example, ‘gpt-4-Web-ROT’ indicates that the model is gpt-4-Web and the prompt is ROT prompting.