Fig. 22: Distribution of Stage 2 responses per model when prompted in Russian. | npj Artificial Intelligence

Fig. 22: Distribution of Stage 2 responses per model when prompted in Russian.

From: Large language models reflect the ideology of their creators

Fig. 22: Distribution of Stage 2 responses per model when prompted in Russian.The alternative text for this image may have been generated using AI.

left Label distributions of valid responses. right validity rates. A response is invalid if the Stage 1 response is a refusal or clear hallucination, or if the Stage 2 response cannot clearly be mapped to the answer scale

Back to article page