Fig. 2: Average Distinct n-gram scores by the different LLM-RAG Models. | npj Digital Medicine

Fig. 2: Average Distinct n-gram scores by the different LLM-RAG Models.

From: Retrieval augmented generation for 10 large language models and its generalizability in assessing medical fitness

Fig. 2

The figure shows the average distinct 1-gram (Yellow line), 2-gram (Orange line), and 3-gram (Red line) scores for each LLM-RAG model. N-gram scores are a measure of linguistic diversity, with higher scores indicating greater originality and creativity in the generated text. 1-grams represents individual words, 2-grams represents pairs of consecutive words, and 3-grams represents triplets of successive words. The bot shows that the Llama3 models have little variations in their linguistic variability.

Back to article page