Fig. 7: The evaluation scores of different approaches across various LLM bases.
From: Self-reflection enhances large language models towards substantial academic response

For the proposed RBB method, the reflection bank is constructed with GLM-4-Flash, and reasoning over the reflection bank is carried out with each of the three LLM bases. Error bars represent one standard deviation (SD) and indicate the variability of the data within each group.
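As a rough illustration of how one-SD error bars are typically derived, the sketch below computes the mean and standard deviation of one group of scores. The function name `mean_and_sd` and the sample scores are hypothetical and not taken from the paper. The caption does not say whether the sample or population SD was used, so the sample SD (n - 1 denominator) is assumed here.

```python
import statistics


def mean_and_sd(scores):
    """Return (mean, sample SD) for one group of evaluation scores.

    Assumption: sample SD (n - 1 denominator). The source does not state
    which SD variant was used for the figure.
    """
    if len(scores) < 2:
        raise ValueError("at least two scores are needed for a sample SD")
    return statistics.mean(scores), statistics.stdev(scores)


# Hypothetical scores for a single (method, LLM base) group; not real data.
example_scores = [2, 4, 4, 4, 5, 5, 7, 9]
mean, sd = mean_and_sd(example_scores)

# The error bar for this group would span mean - sd to mean + sd.
print(f"bar height = {mean:.3f}, error bar = +/- {sd:.3f}")
```

Each bar in such a figure would be drawn at the group mean, with the error bar extending one SD above and below it.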