Extended Data Fig. 6: Execution details of Q85-Q167 in BioMed-AQA. | Nature Biomedical Engineering

Extended Data Fig. 6: Execution details of Q85-Q167 in BioMed-AQA.

From: Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses

Extended Data Fig. 6: Execution details of Q85-Q167 in BioMed-AQA.

The detailed testing of each question in BioMed-AQA (Q85-Q167) by BioMedAgent, utilizing the IMF memory update strategy after three rounds of learning, includes the planned steps, Win scores, and execution outcomes. The planned steps are automatically generated by BioMedAgent, with +2/+6 indicates 2 or 6 more steps not shown.

Back to article page