Extended Data Fig. 6: Execution details of Q85-Q167 in BioMed-AQA.

Detailed test results for each question in BioMed-AQA (Q85-Q167) answered by BioMedAgent using the IMF memory update strategy after three rounds of learning, including the planned steps, Win scores, and execution outcomes. The planned steps are generated automatically by BioMedAgent; +2/+6 indicates that 2 or 6 additional steps are not shown.