Fig. 7: The accuracy of prompting methods engineered for cancer stage recognition.

The figure compares the accuracy of extracting two cancer staging-related entities—cancer_staging_method and cancer_stage_type—before (orange bars) and after (blue bars) prompt optimization across various model-method pairs. Optimized prompts consistently improved performance, particularly for models using hierarchical prompting strategies like BFOP and 2POP.