Fig. 3

Visualization of the GT lesion diagnosis results of GTCAD and existing methods. The 1st column shows original images with a complicated GT lesion, and the 2nd to 7th columns show attention heads (head zero to seven). The 1st to 8th rows correspond to Baseline, CrossViT [19], ViTfSCD [30], MedViT [20], FLATer [16], CST [31], AG-CNN [18], and GTCAD (ours). The proposed model accurately diagnoses and visualizes lesions regardless of their size.