Fig. 6: Impact of LLMs selection on feature count.

We have observed the difference between GPT-3.5 and GPT-4o when detecting SLD Features F1-F10. We observe that GPT-4o adopts a more conservative approach than GPT-3.5, showing a slightly lower detection rate across key ASD-related markers.