Fig. 1
From: Visual information extraction from documents via classification-guided large vision-language models

Overview of the classification-guided LVLM framework for multi-VIE. The design leverages a parameter-frozen LVLM to support minimally supervised, scalable deployment.