Fig. 1

The Overall Architecture of the VBG Model, Integrating Vision Transformer (ViT), BERT, and Generative Adversarial Network (GAN) for Enhanced Multimodal Relic Classification.

The Overall Architecture of the VBG Model, Integrating Vision Transformer (ViT), BERT, and Generative Adversarial Network (GAN) for Enhanced Multimodal Relic Classification.