Table 6. Results of the text description generator.
| Model | Recognition accuracy (%) | BLEU (%) |
|---|---|---|
| MobileVLM | 7.50 | 9.75 |
| MobileVLM-Fine-tuned | 16.69 | 16.18 |
| CLIP + MobileVLM-Fine-tuned | 78.38 | 36.65 |