Table 4 Performance comparison with different modality inputs. The best results are in bold and the second-best results are italics.The improvement is calculated over the second-best result.
From: Leveraging multimodal large language model for multimodal sequential recommendation
Metrics | Recall@5 | Recall@10 | NDCG@5 | NDCG@10 | MRR@5 | MRR@10 |
---|---|---|---|---|---|---|
Text-only | 0.0584 | 0.0648 | 0.03090 | 0.0457 | 0.0238 | 0.0299 |
Image-only | 0.0525 | 0.0679 | 0.0689 | 0.0844 | 0.1012 | 0.1490 |
Text + Image | 0.0620 | 0.0984 | 0.0733 | 0.0893 | 0.1100 | 0.1593 |
%Improv. | 6.16% | 4.78% | 6.39% | 5.81% | 8.91% | 6.91% |