Table 1 Illustration of instruction data formulation for instruction tuning.
From: Leveraging multimodal large language model for multimodal sequential recommendation
Instruction |
Based on the user’s historical multimodal interaction sequence, please deduce whether the user might like the target item next by responding \(\backslash\)” Yes. \(\backslash\)” or \(\backslash\)” No. \(\backslash\)”.\(\backslash\)n |
Input |
The user_\(\left\{ user\_id \right\}\) is \(\left\{ user\_profile \right\}\) and based on the previous multimodal interaction history ,his preferences are summarized as: \(\backslash\)n |
\(\backslash\)” \(\left\{ user\_preference \right\}\) \(\backslash\)” \(\backslash\)n |
Whether user_ \(\left\{ user\_id \right\}\) |
will like the target item |
\(\backslash\)” \(\left\{ \left( image_{n+1},text_{n+1} \right) \right\}\) \(\backslash\)” |
next \(\backslash\)? \(\backslash\)n |
Response |
\(\backslash\)” Yes. \(\backslash\)” if label = 1 else \(\backslash\)” No. \(\backslash\)” |