Table 7 Highlights of the best achieved performance across all data modalities.

EEG only	Video only	Subjects’ consensus	Fusion
39.78%	TRN: 31.70%	31.70% \(\rightarrow\) 35.67%	46.14%
39.78%	TSM: 42.33%	42.33% \(\rightarrow\) 43.66%	46.14%

Subjects’ consensus refers to VIDEO only performance, where models are trained with the additional supervision provided by the consensus predictions inferred from the EEG of the multiple subjects who watched a specific video. Fusion refers to explicit EEG+VIDEO fusion and is thus an upper bound for model performance.

Quick links

Search