Table 10 Multimodal variational autoencoder (MVAE) for data fusion.

Time (s)	Latent representation (Video)	Latent representation (Audio)	Latent representation (Motion)	Reconstruction error	Anomaly indicator
0–10	0.12	0.15	0.18	0.05	Normal
10–20	0.13	0.17	0.16	0.07	Normal
20–30	0.40	0.38	0.45	0.25	Anomalous
30–40	0.11	0.13	0.14	0.04	Normal
40–50	0.14	0.16	0.15	0.06	Normal
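
The anomaly indicator in Table 10 tracks the reconstruction error: the only window flagged as anomalous (20–30 s) is also the only one with a markedly higher error. A minimal sketch of such a thresholding rule follows. The cutoff `THRESHOLD = 0.15` and the helper `flag` are illustrative assumptions, not values or names taken from the source; any cutoff between 0.07 and 0.25 would separate these five rows in the same way.

```python
# Rows copied from Table 10: (time window in s, reconstruction error, reported label).
ROWS = [
    ("0–10", 0.05, "Normal"),
    ("10–20", 0.07, "Normal"),
    ("20–30", 0.25, "Anomalous"),
    ("30–40", 0.04, "Normal"),
    ("40–50", 0.06, "Normal"),
]

# Hypothetical cutoff (assumption, not stated in the source).
THRESHOLD = 0.15


def flag(error, threshold=THRESHOLD):
    """Label a window as anomalous when its reconstruction error exceeds the cutoff."""
    return "Anomalous" if error > threshold else "Normal"


# Apply the rule to every window and compare against the label reported in the table.
predicted = [flag(error) for _, error, _ in ROWS]

for (window, error, reported), label in zip(ROWS, predicted):
    print(f"{window} s: error={error:.2f} predicted={label} reported={reported}")
```

In practice the cutoff would more plausibly be calibrated on the error distribution of known-normal data than fixed by hand; the constant here only serves to make the rule concrete.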
