Table 10 Multimodal variational autoencoder (MVAE) for data fusion.
| Time (s) | Latent representation (Video) | Latent representation (Audio) | Latent representation (Motion) | Reconstruction error | Anomaly indicator |
|---|---|---|---|---|---|
| 0–10 | 0.12 | 0.15 | 0.18 | 0.05 | Normal |
| 10–20 | 0.13 | 0.17 | 0.16 | 0.07 | Normal |
| 20–30 | 0.40 | 0.38 | 0.45 | 0.25 | Anomalous |
| 30–40 | 0.11 | 0.13 | 0.14 | 0.04 | Normal |
| 40–50 | 0.14 | 0.16 | 0.15 | 0.06 | Normal |
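
The anomaly indicator in Table 10 is consistent with a simple threshold on the reconstruction error, although the table does not state the decision rule that was actually used. The following minimal sketch assumes such a rule. The name `label_window` and the threshold value of 0.10 are hypothetical and chosen for illustration only. With the strict comparison used here, any threshold of at least 0.07 and below 0.25 reproduces the labels in the table.

```python
# Reconstruction errors copied from Table 10, one per 10 s window,
# in the order 0-10, 10-20, 20-30, 30-40, 40-50.
reconstruction_errors = [0.05, 0.07, 0.25, 0.04, 0.06]

# Hypothetical threshold: the source does not report the value used.
THRESHOLD = 0.10


def label_window(reconstruction_error, threshold=THRESHOLD):
    """Label a window as anomalous when its error exceeds the threshold."""
    return "Anomalous" if reconstruction_error > threshold else "Normal"


labels = [label_window(err) for err in reconstruction_errors]
print(labels)
```

Under this assumption only the 20–30 s window is flagged, which matches the table. In practice the threshold would more plausibly be calibrated on the reconstruction errors of known-normal data than fixed by hand.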