Table 1 Sensor fusion models with their strengths and limitations.
Stage | Model | Strengths | Limitations |
|---|---|---|---|
Early | MEFA31 | High-resolution fusion, fine-grained spatial feature alignment | Rigid fusion, high computational cost, limited real-time use |
VirConv32 | Lightweight, efficient voxel feature propagation | Limited adaptability, struggles with semantic inconsistencies | |
Mid-Level | BEVFusion4D33 | Temporal BEV fusion, consistent scene aggregation | Executes fusion regardless of feature alignment, redundant computations |
CRN34 | adar-Camera feature fusion with dynamic spatial reasoning | Lacks semantic consistency gating , sensitive to modality disagreement | |
DeepStep21 | Progressive integration, improved temporal continuity | Unconditional fusion execution, limited adaptability | |
Late | C-CLOCs35 | Modality-specific decision making, robustness to misalignment | Post-hoc alignment, limited semantic feature utilization |
BAFusion17 | Uncertainty-aware fusion, reliability weighting | Unconditional fusion, limited adaptability to sensor disagreement | |
Hybrid | DecoratingFusion25 | Multi-stage refinement, dense feature propagation | Executes all fusion stages unconditionally, increased latency |
MS-Occ26 | Multi-scale occupancy reasoning, strong spatial coverage | Performs all fusion stages unconditionally, computationally intensive | |
HydraFusion37 | Flexible per-branch processing, adaptable fusion routes | Static fusion pipelines within each branch, unnecessary computations | |
RCBEV36 | Radar-Camera feature alignment, modality-specific spatial adaptation | Limited to Radar-Camera pairs, lacks generalized modality fusion |