Table 1 Sensor fusion models with their strengths and limitations.

From: FDSNet: dynamic multimodal fusion stage selection for autonomous driving via feature disagreement scoring

Stage

Model

Strengths

Limitations

Early

MEFA31

High-resolution fusion, fine-grained

spatial feature alignment

Rigid fusion, high computational

cost, limited real-time use

VirConv32

Lightweight, efficient voxel feature

propagation

Limited adaptability, struggles

with semantic inconsistencies

Mid-Level

BEVFusion4D33

Temporal BEV fusion, consistent

scene aggregation

Executes fusion regardless of

feature alignment, redundant

computations

CRN34

adar-Camera feature fusion with

dynamic spatial reasoning

Lacks semantic consistency gating

, sensitive to modality disagreement

DeepStep21

Progressive integration, improved

temporal continuity

Unconditional fusion execution,

limited adaptability

Late

C-CLOCs35

Modality-specific decision making,

robustness to misalignment

Post-hoc alignment, limited

semantic feature utilization

BAFusion17

Uncertainty-aware fusion,

reliability weighting

Unconditional fusion, limited

adaptability to sensor

disagreement

Hybrid

DecoratingFusion25

Multi-stage refinement, dense

feature propagation

Executes all fusion stages

unconditionally, increased latency

MS-Occ26

Multi-scale occupancy reasoning,

strong spatial coverage

Performs all fusion stages

unconditionally, computationally

intensive

HydraFusion37

Flexible per-branch processing,

adaptable fusion routes

Static fusion pipelines within

each branch, unnecessary

computations

RCBEV36

Radar-Camera feature alignment,

modality-specific spatial adaptation

Limited to Radar-Camera pairs,

lacks generalized modality fusion