Fig. 5
From: Multimodal fusion transformer network for multispectral pedestrian detection in low-light condition

Details of the Cross-modal Feature Enhancement module. The module enhances feature representations by integrating information from the different modalities, with the goal of improving multispectral pedestrian detection performance.