Table 6 The influence of using different attention at different stages of the backbone on the model performance evaluation indicators.

Feature Maps	Attention	mAP₅₀(%)	mAP_50:95(%)	GFLOPs
C2/C3/C4	–	66.9	35.5	24.4
C4	ECA	67.0(+ 0.1)	35.8(+ 0.3)	24.4(+ 0.0)
	GAM³⁷	67.2(+ 0.3)	35.9(+ 0.4)	29.7(+ 5.3)
	CAM³⁷	67.1(+ 0.2)	35.6(+ 0.1)	24.7(+ 0.3)
	PAM³⁷	66.1(− 0.8)	35.4(− 0.1)	24.7(+ 0.3)
	TripleAttention	66.0(− 0.9)	34.9(− 0.6)	24.5(+ 0.1)
C3	ECA	66.6(− 0.3)	35.0(− 0.5)	24.4
	GAM	66.0(− 0.9)	35.2(− 0.3)	29.7(+ 5.3)
	CAM	66.5(− 0.4)	35.4(− 0.1)	24.6(+ 0.2)
	PAM	65.5(− 1.4)	34.9(− 0.6)	24.7(+ 0.3)
	TripleAttention	67.2(+ 0.3)	35.8(+ 0.3)	24.5(+ 0.1)
	CBAM³⁸	65.2(− 1.7)	34.6(− 0.9)	24.5(+ 0.1)
	CoTAttention³⁹	67.0(+ 0.1)	35.6(+ 0.1)	26.3(+ 1.9)
	Nonlocal⁴⁰	66.6(− 0.3)	35.6(+ 0.1)	25.3(+ 0.9)
	MSCA⁴¹	66.0(− 0.9)	35.1(− 0.4)	24.7(+ 0.3)
	SeaAttention⁴²	67.0(+ 0.1)	35.5(–)	47.4(+ 23.0)
	LSKblock	67.1(+ 0.2)	35.8(+ 0.3)	25.2(+ 0.8)
C2	GAM	66.4(− 0.5)	35.5(–)	29.7(+ 5.3)
	PAM	66.4(− 0.5)	35.4(− 0.1)	24.7(+ 0.3)
	TripleAttention	66.9(–)	35.6(+ 0.1)	24.5(+ 0.1)
	CBAM	66.3(− 0.6)	35.3(− 0.2)	24.5(+ 0.1)
	CoTAttention	66.6(− 0.3)	35.9(+ 0.4)	26.3(+ 1.9)
	Nonlocal	65.9(− 1.0)	35.1(− 0.4)	25.3(+ 0.9)
	MSCA	67.2(+ 0.3)	35.5(–)	24.8(+ 0.4)
	SeaAttention	65.5(− 1.4)	35.1(− 0.4)	40.9(+ 16.5)
	LSKblock	67.2(+ 0.3)	35.9(+ 0.4)	25.3(+ 0.9)

In the table, “–” indicates that the attention mechanism is not used at the end of each stage of the backbone.

Quick links

Search