Scientific Reports

Table 2 Different model results on the testset of the MSVD dataset.

From: Semantic guidance network for video captioning

Years	Method	B4	M	R	C
2022	vc-HRNAT(IR+C)⁶⁶	57.7	36.3	74.0	96.3
2022	vc-HRNAT(IR+I)⁶⁶	55.7	36.8	74.1	98.1
2021	SGN³⁸	52.8	35.5	72.9	94.3
2021	SCST⁶⁷	50.9	35.1	72.4	94.5
2020	STG⁶⁸	52.2	36.9	73.9	93.0
2020	SAAT³³	46.5	33.5	69.4	81.0
2019	E2E⁶⁹	50.3	34.0	70.8	87.5
2019	POS-CG(I3D+M)⁷⁰	53.5	34.9	72.1	91.0
2019	POS-CG(IR+M)⁷⁰	52.5	34.1	71.3	88.7
	Ours	55.3	36.1	74.2	98.4

The best experimental results are presented in bold.

Back to article page

Search

Advanced search

Quick links