Table 1. Summary of the topic models, image encoders, and language models used in the topic-based approaches.

From: Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture

 

| Approach | Topic modeling technique | Image encoder | Language model |
| --- | --- | --- | --- |
| Topic-based captioning [7] | LDA | Inception-V3 | LSTM |
| Show and tell more [22] | LDA | ResNet-152 | Two layers of LSTM |
| What topics do images say [23] | LDA | VGGNet | LSTM |
| Topic-oriented (NeuralTalk2-T-oe) [28] | LDA & NMF | VGGNet | LSTM |
| Topic-guided attention (VA) [8] | LDA | VGGNet | LSTM with attention |
| Topic-sensitive [30] | HDP | AlexNet | LSTM |