Fig. 9 | Scientific Reports

Fig. 9

From: Research on pedestrian recognition in complex scenarios based on data augmentation using large language models

Fig. 9

Schematic diagram of the large language model augmentation process: (a) WanX-v1 model interface. (b) Doubao AI interface. According to the user agreement between Volcano Engine Doubao and AliCloud, images could be used for non-profit academic research, including publication in papers. The figure illustrates the process and examples of image generation using large language models as described in this paper.

Back to article page