Fig. 4
From: A novel flexible identity-net with diffusion models for painting-style generation

The image-processing workflow for Painting-42 comprises two main steps. First, the raw images were captioned with BLIP CapFit to generate adaptive captions, which were then carefully validated and corrected. Second, the original images were manually cropped and enhanced to produce uniformly sized \(512 \times 512\) images, each paired with its corresponding label.
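
The cropping-and-resizing step can be illustrated with a short sketch. The caption states that cropping was done manually, so the automatic center crop below is only a stand-in for that manual choice, not the authors' procedure. The function names `center_crop_box` and `to_square_512` are hypothetical, and the use of Pillow with Lanczos resampling is an assumption.

```python
TARGET = 512  # side length of the uniform output images, as stated in the caption


def center_crop_box(width, height):
    """Return (left, top, right, bottom) of the largest centered square.

    Assumption: a center crop stands in for the manual crop described
    in the caption.
    """
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return (left, top, left + side, top + side)


def to_square_512(path_in, path_out):
    """Crop an image to a centered square and resize it to 512 x 512.

    Hypothetical helper; requires Pillow, imported lazily so that the
    box computation above can be used without it.
    """
    from PIL import Image

    with Image.open(path_in) as img:
        box = center_crop_box(*img.size)
        square = img.crop(box)
        square.resize((TARGET, TARGET), Image.LANCZOS).save(path_out)


if __name__ == "__main__":
    # A 1024 x 768 landscape image keeps its full height and loses
    # 128 pixels from each side.
    print(center_crop_box(1024, 768))
```

A center crop can cut off the subject of a painting, which is presumably why the crops for the dataset were chosen by hand; in practice the box would come from an annotator rather than from `center_crop_box`.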