Table 5 Performance comparison of different rendering models in ethnic painting style transfer tasks.
From: A painting art rendering system by deep learning framework and machine translation
Model | FID | LPIPS | StyleSimilarity | Generation speed (s/img) | Key limitations |
|---|---|---|---|---|---|
CNN-Style | 45.3 | 0.28 | 80.0% | 0.22 | Texture breaks; noticeable discontinuity in Miao silver “sawtooth pattern” |
VGG-19 | 41.6 | 0.25 | 82.0% | 0.25 | Harsh color transitions; blurred layering in Tibetan Thangka mineral pigments |
Vanilla GAN | 37.8 | 0.21 | 85.0% | 0.30 | Mode collapse; repeated “color-layered” generation in Yi lacquerware |
StyleGAN3 | 32.5 | 0.18 | 86.5% | 0.85 | Blurred boundaries between gold leaf base and mineral colors in Thangka |
CUT | 35.8 | 0.21 | 84.2% | 0.78 | Local distortion in Miao silver “symmetrical patterns” |
Diffusion Model | 30.1 | 0.16 | 88.3% | 1.20 | Insufficient detail restoration for Yi lacquerware “floral painting”; low generation efficiency |
IGAN | 28.2 | 0.12 | 91.0% | 0.35 | No obvious defects; high fidelity in technique and cultural feature reproduction |