Table 1 The research advancements in the field of sculpture image classification

From: The analysis of sculpture image classification in utilization of 3D reconstruction under K-means++

Author/year

Specific objective

Algorithm used

Advantages

Disadvantages

Basic assumption

Overall evaluation

Nawaz et al. (2022)

Unified image representation learning and image synthesis tasks

Representation learning method based on image semantic masks

Implemented multi-task learning and demonstrated state-of-the-art performance across multiple datasets

Did not directly optimize for sculpture image classification

Image representation and synthesis could mutually enhance each other

This study provided a new perspective on image representation learning, but its direct contribution to sculpture image classification was limited

Sujithra et al. (2024)

Image classification based on self-supervised learning

Pre-training with unlabeled image data, and fine-tuning with limited labeled data

Effectively utilized unlabeled data to improve classification efficiency

Did not fully explore the application in sculpture image classification

Self-supervised learning could learn general features

This method demonstrated the potential of self-supervised learning in image classification but required further validation for its effectiveness in sculpture image classification

Zhai et al. (2024)

Reviewed the applications and developments of CNN in computer vision

ResNet50

It solved the gradient vanishing problem in deep networks by introducing residual blocks

It had a large number of parameters and high computational resource requirements

Deep networks were prone to the gradient vanishing problem

ResNet50, with its innovative residual block design, significantly improved the training efficiency and performance of deep networks

Xu et al. (2024)

Proposed a deep image classification model Based on prior feature knowledge embedding

Inception v3

It captured multi-scale information by parallelly using convolution kernels of different sizes in each convolutional layer

Its structure was relatively complex and training was more challenging

Multi-scale information was crucial for image classification

Inception v3 enhanced the model’s expressive power and classification performance by designing multi-scale convolutional kernels