Table 2. Dataset specifications and preprocessing parameters.

From: Community participatory Jingchu folk pattern generation platform construction and user co-creation mechanism analysis

| Parameter category | Specification | Details |
| --- | --- | --- |
| **Image acquisition** | | |
| Original resolution | 2048 × 2048 pixels | Professional museum digitization |
| Color depth | 24-bit RGB | sRGB color space |
| File format | TIFF (archival), PNG (training) | Lossless compression |
| **Dataset composition** | | |
| Total images | 9,700 | 8,500 historical + 1,200 contemporary |
| Training set | 7,760 (80%) | Stratified by pattern category |
| Validation set | 970 (10%) | Stratified by pattern category |
| Test set | 970 (10%) | Stratified by pattern category |
| **Preprocessing** | | |
| Training resolution | 512 × 512 pixels | Bicubic downsampling |
| Color normalization | Histogram equalization | Per-channel normalization |
| Contrast enhancement | CLAHE | Clip limit = 2.0, tile size = 8 × 8 |
| **Data augmentation** | | |
| Horizontal flip | Probability 0.5 | Preserves symmetry characteristics |
| Random rotation | ±15 degrees | Maintains compositional integrity |
| Random crop | Scale 0.8–1.0 | Preserves pattern completeness |
| Color jitter | B ± 0.1, C ± 0.1, S ± 0.1 | Simulates lighting variations |
| **Annotation quality** | | |
| Annotators | 3 experts + 1 senior reviewer | Folklore, art history, conservation |
| Cohen’s Kappa | 0.82 (95% CI: 0.79–0.85) | Substantial agreement |
| Fleiss’ Kappa | 0.78 | Multi-rater consistency |
| Cultural tags | 237 categories | Hierarchical organization |
| Visual descriptors | 1,856 terms | Standardized vocabulary |
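
The table says the 80/10/10 split is stratified by pattern category but gives no procedure. The following is a minimal sketch of one way such a split could be produced, not the authors' implementation. The function name `stratified_split`, the five category names, and their sizes are hypothetical, chosen only so that the totals match the 9,700 images reported above.

```python
import random
from collections import defaultdict

def stratified_split(labels, fractions=(0.8, 0.1, 0.1), seed=0):
    """Return (train, val, test) index lists, split separately within each label.

    Hypothetical helper: the source only says the split is
    "stratified by pattern category".
    """
    rng = random.Random(seed)

    # Group sample indices by their category label.
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)

    train, val, test = [], [], []
    for label in sorted(by_label):
        idxs = by_label[label]
        rng.shuffle(idxs)
        n_train = round(len(idxs) * fractions[0])
        n_val = round(len(idxs) * fractions[1])
        train += idxs[:n_train]
        val += idxs[n_train:n_train + n_val]
        test += idxs[n_train + n_val:]  # remainder goes to the test set
    return train, val, test

# Assumed category sizes (illustrative only); they sum to 9,700.
category_sizes = {
    "category_a": 3000,
    "category_b": 2500,
    "category_c": 2000,
    "category_d": 1200,
    "category_e": 1000,
}
labels = [name for name, size in category_sizes.items() for _ in range(size)]

train, val, test = stratified_split(labels)
print(len(train), len(val), len(test))  # prints: 7760 970 970
```

Splitting inside each category keeps every category's share roughly equal across the three subsets. The counts match the table exactly here only because each assumed category size is a multiple of 10; with other sizes, per-category rounding can shift the totals by a few images.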