Table 5 Cold-Start Hyperparameter Settings

From: Large language models learning to write rhyming Tang poetry: A Xunzi Yayun R1 case study

Hyperparameter                 Description                    Parameter Value
Batch_size                     Batch size for training        2
Learning_rate                  Learning rate                  1e-4
Max_len                        Maximum context length         2048
Num_epochs                     Number of training epochs      3
gradient_accumulation_steps    Gradient accumulation steps    4
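
The settings in Table 5 can be collected into a small configuration object, which also makes the interaction between batch size and gradient accumulation explicit. The sketch below is illustrative only: the class name `ColdStartConfig`, the lowercase field names, and the helper `effective_batch_size` are hypothetical and not taken from the paper. It also assumes a single training device, which the table does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColdStartConfig:
    """Cold-start hyperparameters as listed in Table 5 (field names are hypothetical)."""
    batch_size: int = 2                    # Batch_size
    learning_rate: float = 1e-4            # Learning_rate
    max_len: int = 2048                    # Max_len (maximum context length)
    num_epochs: int = 3                    # Num_epochs
    gradient_accumulation_steps: int = 4   # gradient_accumulation_steps


def effective_batch_size(cfg: ColdStartConfig, num_devices: int = 1) -> int:
    """Examples contributing to one optimizer update.

    Assumption: gradients from `gradient_accumulation_steps` micro-batches
    are accumulated before each update; `num_devices` defaults to 1 because
    the table does not report the device count.
    """
    return cfg.batch_size * cfg.gradient_accumulation_steps * num_devices


cfg = ColdStartConfig()
print(effective_batch_size(cfg))  # 2 * 4 * 1 = 8
```

Under these assumptions each optimizer update sees 2 × 4 = 8 examples. With more than one device, the effective batch size scales by the device count.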