Table 4. Hyperparameters for the transformer models.
| Hyperparameter/setting | Value/description |
|---|---|
| Tokenizer | AutoTokenizer with max length of 128 tokens |
| Learning rate | 2e-5 |
| Batch size | 16 |
| Epochs | 10 |
| Evaluation metric | Accuracy, F1-Score |
| Loss function | Cross-entropy (used by trainer) |
| Optimizer | AdamW (default in trainer) |
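
The settings in Table 4 can be written down as a small configuration, shown here as a minimal sketch rather than the authors' actual code. The dictionary keys follow the parameter names of Hugging Face `TrainingArguments`. This mapping is an assumption, in particular reading the batch size as a per-device value. The F1 averaging mode is not stated in the table, so the macro average below is also an assumption, and the helper names `accuracy` and `macro_f1` are hypothetical.

```python
# Values taken from Table 4. Key names mirror Hugging Face
# `TrainingArguments` parameters (assumed mapping, not confirmed by the source).
TRAINING_KWARGS = {
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 16,  # assumption: batch size is per device
    "num_train_epochs": 10,
}
MAX_LENGTH = 128  # tokenizer maximum sequence length, in tokens


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (averaging mode is an assumption)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    y_true = [0, 1, 1, 0]
    y_pred = [0, 1, 0, 0]
    print(accuracy(y_true, y_pred), macro_f1(y_true, y_pred))
```

With the `transformers` library installed, `TRAINING_KWARGS` could be unpacked into `TrainingArguments` alongside an output directory of your choosing, and `MAX_LENGTH` passed as `max_length` with `truncation=True` when calling the tokenizer. The cross-entropy loss and AdamW optimizer need no explicit setting here, since the table lists them as the trainer's own choices.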