Table 1 The config of ViT-MAE.

From: Visual prediction method based on time series-driven LSTM model

Config

Value and Method

Optimizer

AdamW

Base learning rate

Le-3

Actual learning rate

2.5e-4

Weight decay

0.05

Optimizer momentum

\(\beta _{1}\), \(\beta _{2}\) =0.9, 0.95

Batch size

64

Learning rate schedule

Cosine decay

Loss function

MSE MAE

Input length

200

Patch size

\(16\times 16\)

Encoder layers

3

Dncoder layers

1

Masking ratio

85%