Table 5 Comparison of key component variants for multi horizon prediction on ETTm2 under the same backbone and training recipe. No JSD and KLD removes divergence based terms from module scoring and retains only normalized MSE and normalized MAE. Fixed threshold uses a single constant threshold shared by all modules and horizons, and the threshold is set as the median of warm up adaptive thresholds from the same run. Another fusion mode replaces weighted fusion with hard selection that copies parameters from the parent module with smaller module level fitness. Lower is better.

From: Hybrid evolutionary-gradient training improves long-term time series forecasting

Variant

96

192

336

720

Avg

MSE

MAE

MSE

MAE

MSE

MAE

MSE

MAE

MSE

MAE

Baseline

0.180

0.264

0.250

0.309

0.311

0.348

0.412

0.407

0.288

0.332

No JSD and KLD

0.179

0.265

0.256

0.309

0.308

0.363

0.412

0.411

0.289

0.337

Fixed threshold

0.191

0.279

0.263

0.312

0.323

0.353

0.434

0.421

0.303

0.341

Another fusion mode

0.186

0.272

0.266

0.304

0.316

0.353

0.412

0.412

0.295

0.335

EGMF-GR

0.177

0.262

0.241

0.301

0.304

0.342

0.407

0.399

0.282

0.326