Table 3 Top three complexity estimations. Among the 23 theoretical complexity measures tested, only a few exhibit moderate alignment with empirical generalization performance. This highlights a significant concern regarding the reliability of the theoretical estimation.
From: A practical generalization metric for deep networks benchmarking
Complexity measures | ImageNet | CIFAR-100 | |||
|---|---|---|---|---|---|
\(SE_g\) of generalization bounds | \(SE_g\) of 10th percentile | \(SE_g\) of generalization bounds | \(SE_g\) of 10th percentile | ||
ZeroShot% | INVERSE_MARGIN | 0.428571 | 0.321429 | 0.500000 | 0.464286 |
LOG_SUM_OF_FRO | 0.247638 | 0.163783 | 0.642857 | 0.607143 | |
PARAM_NORM | 0.366667 | 0.392857 | 0.714286 | 0.678571 | |
p-value | \(4.718e-10\) | \(8.786e-10\) | |||
SSIM | INVERSE_MARGIN | 0.285714 | 0.142857 | 0.466667 | 0.321429 |
LOG_SUM_OF_FRO | 0.714286 | 0.821429 | 0.166667 | 0.285714 | |
PARAM_NORM | 0.785714 | 0.892857 | 0.10 | 0.357143 | |
p-value | \(1.207e-10\) | \(1.504e-10\) | |||