Table 5 Results of different thawing training methods.

From: Counterclockwise block-by-block knowledge distillation for neural network compression

Freeze?

CIFAR-10

Tiny-imagenet-200

VGG-16 (88.5%)

Resnet-18 (85.0%)

VGG-16 (59.4%)

Resnet-18 (57.3%)

No

85.8%

82.4%

55.9%

54.2%

Yes(lr*0.1)

86.5%

83.4%

57.9%

55.0%

Yes(lr*0.25)

87.3%

84.8%

58.7%

55.2%

Yes(lr)

88.2%

85.6%

59.4%

55.7%