Fig. 2: No overfitting in deep networks.
From: Complexity control by gradient descent in deep networks

Empirical and expected error in CIFAR-10 as a function of number of neurons in a 5-layer convolutional network. The expected classification error does not increase when increasing the number of parameters beyond the size of the training set.