I am getting pretty awful results with resnet18 on digit recogniser.
epoch
train_loss
valid_loss
error_rate
time
0
0.759114
0.485745
0.142024
01:02
There is at least one reason for that I can think of. Training data is in gray scale, whereas nn trained for input of 3 channels. The workaround for that is as follows:
Again, a quick glance shows nothing significantly wrong. You trained for one epoch and got 86% accuracy. Train for longer and it will improve obviously.
Only thing I can think of is maybe your plotting code is wrong. But the predictions are probably fine as the confusion matrix looks OK.
Also, I will say your code could be cleaner, and you could probably select the learning rate using the learning rate finder rather than using a default learning rate.