The funny thing is, I just ran:
learn = create_cnn(data, models.resnet34, ps=0.1, wd=0.04, metrics=error_rate)
for 24 epochs this time, and finally got train_loss < valid_loss!
So that feels like success, but I don’t understand it! Does it make sense to you?
I’ll run with wd=0.001 now.