I did find this comment from Jeremy:
on this post:
Since Jeremy’s comment favors error_rate over validation loss this implies my best model is around epoch 100. This flips the question around:
If error_rate is the more important metric what does it mean that validation loss is getting worse and why should I ignore that in favor of error rate?