Approach to debugging a woeful model

Did you split your dataset correctly?

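It seems that the test set contains different drivers from the ones we get to train on. If your validation set shares drivers with your training set, your validation results will probably look much better than what you get on the test set, since the model can pick up on the individual driver rather than on what they are doing. Your validation set should not contain any drivers from your training set.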
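Here is a minimal sketch of what a driver-wise split could look like. It assumes you have a list of `(driver_id, image_name)` pairs; the function name `split_by_driver`, the toy IDs, and the 20% validation fraction are made up for illustration, so adapt them to however you load the driver listing.

```python
import random
from collections import defaultdict


def split_by_driver(records, valid_fraction=0.2, seed=0):
    """Split (driver_id, image_name) records so no driver lands in both sets."""
    # Group image names under the driver they belong to.
    by_driver = defaultdict(list)
    for driver_id, image_name in records:
        by_driver[driver_id].append(image_name)

    # Shuffle the drivers (not the images) and hold out a fraction of them.
    drivers = sorted(by_driver)
    random.Random(seed).shuffle(drivers)
    n_valid = max(1, round(len(drivers) * valid_fraction))
    held_out = set(drivers[:n_valid])

    train = [(d, img) for d in drivers if d not in held_out for img in by_driver[d]]
    valid = [(d, img) for d in drivers if d in held_out for img in by_driver[d]]
    return train, valid


# Toy records standing in for the real driver listing (hypothetical IDs).
records = [(f"p{d:03d}", f"img_{d}_{i}.jpg") for d in range(10) for i in range(5)]
train, valid = split_by_driver(records)

train_drivers = {d for d, _ in train}
valid_drivers = {d for d, _ in valid}
print(len(train), len(valid), train_drivers.isdisjoint(valid_drivers))
```

The point is that the shuffle happens over drivers, so every image of a held-out driver goes to validation together. A plain random split over images would leak the same people into both sets.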

If you want more pointers like this one, I recommend visiting the Statefarm thread: Statefarm kaggle comp

In the upcoming videos, Jeremy will show you a bunch of different ideas you can try with this dataset :slight_smile:

Cheers, and good luck!