I’m experiencing problems when I try to create my submission for a Kaggle competition.
I copied the Lesson 1 notebook and trained a learner which performed nicely on the validation set (.95 accuracy).
Then, I do the following to create a submission to a competition where you have to predict the classes of pictures (e.g. Plant Seedlings Classification).
probs = learn.predict(is_test = True)
preds = np.argmax(probs, axis = 1)
sub = [data.classes[pred] for pred in preds]
sample_sub = pd.read_csv(PATH + ‘sample_submission.csv’)
sample_sub[‘species’] = sub
sample_sub.to_csv(PATH+‘subm.csv’, index = False)
Can anyone see what I’m doing wrong? I end up with random noise, only <.1 accuracy on the leaderboard