When using fit_generator with batches, each individual batch can be created with shuffle=True or False separately. But when using fit(), you don’t get the option to shuffle or not shuffle the validation set independent of the training set.
So my question is, when setting shuffle=True above, is only the training data getting shuffled OR is the validation data set getting shuffled as well?
Just so I can test my understanding, isn’t it irrelevant if the validation data is being shuffled, since a) it’s not adjusting any weights stochastically using the validation data, and b) the accuracy number should be the same regardless of the order the validation set is tested in? Is this correct?
The validation set is just being used how well the trained model works on examples it hasn’t seen during training, and so it being shuffled is irrelevant.
Hi, thanks for the post.
If I split my data to train, validation and test. train and validation are used for training where validation is a specific dataset (not cross validated).
Test is used for model performance evaluation.
Do you mean that the shuffle should be done on both training and validation set?