In dogscats.zip
, I see the following count of files:
11500 data/dogscats/train/dogs
11500 data/dogscats/train/cats
1000 data/dogscats/valid/dogs
1000 data/dogscats/valid/cats
12500 data/dogscats/test1
In Andrew Ng’s Machine Learning course, he says to use a 60/20/20% train/valid/test set split.
Any reason why the validation set (2000 examples) is less than the test set (12500 examples)?