Hi everyone. I’ve added some bash commands to my .bashrc to make it easier to set up sample datasets. The main helpers are cpn and mvn, short for “copy n” and “move n” respectively. I thought others might be interested, so here is a gist:
Here is how it works. Assume you have already set up your full train/valid/test directories and you want to:
create the sample directory
copy 200 random dogs and cats into both the training and validation sample sets
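A rough sketch of what helpers like cpn and mvn could look like is below. This is not the code from the gist, just one possible way to write them. It assumes GNU `shuf` is installed, file names contain no newlines, and the argument order (`n`, source directory, destination directory) and the `_pickn` helper are made up for illustration.

```bash
# Hypothetical versions of cpn/mvn; the gist's real implementation may differ.
# Usage: cpn <n> <src_dir> <dest_dir>
#        mvn <n> <src_dir> <dest_dir>

# Print up to n randomly chosen regular files from src_dir, one per line.
# Assumes GNU shuf and file names without newlines.
_pickn() {
    local n="$1" src="$2"
    find "$src" -maxdepth 1 -type f | shuf -n "$n"
}

# Copy n random files from src_dir into dest_dir (created if missing).
cpn() {
    local n="$1" src="$2" dest="$3"
    mkdir -p "$dest"
    _pickn "$n" "$src" | while IFS= read -r f; do
        cp -- "$f" "$dest/"
    done
}

# Move n random files from src_dir into dest_dir (created if missing).
mvn() {
    local n="$1" src="$2" dest="$3"
    mkdir -p "$dest"
    _pickn "$n" "$src" | while IFS= read -r f; do
        mv -- "$f" "$dest/"
    done
}
```

With a layout such as `train/dogs` and `train/cats` (an assumed layout, adjust to yours), `cpn 200 train/dogs sample/train/dogs` would copy 200 random dog images into the sample training set, and the same call with `valid/dogs` and `sample/valid/dogs` would fill the sample validation set.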
Good one! I’ve found a command that moves a fixed share of the files in each subfolder to another master folder. I used it to take 10% of the samples from the training dataset subfolders (10 subfolders for 10 classes) and move them to the validation folder.
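One way such a split could be written is sketched below. This is an illustration, not the exact command referred to above. The function name `split_valid` and its arguments are hypothetical, it assumes GNU `shuf`, and it assumes you want the class subfolders recreated inside the validation folder.

```bash
# Hypothetical helper: move a percentage of the files in each class subfolder
# of train_dir into a subfolder of the same name under valid_dir.
# Usage: split_valid <train_dir> <valid_dir> <percent>
split_valid() {
    local train="$1" valid="$2" pct="$3"
    local class_dir name total n
    for class_dir in "$train"/*/; do
        [ -d "$class_dir" ] || continue          # skip if there are no subfolders
        name=$(basename "$class_dir")
        total=$(find "$class_dir" -maxdepth 1 -type f | wc -l)
        n=$(( total * pct / 100 ))               # integer division rounds down
        mkdir -p "$valid/$name"
        # Pick n files at random and move them (assumes no newlines in names).
        find "$class_dir" -maxdepth 1 -type f | shuf -n "$n" |
            while IFS= read -r f; do
                mv -- "$f" "$valid/$name/"
            done
    done
}
```

Calling `split_valid train valid 10` would then move 10% of each class from `train` into `valid`. Because the count is rounded down, a class with fewer than 10 files contributes nothing at 10%.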