In today’s session, the last question/issue we had was with knowing exactly how splits
and split_idx
would work inside of Datasets, DataLoader, etc. Specifically, we wanted to figure out how we might go about restricting a Transform
to a particular subset of our data. For example, there may be times when you do not want to apply particular augmentations to images in your validation set.
For this, we were looking at the source code for TfmdDL
and Datasets
and trying to understand how it’s all working. That’s exactly where we stopped and decided to continue the next day.
But I did some digging afterwards and wrote this blog about dealing with that exact issue, and understanding a few more subtleties about spreading your transforms among your Datasets
splits.
Here it is: Using separate Transforms for your training and validation sets in fastai2
This blog is a bit technical and goes slightly towards the advanced side of things. Also, it’s kind of rough at this point simply because I wanted to finish it off quickly . But I guess it gets the point across.
Any feedback is very much appreciated!
(I’ll also add it to the wiki)