[Solved] What's the easiest way to get the list of file names in the training dataset

I always appreciate the more Pythonic way of doing things. This led to some great, albeit obvious, insights (correct me if I’m wrong):

  • dataset items are tuples! i.e. (image (size), breed index), which led me to…
  • data.train_ds[0][0] (the image) is the transformed version of the original image data.train_ds.ds[0][0]
  • when viewing the cropped version in data.train_ds[0][0], it changes every time. I’m putting all the pieces together now and presume that the “DatasetTfm” contains the original dataset (data.train_ds.ds) and a transformer “tfm”, which is applied to the original image each time an item is read from data.train_ds.
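
To make the presumption above concrete, here is a minimal sketch of a wrapper that behaves the way I described. It is not fastai's actual DatasetTfm code: the class name `TransformedDataset`, the `random_crop` helper, and the toy 4x4 "image" are all made up for illustration. It only assumes what the bullets observe, namely that items are (image, label) tuples, that the original dataset is kept in `.ds`, and that the transform is applied on every access.

```python
import random


class TransformedDataset:
    """Hypothetical stand-in for a DatasetTfm-style wrapper (not fastai's code)."""

    def __init__(self, ds, tfm):
        self.ds = ds      # the original, untransformed dataset
        self.tfm = tfm    # callable applied to the image on every access

    def __len__(self):
        return len(self.ds)

    def __getitem__(self, idx):
        x, y = self.ds[idx]        # each item is an (image, label) tuple
        return self.tfm(x), y      # transform applied on access; nothing is cached


def random_crop(img, size=2):
    """Crop a random size x size window from a 2-D list used as a toy image."""
    top = random.randint(0, len(img) - size)
    left = random.randint(0, len(img[0]) - size)
    return [row[left:left + size] for row in img[top:top + size]]


# Toy data: one 4x4 "image" with label 7 (made-up values).
original = [([[r * 4 + c for c in range(4)] for r in range(4)], 7)]
train_ds = TransformedDataset(original, random_crop)

image, label = train_ds[0]       # transformed item; the crop may differ per access
raw_image, _ = train_ds.ds[0]    # original item, the same on every access
```

Because the crop is drawn inside `__getitem__`, reading `train_ds[0][0]` repeatedly can give a different window each time, while `train_ds.ds[0][0]` stays untouched. That would match what I saw, if the real class works along these lines.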

Apologies if I’ve just stated the obvious, but might be insightful for anyone else like me starting out.
