[Solved] What's the easiest way to get the list of file names in the training dataset

danield · October 25, 2018, 4:17am

I always appreciate the more Pythonic way of doing things. This led to some great, albeit obvious, insights (correct me if I’m wrong):

dataset items are tuples! i.e. (image (size), breed index), which led me to…
data.train_ds[0][0] (the image) is the transformed version of the original image data.train_ds.ds[0][0]
when viewing the cropped version in data.train_ds[0][0], it changes every time.

I’m putting all the pieces together now and presume that the “DatasetTfm” contains the original dataset and a transformer “tfm”, which is applied to the original image (from data.train_ds.ds) whenever an item is read through data.train_ds.
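
For anyone who wants to see that idea in isolation, here is a minimal sketch of a wrapper dataset that behaves the way described above. It is not fastai's actual code: `TransformedDataset`, `random_crop`, and the toy data are hypothetical names made up for illustration, and the "images" are plain lists rather than real image objects. The only assumption carried over from the post is that items are (image, label) tuples and that the transform runs each time an item is accessed.

```python
import random


class TransformedDataset:
    """Hypothetical stand-in for a DatasetTfm-style wrapper (not fastai's code)."""

    def __init__(self, ds, tfm):
        self.ds = ds    # the original, untransformed dataset
        self.tfm = tfm  # callable applied to the image on every access

    def __len__(self):
        return len(self.ds)

    def __getitem__(self, idx):
        image, label = self.ds[idx]    # items are (image, label) tuples
        return self.tfm(image), label  # the transform is re-applied each time


def random_crop(image, size=2):
    """Toy transform: take a random contiguous slice of a 1-D 'image'."""
    start = random.randint(0, len(image) - size)
    return image[start:start + size]


# Toy data: each "image" is a list of pixel values, each label a breed index.
original = [([0, 1, 2, 3, 4, 5], 7), ([10, 11, 12, 13, 14, 15], 3)]
train_ds = TransformedDataset(original, random_crop)

print(train_ds.ds[0][0])  # the original image, identical on every access
print(train_ds[0][0])     # a random crop, which may differ between calls
print(train_ds[0][1])     # the label is passed through unchanged
```

Because the crop is computed inside `__getitem__`, nothing transformed is kept around between accesses, which would explain why the image looks different every time while the `.ds` version stays the same.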

Apologies if I’ve just stated the obvious, but this might be helpful for anyone else like me who is just starting out.