Hey guys, greenhorn here.
I am trying to use the datablock API for a Kaggle competition. It seems to me that many kaggle datasets have the images archived in zip folders.
It would be extremely convenient if I could use the API with these archives something like…
data = (ImageList.from_csv(PATH, 'train.csv', folder='images.zip', suffix='.png') .split_by_folder(train='train_images.zip', valid='valid_images.zip') .label_from_df() .transform(size=224) .databunch())
This gives me a basic error which I interpret to mean I am not loading anything successfully.
IndexError: index 0 is out of bounds for axis 0 with size 0
Is there a method for loading zip archives into the data_block? Or should I unzip them with some other method, and load them into the data_block in some other way.