This works for me with an fresh paperspace instance. You could try untar_data(URLs.MNIST_SAMPLE, force_download=True) to re-download the .tgz which might be broken or alternatively run untar_data(URLs.MNIST_SAMPLE, archive='tmp' data='tmp') to change the download / data folder and have a ‘fresh start’ this way.
If this doesn’t work you would need to give us more information about the error, so post the whole stack trace otherwise we can only guess what’s the issue
The first suggestion produced the same error but your second suggestion worked like a charm! Cell executed w/o error and I was able to move forward. Thanks, Ben!
Sure thing. Attached are two screenshots that should give you the info you seek.
Are you surprised by the results?
(p.s. Also, thanks very much for your help. I’ve been working in isolation on Fastai for 3 months in my spare time and you’re the first community member I’ve interacted with to date! Big confidence booster for me!)
Sorry for the delay… Apparently calling untar_data with those parameters makes fastai_cfg return the changed paths, but I was interested in the default ones
I also don’t want to take more of your time, so I just leave this here in case someone tries to resolve a similar issue:
The default paths are saved in a config.ini file at fastai_cfg().config_file
from fastai.data.all import fastai_cfg
!cat {fastai_cfg().config_file}
for me returns
[DEFAULT]
archive = /storage/archive/
model = /storage/model/
data = /storage/data
storage = /storage/data/
on paperspace. You can also access those paths as described in my previous post or with fastai_cfg()['data'] for example. Note that ‘non absolute’ paths in the config.ini are usually relative to ~/.fastai.
…anyways, when something went wrong with the download or extraction you can check these locations and remove the concerned file/folder and start over.
Also, changes made by untar_data are not persistent, so even if you’re happy with the solution above, you have to remember to pass the same folders to untar_data if you want to use the downloaded data in a different notebook .
You could make persistent changes to the config with:
fastai_cfg().__setitem__('data', 'something') # the other paths respectivly
fastai_cfg().save()
(or use an editor to change config.ini) and wouldn’t have to pass the archive and data parameters to untar_data.