Lesson 4 In-Class Discussion

utkb · September 2, 2018, 7:33am

Later in the notebook, the commands:
IMDB_LABEL = data.Field(sequential=False)
splits = torchtext.datasets.IMDB.splits(TEXT, IMDB_LABEL, 'data/')
seem to want to download a file named aclImdb_v1.tar.gz.

I just renamed aclImdb.tgz to aclImdb_v1.tar.gz within the ./data/ folder, and it seems to work fine without re-downloading anything.
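
In case it helps anyone, here is a minimal sketch of the same workaround in Python, using only the standard library. The function name `ensure_expected_archive` is my own (hypothetical), and the two file names are simply the ones from my setup above. It copies the archive instead of renaming it, so the original aclImdb.tgz stays in place. I am assuming torchtext looks for the archive directly inside the folder passed to `splits`, which is what I observed, but I have not checked this against the torchtext source.

```python
import shutil
from pathlib import Path


def ensure_expected_archive(data_dir,
                            existing_name="aclImdb.tgz",
                            expected_name="aclImdb_v1.tar.gz"):
    """Make the already-downloaded archive available under the expected name.

    Assumption: both file names are the ones from this post; adjust them
    if your local files are named differently.
    """
    data_dir = Path(data_dir)
    existing = data_dir / existing_name
    expected = data_dir / expected_name

    # Nothing to do if a file with the expected name is already there.
    if expected.exists():
        return expected

    if not existing.exists():
        raise FileNotFoundError(
            f"Neither {expected} nor {existing} was found")

    # Copy rather than rename, so the original archive is left untouched.
    shutil.copy2(existing, expected)
    return expected
```

Calling `ensure_expected_archive('data/')` once before the `torchtext.datasets.IMDB.splits(...)` line should have the same effect as the manual rename.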

Note that when I googled aclImdb_v1.tar.gz, I found this file, which does not seem to be the right file to use…! Maybe it is just a different / outdated file for a previous version of the example? This Stanford file was breaking the following commands:
t = splits[0].examples[0]
t.label, ' '.join(t.text[:16])

Thanks.