Pytorch 0.3 upgrade needed

jeremy · December 7, 2017, 10:17pm

Our next lesson will be using pytorch 0.3. If you do a conda env update it will upgrade you automatically. Let me know if you notice any new issues.

Hopefully you’ll find some of your notebooks are somewhat faster, and also any memory issues you may have had may be resolved.

bushaev · December 7, 2017, 10:37pm

Release Notes for anyone interested.

wgpubs · December 8, 2017, 1:23am

The environment.yml file is still at >=0.2.0 … so conda env update won’t upgrade to 0.3.

I think you either want to change it to >=0.3.0 or have folks do a conda update --all (which apparently may break things for some folks).

jeremy · December 8, 2017, 1:25am

Have you tried it? It worked for me.

wgpubs · December 8, 2017, 1:31am

Yah

conda env update

conda list pytorch

(fastai) $conda list pytorch
# packages in environment at /development/_tools/anaconda/envs/fastai:
#
pytorch                   0.2.0                py36_4cu75    soumith

jeremy · December 8, 2017, 1:32am

Ah I think you haven’t done a git pull yet?

jeremy · December 8, 2017, 1:32am

Correction - I haven’t pushed the change yet! Coming right up

wgpubs · December 8, 2017, 1:34am

You got me before I could say this. I’ve gotten bit by this so much i’ll do a git status back-to-back just to make sure there is nothing to commit or push.

Thanks.

jeremy · December 8, 2017, 1:36am

OK just pushed the fix. Sorry about that.

kcturgutlu · December 8, 2017, 7:43am

Introduced torch.erf and torch.erfinv that compute the error function and the inverse error function of each element in the Tensor.

This is interesting, I wonder if they add this after seeing it’s been used in Porto Seguro Kaggle competition winning solution

Moody · December 10, 2017, 7:13am

The predict_with_targs(True) is not working after upgrade.

Moody · December 10, 2017, 7:32am

learn.fit is not working either.

jeremy · December 10, 2017, 6:21pm

These do not appear related @Moody . Your 2nd problem appears to be because you have multiple labels for some rows, and the first appears to be because your test set isn’t set correctly.

anurag · December 11, 2017, 8:32pm

Crestle pytorch has also been updated to 0.3.

layla.tadjpour · December 12, 2017, 6:28am

I upgraded to pytorch 0.3 (using conda env update) but get this error when run
lesson2-image_models ( cell 16:lrf=learn.lr_find(), learn.sched.plot() )

RuntimeError Traceback (most recent call last)
in ()
----> 1 lrf=learn.lr_find()
2 learn.sched.plot()

~/fastai/courses/dl1/fastai/learner.py in lr_find(self, start_lr, end_lr, wds)
234 layer_opt = self.get_layer_opt(start_lr, wds)
235 self.sched = LR_Finder(layer_opt, len(self.data.trn_dl), end_lr)
–> 236 self.fit_gen(self.model, self.data, layer_opt, 1)
237 self.load(‘tmp’)
238

~/fastai/courses/dl1/fastai/learner.py in fit_gen(self, model, data, layer_opt, n_cycle, cycle_len, cycle_mult, cycle_save_name, metrics, callbacks, use_wd_sched, **kwargs)
143 n_epoch = sum_geom(cycle_len if cycle_len else 1, cycle_mult, n_cycle)
144 fit(model, data, n_epoch, layer_opt.opt, self.crit,
–> 145 metrics=metrics, callbacks=callbacks, reg_fn=self.reg_fn, clip=self.clip, **kwargs)
146
147 def get_layer_groups(self): return self.models.get_layer_groups()

~/fastai/courses/dl1/fastai/model.py in fit(model, data, epochs, opt, crit, metrics, callbacks, kwargs)
84 batch_num += 1
85 for cb in callbacks: cb.on_batch_begin()
—> 86 loss = stepper.step(V(x),V(y))
87 avg_loss = avg_loss * avg_mom + loss * (1-avg_mom)
88 debias_loss = avg_loss / (1 - avg_mombatch_num)

~/fastai/courses/dl1/fastai/model.py in step(self, xs, y)
41 if isinstance(output,(tuple,list)): output,*xtra = output
42 self.opt.zero_grad()
—> 43 loss = raw_loss = self.crit(output, y)
44 if self.reg_fn: loss = self.reg_fn(output, xtra, raw_loss)
45 loss.backward()

~/anaconda3/envs/fastai/lib/python3.6/site-packages/torch/nn/functional.py in binary_cross_entropy(input, target, weight, size_average)
1177 weight = Variable(weight)
1178
-> 1179 return torch._C._nn.binary_cross_entropy(input, target, weight, size_average)
1180
1181

RuntimeError: Expected object of type Variable[torch.cuda.FloatTensor] but found type Variable[torch.cuda.LongTensor] for argument #1 ‘target’

jeremy · December 12, 2017, 6:57am

@layla.tadjpour sorry about that! Should be fixed now.

layla.tadjpour · December 12, 2017, 9:05am

yes it is working now. Thanks

rfa123c · March 16, 2018, 3:46pm

Could someone test this to see if it’s still working? I’m getting the same error and I think I’ve got everything up to date.

Thanks in advance.

pdr8k1 · March 16, 2018, 7:20pm

I’m getting the same error after doing a git pull and conda env update yesterday. FYI -
I’m using Ubuntu on Google Cloud

rfa123c · March 17, 2018, 5:40pm

Yes, that was precisely what happened. Also using ubuntu but with the fast.ai template for paperspace.