A walk with fastai2 - Vision - Study Group and Online Lectures Megathread

Thanks for your response :slight_smile:

Sure, that makes sense. I mentioned the Data Blocks API because if I pass in a DataFrame like the one shown here, I get a KeyError when forming the dblock.dataloaders(), since -100 is not part of the vocab for any of the columns.


Ah, that would make sense. I'd recommend a custom vocab then, if you want to go that route (pass said vocab to CategoryBlock(vocab=)); otherwise no :confused:
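Something like this rough sketch (the DataFrame/column names here are made up, not from the notebook):

vocab = sorted(df['label'].unique().tolist()) + [-100]  # include the "ignore" value

dblock = DataBlock(blocks=(ImageBlock, CategoryBlock(vocab=vocab)),
                   get_x=ColReader('image'),   # hypothetical column
                   get_y=ColReader('label'))   # hypothetical column
dls = dblock.dataloaders(df)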

Hmm, I see. That is a bit awkward, since it doesn't make sense if you already have a dedicated NA class. However, I guess if the loss function will ignore it anyway, it doesn't matter; it won't take up any part of the probability space in the softmax activation?

I'll give it a shot anyway and report back here.

Honestly not sure, that's out of my realm/scope for right now :slight_smile:

Keep us updated

Definitely.

I'm not sure how the library is calling the batches during training, because when I run the following chunk in the notebook, the shape of yb is definitely wrong.

I think it would be a super neat addition to the notebook to add an example of how the batch is actually assembled during training, and an example use of CombinationLoss(), since given the above I can't test the CombinationLoss() function before actually training.

It's exactly as you have it, so:

xb, yb = next(iter(dl))

Which is also what one_batch does

To take this further: for any loss function, simply have it accept an input and output and print what they are :wink: (but you may already know this one)
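Something like this throwaway sketch, for instance (fastai2 calls the loss as loss_func(preds, *targets), which is what the signature below mirrors):

import torch
import torch.nn as nn

class PrintShapes(nn.Module):
    # A stand-in "loss" that just reports what it gets called with.
    def forward(self, preds, *targets):
        print('preds:', getattr(preds, 'shape', type(preds)))
        for i, t in enumerate(targets):
            print(f'target {i}:', getattr(t, 'shape', type(t)))
        return torch.tensor(0., requires_grad=True)  # dummy loss value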

Hah, I'm a bit confused. I'd love to just pop the xs and ys into the loss func, but I think my sizes are off.

To be precise, next(iter(dls)) returns a TfmdDL object and not a batch. Even after I grab a batch, I can't plug it into the CombinationLoss() straight away.

Here's the code I'm running:

dls = bengel.dataloaders(train.sample(1000), bs=bs)

batch = dls.one_batch()
xb = batch[0]
yb = batch[1]

# xb.shape ==> torch.Size([128, 1, 128, 128])
# yb.shape ==> torch.Size([128])

I expected yb to be a list of 3 elements of size 128 each, or a stack of 3 tensors of size 128. Looking at the contents of yb, it appears these are only the ground truths for grapheme, not vowel or consonant.

If I try running this through CombinationLoss() like so:

loss_func = CombinationLoss()
loss_func(xb,yb)

I run into a ValueError: Expected input batch_size (1) to match target batch_size (128).

To take a step back, am I understanding CombinationLoss()'s forward correctly? I've jotted my thoughts in the comments:

def forward(self, xs, *ys, reduction='mean'):
    # xb.shape ==> (batch_size, 1, 128, 128)
    # what shape/type should `ys` be?
    # a list of 3 tensors of size (batch_size, truths)?
    for i, w, x, y in zip(range(len(xs)), self.w, xs, ys):
        # this loop will run three times, calculating the loss for
        # grapheme, vowel, and consonant, and summing them up
        if i == 0: loss = w*self.func(x, y, reduction=reduction)
        else: loss += w*self.func(x, y, reduction=reduction)
    return loss
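(For reference, I assume the attributes that forward relies on are set up roughly like this; the defaults are my guesses, not necessarily the notebook's actual values:)

import torch.nn as nn
import torch.nn.functional as F

class CombinationLoss(nn.Module):
    def __init__(self, func=F.cross_entropy, weights=(1., 1., 1.)):
        super().__init__()
        self.func, self.w = func, weights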

dls holds both DataLoaders (it's a DataLoaders object), so this makes sense. You should do dls[0] (train), dls[1] (valid), or .train, etc.
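In other words, something like:

batch = dls.train.one_batch()  # same idea as next(iter(dls[0]))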

How are you building your DataLoaders? What's your DataBlock like? :slight_smile: (Is it just my example notebook?)

Ahhh, gotcha. That makes sense re: dls.
Yeah, I'm using your notebook, nothing changed; I just ran these code blocks after the definition of the loss function.


Okay, I'll look into and debug that notebook tomorrow then and get back to you, as that shouldn't be what's happening. Thanks!


Awesome. Thank you!

Actually, I made a silly error :smiley:

batch = dls.one_batch()
len(batch)  # ==> 4
x = batch[0]
# ys = batch[1]  # len ==> 1 (of course!)
ys = batch[1:]   # len ==> 3
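With that, the loss can be exercised directly, e.g. (assuming model is the notebook's multi-head model returning a tuple of 3 outputs):

preds = model(x)              # assumed: tuple of 3 head activations
loss = loss_func(preds, *ys)  # matches forward(self, xs, *ys)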

On further thought, this shouldn't be done, because your model would then have an extra unit in the final layer making predictions for the ignore_index element.
I think tweaking Categorize to support ignore_index is the correct approach.
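For context, -100 is exactly what PyTorch's cross entropy ignores by default, so no extra class (and no extra final-layer unit) is needed, e.g.:

import torch
import torch.nn.functional as F

preds = torch.randn(4, 3)                # 4 samples, 3 real classes
targets = torch.tensor([0, 2, -100, 1])  # third sample is unlabelled
loss = F.cross_entropy(preds, targets, ignore_index=-100)  # row 3 skipped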


Could somebody please remind me how to add the test dataset to the dataloader after training? I.e. my training data has been split into training and validation sets (originally in the same folder, 80/20), and now I would like to know how well my model works on the test set, which is in another folder, i.e. get predictions and metrics.

test_dl = learn.dls.test_dl(items)

learn.validate(dl=test_dl)

:slight_smile:
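If the test folder is labeled and you want metrics out of validate, you'll likely also want with_labels=True. A sketch, assuming get_image_files can grab the items and the same labeling function applies to them:

items = get_image_files(path/'test')  # hypothetical test folder
test_dl = learn.dls.test_dl(items, with_labels=True)
learn.validate(dl=test_dl)            # loss + metrics
preds, targs = learn.get_preds(dl=test_dl)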


@muellerzr if you have time, can you please look into this? Ignore the code inside show_results; once it gets called I can work on that. The typedispatch system is what's causing this error.

I'm not sure there. Sylvain would be more help, I haven't faced that error before :confused:


I am building a simple cat vs dog classifier BUT using the multi-cat example, so that if the picture is not a dog or a cat I get no output. This is my learner and its threshold:

learn = cnn_learner(dls, resnet34, pretrained=True, metrics=[partial(accuracy_multi, thresh=0.9)]).to_fp16()

I trained the model and tested it with a random pic. The prediction is NOT above 0.9 but it still says 'Dog'… what am I missing here?

Makes no sense to me

That's because that's done on the loss function, not the metric :slight_smile: So you need to adjust the threshold of the BCEWithLogitsLossFlat loss function (I had this same question myself last week or so).
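Something along these lines (a sketch; for a multi-label DataBlock fastai picks BCEWithLogitsLossFlat by default, so this just overrides its thresh):

learn = cnn_learner(dls, resnet34, pretrained=True,
                    loss_func=BCEWithLogitsLossFlat(thresh=0.9),
                    metrics=[partial(accuracy_multi, thresh=0.9)]).to_fp16()

If I remember right, predict/get_preds decode the raw outputs through the loss function's activation and decodes methods, and that's where the cut-off actually gets applied.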


Can you explain a bit more? My understanding is that the loss function is what we are trying to minimize, and the metric is just a proxy for how well our model is working. So I would have said it belongs to the metric, not to the loss…

I re-trained the model again… how can I prevent it from being so confident about the wrong class? I find this really concerning.
