Multilabel classification: validation loss increasing along with accuracy

I’m training a multilabel classifier with accuracy_thresh and fbeta as metrics. However, the validation loss is increasing while the metrics are improving. Any idea what is happening here?

Can you share your code?

That can happen, theoretically at least.

It would mean that the model is getting more certain about the correct results and at the same time making more mistakes

EDIT: Everything I said but the other way around lol

Accuracy doesn’t care how certain the results are, right? It’s either right or wrong (which is why it’s not a great loss function). The loss function, on the other hand, does care about the certainty.
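A tiny numeric sketch makes this concrete (the predictions and labels below are invented for illustration, not from the original poster’s model): thresholded accuracy can go up between two epochs while mean binary cross-entropy goes up too, because one confidently wrong prediction dominates the loss.

```python
import math

def bce(p, y):
    """Binary cross-entropy for a single prediction p against label y."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

def accuracy_thresh(preds, targets, thresh=0.5):
    """Fraction of predictions on the right side of the threshold."""
    return sum((p > thresh) == bool(y) for p, y in zip(preds, targets)) / len(preds)

targets = [1, 1, 1, 1, 0]

# Epoch A: two positives fall just below the threshold
preds_a = [0.55, 0.55, 0.45, 0.45, 0.30]
# Epoch B: every positive now clears the threshold, but the model
# has become very confidently wrong on the single negative
preds_b = [0.60, 0.60, 0.60, 0.60, 0.999]

loss_a = sum(bce(p, y) for p, y in zip(preds_a, targets)) / len(targets)
loss_b = sum(bce(p, y) for p, y in zip(preds_b, targets)) / len(targets)

print(accuracy_thresh(preds_a, targets), accuracy_thresh(preds_b, targets))  # 0.6 0.8
print(round(loss_a, 2), round(loss_b, 2))  # 0.63 1.79
```

Accuracy improves from 0.6 to 0.8, yet the mean loss rises, driven entirely by the 0.999 prediction on the negative sample.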

Should I choose a threshold such that the loss decreases while the metrics increase?

It’s the standard code shown in the lesson.

In my work, I generally take the point where validation loss stops improving to mean that further epochs aren’t really improving anything. An increase in metrics is likely a coincidence from that point forward, and fitting further will actually hurt the performance of ensemble models (very common in my field) that use the current model’s predictions as inputs.
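That policy can be sketched as patience-based early stopping on the validation loss. This is a minimal pure-Python illustration, not fastai’s actual early-stopping callback; the loss curve below is invented.

```python
def best_epoch(val_losses, patience=2):
    """Index of the epoch whose weights to keep: stop once validation
    loss has failed to improve for `patience` consecutive epochs."""
    best_loss, best_i, bad_epochs = float("inf"), 0, 0
    for i, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss, best_i, bad_epochs = loss, i, 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                break
    return best_i

# Invented validation-loss curve: improves for three epochs, then drifts up
val_losses = [0.70, 0.55, 0.48, 0.50, 0.53, 0.57]
print(best_epoch(val_losses))  # 2
```

Epochs after index 2 are treated as no longer improving anything, even if a metric happens to tick up during them.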

You mean to say that I should choose the model with the lowest valid_loss even though it might not have the best fbeta?

No, I’m not saying that. I was just trying to say that the results you’re getting are theoretically possible and it’s not a bug.

This happens a lot. As long as the metric you care about improves, you don’t need to worry about the loss; if both go in the wrong direction, then the model is overfitting. Jeremy explained this in one of the videos; I will link it if I find it.

@vijayabhaskar that’ll be helpful if you can share the link :slight_smile:

I don’t agree with @vijayabhaskar about “as long as the metric you care about increases you don’t need to worry about the loss”.

There are some other things to check:

  1. Is your validation set sufficiently representative of unseen scenarios?
  2. Are you experiencing the same behaviour with cross-validation?
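For suggestion 2, the fold bookkeeping behind cross-validation is simple enough to sketch in pure Python (shuffling and stratification are left out for brevity):

```python
def kfold_indices(n, k):
    """Split range(n) into k folds; yield (train_idx, val_idx) per fold."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    idx = list(range(n))
    start = 0
    for size in fold_sizes:
        val_idx = idx[start:start + size]
        train_idx = idx[:start] + idx[start + size:]
        yield train_idx, val_idx
        start += size

# Each sample appears in exactly one validation fold
folds = list(kfold_indices(10, 3))
print([len(v) for _, v in folds])  # [4, 3, 3]
```

If fbeta (or the loss trend) varies wildly across folds, the single validation split probably isn’t representative.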

I think the model is slightly overfitting, and this is why the validation loss is increasing. Even if it returns a high accuracy, I wouldn’t say it will generalize better to unseen scenarios. That said, I can say it isn’t a severe overfitting issue (that’s why I say slightly).

Yeah, you’re right. Past me is wrong.
