Hi Nilesh, welcome in our community!
It is the probability that the model assigns to the “best” predicted class, that is, the class characterized by the highest probability. I used quotation marks since it is to be considered best in a somewhat bad sense.
I didn’t quite understand what you meant by “it seems like the probability of the actual class in the prediction”, that is, I’m not catching the difference between:
- probability of the predicted class
- probability of the actual class in the prediction
But just to clarify, let me say that, as you probably know, we don’t use a softmax in a multi-labeled setting. We use sigmoids cut off with a threshold. Every class that goes above that threshold should be accepted as a predicted class.
But as I implemented plot_multi_top_losses I somewhat thought it was best to pick the class with the maximum absolute probability, since I always intended to use it as a debugging tool, and the threshold is (regrettably) specified arbitrarily. If that “best” class is not amongst the actual ones (whatever their number), then that’s the key information for debugging, and it doesn’t matter so much if other classes would go above the threshold. It is not difficult to make it show the probs relative to all the classes above the thresh. Feel free to submit a PR, making it an optional flag, if you want.
Thanks!