Error while Implementing softmax as a formula in Torch

ishan · September 6, 2020, 4:34pm

Hi,
I am implementing softmax as a formula and I get nan on the first row consistently. This results in the loss being nan as well. I tried adding a small number to the numerator, which did not help. Using nn.softmax resolves the problem and the loss values is a scalar as well. The formula implementation works fine while using seperately with random numbers. I wonder if it has to do with numerical instability. I have attached couple of images to demonstrate the code.
Thank you for the help!

Pomo · September 6, 2020, 7:01pm

I imagine because e^115 is very, very big.

ishan · September 6, 2020, 9:12pm

Very true.

orendar · September 6, 2020, 9:29pm

Hey ishan,

You just found out why it’s common to subtract the maximum before doing the exp - to prevent numerical explosion. Just add a first line in softmax saying something like preds -= preds.max() and you should be good to go

ishan · September 6, 2020, 10:04pm

That makes sense. Thank you very much!
@orendar