Crossentropy loss and Softmax

No takers on this question? Nor on my likely related one

I have not been able to figure out the answers on my own.

If no one replies soon, I’ll take them over to stackoverflow and post the replies back here.

Thanks for reading!