Hi,
I have created a simple two-layer neural network from scratch, but my training accuracy does not improve as the number of epochs increases. I suspect there is a problem with how I calculate the gradient of the cross-entropy loss with softmax. Can someone help me out?
Below is the formula for my gradient calculation: