"Be Careful What You Backpropagate" paper

This is an interesting paper from the past couple of days:

The gist is that maybe we shouldn’t be using a softmax activation in the last layer of the network, since it might not produce the best gradients for training the rest of the model, and accuracy is usually what we care about most.

I haven’t tried to replicate the results in the paper, and they maybe don’t test their methods on an exhaustive set of problems (or with particularly realistic network architectures/optimizers). But this does seem like a pretty interesting idea: if we care about accuracy, is softmax really the best activation? And is cross-entropy (CE) really the best loss?
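To make the kind of swap concrete, here’s a minimal Keras sketch: the usual softmax + categorical cross-entropy head next to a linear output trained with squared error on one-hot targets. This is just one illustrative alternative pairing, not necessarily the scheme the paper actually proposes, and the architecture/optimizer choices here are arbitrary:

```python
from tensorflow.keras import layers, models

def make_model(output_activation, loss):
    # Tiny fully connected net for MNIST-shaped inputs; sizes are arbitrary.
    model = models.Sequential([
        layers.Flatten(input_shape=(28, 28)),
        layers.Dense(128, activation="relu"),
        layers.Dense(10, activation=output_activation),
    ])
    # categorical_accuracy just compares argmaxes, so it works for any head/loss pairing.
    model.compile(optimizer="adam", loss=loss, metrics=["categorical_accuracy"])
    return model

baseline = make_model("softmax", "categorical_crossentropy")  # the conventional setup
variant = make_model("linear", "mean_squared_error")          # one possible alternative
```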

It might be interesting to do some kind of search over the space of activation/loss functions to see whether any combination beats softmax/CE on accuracy. It would also be interesting to see whether other activations/losses produce better representations than softmax/CE. (Strong evidence that they do: https://arxiv.org/pdf/1704.08063.pdf)
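A brute-force version of that search could be as simple as looping over candidate (activation, loss) pairs and comparing test accuracy. Rough sketch below, reusing the make_model helper from the previous snippet; the candidate list, epoch count, etc. are placeholders rather than anything principled:

```python
from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical

(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0
y_train, y_test = to_categorical(y_train, 10), to_categorical(y_test, 10)

# Candidate output-activation / loss pairings to compare on accuracy.
candidates = [
    ("softmax", "categorical_crossentropy"),
    ("linear", "mean_squared_error"),
    ("softmax", "mean_squared_error"),
    ("linear", "categorical_hinge"),
]

for act, loss in candidates:
    model = make_model(act, loss)
    model.fit(x_train, y_train, epochs=3, batch_size=128, verbose=0)
    _, acc = model.evaluate(x_test, y_test, verbose=0)
    print(f"{act:>8} + {loss:<24} test accuracy: {acc:.4f}")
```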

Would be very interested to hear about any work that’s been done in this direction, in either the academic or Kaggle communities (I somehow suspect the latter may have thought about this more, since they really care about accuracy).

EDIT: FWIW, the results in Table 1 of the paper are a little weird – if you run the Keras MNIST CNN example, you get to < 1% error in < 10 epochs. So their results might be real but not realistic, as I said above. Either way, I still think it’s an interesting line of thought.
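For reference, the baseline I have in mind is roughly the stock mnist_cnn.py example from the Keras repo. Something along these lines (layer sizes recalled from memory, optimizer swapped for Adam, so treat it as an approximation) should get under 1% test error within about 10 epochs:

```python
from tensorflow.keras import layers, models

cnn = models.Sequential([
    layers.Conv2D(32, (3, 3), activation="relu", input_shape=(28, 28, 1)),
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(10, activation="softmax"),
])
cnn.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
# Assuming the MNIST arrays from the earlier snippet (add a channel axis for the conv layers):
# cnn.fit(x_train[..., None], y_train, epochs=10, validation_data=(x_test[..., None], y_test))
```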

Thanks for sharing the paper.

You may like the thread here. It discusses a toy dataset that’s easily fitted using a tanh activation, as opposed to the conventional choice of a ReLU.
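Roughly the flavour of thing it shows (with a made-up sin(x) toy regression standing in for the thread’s actual dataset, so just an illustration): a smooth tanh hidden layer may find the target easier to fit than a same-sized ReLU one.

```python
import numpy as np
from tensorflow.keras import layers, models

# Toy 1-D regression target; purely illustrative, not the linked thread's dataset.
x = np.linspace(-np.pi, np.pi, 512).reshape(-1, 1)
y = np.sin(x)

for act in ["tanh", "relu"]:
    net = models.Sequential([
        layers.Dense(8, activation=act, input_shape=(1,)),
        layers.Dense(1),
    ])
    net.compile(optimizer="adam", loss="mse")
    net.fit(x, y, epochs=500, verbose=0)
    print(act, "final MSE:", net.evaluate(x, y, verbose=0))
```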

I tried to reproduce the results for MNIST - http://anishshah.github.io/ml/2017/07/17/Gradient-Boosting.html

Cool. Would be very interesting to see your experiments repeated with a more realistic network architecture/problem. Correct me if I’m wrong, but I feel like we shouldn’t really care much about the method’s performance on this small FCN – I don’t think there’s any a priori reason to think these methods would be of any use in a deeper/bigger/more modern network.