Meet Mish: New Activation function, possible successor to ReLU?

That’s exactly the paper I was thinking of :slight_smile:

Here’s two more related to that concept of HAN (heterogeneous activation networks). I think this is a space that might yield new innovation and results, along with leveraging scale inside the networks (ala res2net).

4 Likes