Meet Mish: New Activation function, possible successor to ReLU?

@morgan see sguggers comment here. Turns out we already have a full ranger! This should help with adapting QH :slight_smile:

Meet Ranger - RAdam + Lookahead optimizer

Perhaps you could implement the Quasi Hyperbolic Momentum? (Only part missing) and make it modular like RAdam and LookAhead? (Or see if your current version will stack together!)

2 Likes