Meet Mish: New Activation function, possible successor to ReLU?

I have not looked at it yet myself; I just briefly looked at the optimizer notebook to see what's changed. @LessW2020?

I tried it out briefly but was getting a couple of errors, I’d need to spend a while looking at it to figure out what was going wrong. Maybe after the RSNA kaggle comp :slight_smile:

@morgan take a look at the optimizer notebook. It seemed to me like the new optimizers need to be ported over to the new PyTorch to use them directly (or redone the way fastai expects them).

1 Like

Hey all,

Here is a rough first draft of a working RangerQH port from @LessW2020 for fastai_v2.

The main difference I could see between the original code and what the fastai v2 Optimizer wanted was the need to split the logic into updates to the State and then the actual step function.

stats
Updates to elements in the State happen by passing functions to stats in the main RangerQH optimizer function. Updates to the State happen sequentially in order of the functions passed.

steppers
Updates to the parameters themselves happen via steppers, which again is a list of functions, executed sequentially on your parameters p.
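
To make the pattern concrete, here's a toy sketch of the shape of it (this is not the actual fastai v2 Optimizer or my RangerQH code; the stat/stepper here are just a made-up momentum-SGD example):

```python
import torch

# Toy sketch of the stats/steppers split (not the actual fastai v2 Optimizer code).
# `avg_grad_stat` and `sgd_with_mom_step` are made-up names for illustration.

def avg_grad_stat(state, p, mom=0.9, **kwargs):
    "stat: update the running average of gradients kept in `state`"
    if 'avg_grad' not in state: state['avg_grad'] = torch.zeros_like(p.grad)
    state['avg_grad'].mul_(mom).add_(p.grad)
    return state

def sgd_with_mom_step(p, state, lr=1e-2, **kwargs):
    "stepper: update the parameter `p` in place using the state built by the stats"
    p.data -= lr * state['avg_grad']
    return p

class ToyOptimizer:
    "Run all stats (state updates) in order, then all steppers (parameter updates) in order."
    def __init__(self, params, steppers, stats, **defaults):
        self.params, self.steppers, self.stats, self.defaults = list(params), steppers, stats, defaults
        self.state = {p: {} for p in self.params}

    def step(self):
        for p in self.params:
            if p.grad is None: continue
            for stat in self.stats: self.state[p] = stat(self.state[p], p, **self.defaults)
            for stepper in self.steppers: stepper(p, self.state[p], **self.defaults)

    def zero_grad(self):
        for p in self.params:
            if p.grad is not None: p.grad.detach_(); p.grad.zero_()

# usage: ToyOptimizer(model.parameters(), steppers=[sgd_with_mom_step], stats=[avg_grad_stat], lr=0.1)
```

The real Optimizer handles hyperparameter groups and scheduling on top of this, but the split is the same: stats only touch the State, steppers only touch the parameters.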

This was my first go at playing around with the innards of an optimizer so there is plenty to improve in my code I’m sure, happy to hear any suggestions around logic and naming conventions in particular :smiley:

One thing that's probably worth doing is splitting the Lookahead step out from rangerqh_step into its own stepper, but I haven't had time today.

Also, I have just tested on the MNIST logistic classifier net from the "What is torch.nn Really" tutorial, where it consistently beat SGD. I haven't used it on more advanced architectures yet, so there might be a little more work to do.

4 Likes

Trialling it out with EfficientNet-b2, but the lr_finder is giving a much higher loss than usual; I'm used to getting closer to 0.07 at the minimum…

@morgan see sgugger's comment here. Turns out we already have a full Ranger! This should help with adapting QH :slight_smile:

Meet Ranger - RAdam + Lookahead optimizer

Perhaps you could implement the Quasi-Hyperbolic Momentum (the only part missing) and make it modular like RAdam and LookAhead? (Or see if your current version will stack together!)
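
For reference, the QHM update itself is tiny. In the stats/steppers style from the post above it could look something like this (a rough, untested sketch; names are illustrative and the defaults are roughly the ones suggested in the QH paper):

```python
import torch

def qhm_stat(state, p, beta=0.999, **kwargs):
    "stat: exponential moving average of the gradients (the QHM momentum buffer)"
    if 'qh_buf' not in state: state['qh_buf'] = torch.zeros_like(p.grad)
    state['qh_buf'].mul_(beta).add_(p.grad, alpha=1 - beta)
    return state

def qhm_step(p, state, lr=1e-3, nu=0.7, **kwargs):
    "stepper: move along an interpolation of the raw gradient and the momentum buffer"
    p.data -= lr * ((1 - nu) * p.grad + nu * state['qh_buf'])
    return p
```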

2 Likes

Haha nice, let's see what a proper implementation looks like :smiley: I'll see if I can figure out a good way to add QH.

1 Like

Fit_fc is now in there too as fit_flat_cos (thanks sgugger!)

I believe (if I read it right) you can do: Learner.fit_flat_cos
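
i.e. something like this (untested, and the exact arguments may vary between fastai v2 versions):

```python
# assuming `learn` is an existing fastai v2 Learner
learn.fit_flat_cos(5, lr=1e-3)   # flat LR for most of training, then a cosine anneal at the end
```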

I’ll port over a notebook to run them all on my v2 repo here in the next few days for ImageWoof :slight_smile:

Also just saw that the Simple Self Attention layer got added too :slight_smile:

@Diganta and mish! (Swish too) :wink:

4 Likes

I was trying to study the use of Mish in the case of transfer learning, i.e. only using Mish in the last FC layers. I tested a pretrained ResNet50 against ReLU on CIFAR10 and CIFAR100. Although the results are quite similar, there were marginal improvements when using Mish. The runs were only for 10 epochs and only one Mish activation function was used in the entire network. But the results did show promise, and I observed we can get some improvements just by changing from ReLU to Mish in the head of the model in the case of pretrained models.

Also, all the parameters were kept the same, so just by replacing ReLU I got some improvements. There was one problem though: Mish overfitted quickly when using a higher learning rate, which may be due to not having found the best parameters.
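
For anyone who wants to try the same setup, this is roughly the idea (a sketch rather than my exact code; the head layout and sizes here are just illustrative):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

class Mish(nn.Module):
    def forward(self, x):
        return x * torch.tanh(F.softplus(x))   # softplus rather than log(1 + exp(x)) for stability

# pretrained body keeps its ReLUs; only the new head uses Mish
model = models.resnet50(pretrained=True)
model.fc = nn.Sequential(
    nn.Linear(model.fc.in_features, 512),
    Mish(),
    nn.Linear(512, 10),   # 10 classes for CIFAR10, 100 for CIFAR100
)
```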

I also wrote a Medium post about it.

5 Likes

Can someone help me with this issue over here - https://github.com/AlexeyAB/darknet/issues/3994
Commit - https://github.com/AlexeyAB/darknet/commit/bf8ea4183dc265ac17f7c9d939dc815269f0a213
Mish was added to Darknet; however, in practical usage it is giving NaN. I would appreciate it if someone could point out whether there is any mistake in the implementation.
Thanks!

@Diganta and others, I have a question. I have seen this being discussed around and wanted to know which would be better: to use a pretrained ResNet with ReLU activations and only have Mish in our head, or to replace all activations with Mish and then load the pretrained model in? Thoughts?

2 Likes

That implementation is not at all numerically stable. All the exps quickly lead to overflow and hence NaN. It should be possible to adapt either the Eigen-based implementation from tensorflow contrib or my mostly pure C++ implementation (mostly, as it's using the PyTorch dispatch/templating but is otherwise standard C++). The TF one is probably slightly more stable given its handling of both underflow and overflow, but it will require more adaptation to remove the Eigen dependency.
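
You can see the failure mode in PyTorch with the naive formula (just a small illustration; the Darknet code is C/CUDA but the maths is the same):

```python
import torch
import torch.nn.functional as F

x = torch.tensor([100.0], requires_grad=True)

# naive formula: exp(100) overflows to inf and the backward pass turns that into nan
y = (x * torch.tanh(torch.log(1 + torch.exp(x)))).sum()
y.backward()
print(x.grad)   # tensor([nan])

x.grad = None
# F.softplus switches to the identity above a threshold, so both passes stay finite
y = (x * torch.tanh(F.softplus(x))).sum()
y.backward()
print(x.grad)   # tensor([1.])
```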

3 Likes

Not sure which would be better in that case. I did try replacing Swish with Mish using pre-trained weights and that seemed to work well from minimal tests. I didn’t do anything special in terms of freezing or LRs.
It may not work so well coming from ReLU, but I don't think it's that dissimilar in terms of general output range at least.
I gather the idea of the various initialisation procedures is to choose random initial weights that give similar activation ranges to those you’d find in a pretrained model. In which case presumably pretrained weights shouldn’t be any worse, only potentially better (given there is no special init handling for Mish).
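
For the full swap, something like this is all that's needed to replace the activations while keeping the pretrained weights (a quick sketch, only tried against standard torchvision-style models):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

class Mish(nn.Module):
    def forward(self, x): return x * torch.tanh(F.softplus(x))

def replace_relu_with_mish(module):
    "Recursively swap every nn.ReLU for Mish, leaving the pretrained weights untouched."
    for name, child in module.named_children():
        if isinstance(child, nn.ReLU): setattr(module, name, Mish())
        else: replace_relu_with_mish(child)

model = models.resnet50(pretrained=True)
replace_relu_with_mish(model)
```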

Perhaps a bit of extra care around initial training would also help, trying to ensure the previous learning is preserved rather than having a poor step early on change too much. Maybe training for a few iterations with everything but the batchnorms frozen (the default if you freeze the entire model in fastai). I wouldn't think you'd need much to update the BNs, probably not even a full epoch assuming a reasonable dataset size. Then maybe a lower LR for a little bit, again maybe even less than an epoch. You could do both of those by using a warmup schedule from 0 over the first epoch (at least if testing suggested it was likely to be worthwhile).
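
In fastai terms that might look something like this (the epoch counts and LRs are just placeholders to show the shape of it, assuming `learn` wraps the Mish-swapped model):

```python
# assuming `learn` is a fastai Learner wrapping the Mish-swapped model
learn.freeze()                                # by default this still leaves the batchnorm layers trainable
learn.fit_one_cycle(1, 1e-3)                  # short run mainly to let the BN stats/params adapt
learn.unfreeze()
learn.fit_one_cycle(1, slice(1e-5, 1e-4))     # then a little more with lower, discriminative LRs
```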

2 Likes

UPDATE:
Lost the results of my initial tests due to some JupyterLab issues (lost connection, then it went a bit feral). But I got some information from the partial results and have re-run a more selective set of tests, just testing various settings with 1-cycle rather than the schedulers. There is some promise with the schedulers, but they need more work as I think you really want differential learning rates across layer groups, and I'm not sure that will work with the schedulers.
I’ve updated the notebook (note it won’t run and produce those results but all the code for them is there). Tables and graphs at the end show main results.
Not shown in these results is the very poor performance if you don't first train the classifier with the body frozen. I initially missed this, and while ReLU also suffered quite a lot, it really affected the cases with Mish.

I tried both replacing the activations with Mish across all layers from the start (mish_all in the results) and a staged approach (mish_stg) of first replacing Mish in the classifier and just training that, then replacing Mish in the body when you unfreeze and fine-tune. Results here are a bit mixed, with probably a slight overall advantage to the non-staged approach. But I think this is largely because I was comparing everything across just 10 epochs, unfreezing at epoch 5, so the staged approach gives a lot less time for the body to adapt.
Differential learning rates seemed to help quite a bit, so I compare a couple of settings there. Quite low rates on the initial layers do seem to help with the adaptation, but there's a bit of a trade-off as this gives less overall learning. Again, this is probably slanted given the short testing time; with more epochs I think these might lead to better final performance.
Tests with a lower learning rate seem to suggest a lower rate can help avoid big drops when you switch, but, again likely due to the very limited time, they gave worse final results.

My overall view from testing would be that there does seem to be the possibility of getting pretty good results with Mish across the whole model, but it is a little sensitive to parameters, and it may not be possible to adapt it optimally with basic parameter settings.
I’d tend to think based on this that trying to replace Mish in the body while training a new task may not be worthwhile. Especially if you are only training for a limited time and/or don’t have a lot of training data so overfit is a concern.
But it seems like it may be worth trying to adapt existing weights to Mish. It seems like you may be able to come up with a fairly minimal training regime to create weights quite well adapted to Mish with much less work than full ImageNet training. Particularly promising here is that the best-performing all-Mish model outperformed the worst ReLU model, even though all the settings were fairly reasonable, so the gap isn't that big.

One thing that might be interesting to try is progressively replacing the activations in the body. So replacing the final activation and just unfreezing and training layers after it. Then the next to last activation and so on. This would need a bit of training time but I don’t think that much. I’d guess you might get good results with just a few hundred batches per step (the epochs there being ~200 batches). Not really appropriate for normal use, but may produce some nice weights with a lot less work than training from scratch.
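
Roughly the shape of what I mean, as an untested sketch (`train_a_little` is a placeholder for whatever short training loop you'd use, and the ordering assumes module registration order matches forward order, as it does for the torchvision ResNets):

```python
import torch.nn as nn

def relu_locations(model):
    "Collect (parent, attribute_name) for every nn.ReLU, in registration (roughly forward) order."
    locs = []
    for _, module in model.named_modules():
        for name, child in module.named_children():
            if isinstance(child, nn.ReLU): locs.append((module, name))
    return locs

def progressively_mishify(model, mish_cls, train_a_little):
    "Swap ReLUs one at a time from the end of the network, training briefly after each swap."
    for parent, name in reversed(relu_locations(model)):
        setattr(parent, name, mish_cls())
        train_a_little(model)   # placeholder: e.g. a few hundred batches with earlier layers frozen
```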

5 Likes

There’s an issue in your Repository. Someone from the YoloV3 team needed some clarification with how to integrate Mish CUDA in their script. Letting you know in case if you haven’t checked. Thanks!

@TomB also can you please take a look at this comment:

Also, as you can see, there are 3 different Mish implementations, even the forward Mish functions are different, so we can't convert models between TF (2 thresholds) <-> PyTorch (1 threshold) <-> MXNet (0 thresholds).

Link to discussion - https://github.com/AlexeyAB/darknet/issues/3994

Couldn’t actually find that comment from a scan of that quite long thread. So not sure if/how they resolved it.
The differences between PyTorch and TF reflect slight differences in their implementations of softplus. The single threshold in my CUDA version reflects the PyTorch logic. I don't think the differences are big enough that there's any strong reason to use the same implementation, so I think you could just as well use the TF logic for Mish in PyTorch. They both just come from borrowing the relevant softplus implementation.
I'm not sure the differences make a real impact, and I don't think they would prevent converting models, at least not between TF and PyTorch. As noted, this would also potentially apply to any model using softplus.
If there is indeed no threshold in MXNet then that may cause issues. But this also depends on other details: there may be other handling of non-finite values that would mitigate issues, and it also depends on the datatypes used. In general this is mostly an issue for 16-bit floats. Though I think I did see some issues with 32-bit floats, that was with the quite unstable calculation involving multiple exponentials rather than the symbolically derived gradient calculation.
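
To show what I mean by the threshold differences, roughly (going from memory of the two implementations, so treat the exact cutoffs as illustrative):

```python
import math
import torch

def softplus_pytorch_style(x, threshold=20.0):
    # one threshold: above it, softplus(x) is just x (this is what F.softplus does internally)
    return torch.where(x > threshold, x, torch.log1p(torch.exp(x.clamp(max=threshold))))

def softplus_tf_style(x):
    # two thresholds: large x -> x, very negative x -> exp(x), otherwise log1p(exp(x))
    t = -(math.log(torch.finfo(x.dtype).eps) + 2.0)   # ~13.9 for float32
    xe = torch.exp(x.clamp(max=t))                    # clamp just to keep the unused branch finite
    return torch.where(x > t, x, torch.where(x < -t, xe, torch.log1p(xe)))
```

As far as I can tell the two only differ at around floating-point precision; the thresholds are there to keep the intermediate exp finite, not to change the function.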

Oh and I’ve responded to that post.
I’d also note that you pointed to the Autograd implementation which should reduce memory usage but will result in lower performance. The JIT version combines both the lower memory usage and better performance so should generally be preferred.
The one issue is support in older PyTorch versions. It should be fine in PyTorch 1.2 and 1.3 (though I’ve mostly tested in 1.3). I think it should probably also work in 1.1 and maybe even 1.0 in which case it should always be fine as I can’t imagine you’d want to support pre-1.0 anymore.
But the JIT version should probably be preferred unless older support is key. I’d also note that I don’t think my CUDA version will work pre-1.2 so the JIT version should offer equivalent performance and version support. I just need to run a few extra tests on the JIT version and then will likely update the repo to indicate the JIT version should be preferred.
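
For anyone curious, the JIT version is basically this pattern (a simplified sketch rather than the exact code in the repo): only the input is saved for backward and the intermediates are recomputed, with the scripted functions letting PyTorch fuse what it can.

```python
import torch
import torch.nn.functional as F

@torch.jit.script
def mish_fwd(x):
    return x.mul(torch.tanh(F.softplus(x)))

@torch.jit.script
def mish_bwd(x, grad_out):
    sp = F.softplus(x)
    tsp = torch.tanh(sp)
    # d/dx [x * tanh(softplus(x))] = tanh(sp) + x * (1 - tanh(sp)^2) * sigmoid(x)
    return grad_out * (tsp + x * torch.sigmoid(x) * (1 - tsp * tsp))

class MishJit(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)        # only the input is kept, not the intermediates
        return mish_fwd(x)

    @staticmethod
    def backward(ctx, grad_out):
        x, = ctx.saved_tensors
        return mish_bwd(x, grad_out)    # intermediates recomputed here

mish = MishJit.apply
```

The trade-off is that backward recomputes softplus/tanh rather than saving them, which is where the memory saving comes from.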

1 Like

Can you explain in more detail why the Autograd version has this shortcoming?

Additionally, I noticed your implementation takes the Mish function to be x * tanh(ln(exp(x))) instead of x * tanh(ln(exp(x) + 1)), which is the original definition. The two are considerably different.

The Autograd version performs the same as the non-autograd version for the forward pass, as it still requires multiple CUDA kernel launches, while the backward pass is likely a bit slower due to the less efficient backward having to recalculate values. The JIT script reduces this somewhat by fusing multiple operations into a single kernel, thus increasing performance.
Actually, while Swish fully fuses into a single kernel launch and so performs about the same as my CUDA implementations of Mish/Swish (which are close to ReLU), the Mish JIT version does not fully fuse, as fusing is not supported for Softplus, so only the mul and tanh fuse. I did implement a fully fused version by manually implementing Softplus; however, this was slower than the partially fused version, at least in my initial tests. I'm not quite sure why, but my first guess would be that the where op I used to implement thresholding does not fuse well.
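
The manually fused variant was along these lines (a simplified sketch, not the exact code):

```python
import torch

@torch.jit.script
def mish_fused_softplus(x):
    # softplus inlined by hand so the whole expression is a single fusion candidate;
    # the where handles the overflow threshold, and may be what stops it fusing well
    sp = torch.where(x > 20.0, x, torch.log1p(torch.exp(x.clamp(max=20.0))))
    return x * torch.tanh(sp)
```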

Which implementation? I'm pretty sure my CUDA version is implementing the latter, and it's tested against x.mul(torch.tanh(F.softplus(x))).
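
If you want to check for yourself, comparing against that reference expression is straightforward (naive_mish here is just a stand-in for whichever implementation you want to test):

```python
import torch
import torch.nn.functional as F

def reference_mish(x):
    return x.mul(torch.tanh(F.softplus(x)))            # x * tanh(ln(1 + exp(x)))

def naive_mish(x):                                     # stand-in for the implementation under test
    return x * torch.tanh(torch.log1p(torch.exp(x)))

x = torch.randn(10000, dtype=torch.float64)
print(torch.allclose(naive_mish(x), reference_mish(x)))   # True for moderate inputs
```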