Deep Learning Generalisation Game

I saw this conversation on Twitter about whether sharp minima are actually bad for generalisation after all:

I built this game to explore the idea and develop my intuition. For some data distributions, you can see that sharp minima aren’t a problem at all! But it depends on the data, so see for yourselves: