A Recipe for Training Neural Networks

PierreO · April 25, 2019, 7:21pm

Hi everyone!

Andrej Karpathy, who created the now-famous cs231n class at Stanford and is now Director of AI at Tesla, just published a blog post on how to train neural networks. I found it very interesting, so I thought I’d share it here. Enjoy!

Also, while I’m talking about Tesla: they held an “Autonomy Day” a few days ago, which covered how they design every aspect of their Autopilot hardware and software. That was very interesting too.

LessW2020 · April 25, 2019, 8:17pm

Good read, thanks for posting!

chho6822 · April 26, 2019, 3:41am

Welcome to the real world! Thanks for sharing!

AlisonDavey · April 26, 2019, 6:00am

Thanks. It’s a great read, written in accessible language. Karpathy stresses the importance of visualising what is happening during training, which reminded me of Lesson 10.

cedric · April 26, 2019, 11:13am

Thanks for sharing.

“… suffering is a perfectly natural part of getting a neural network to work well, but it can be mitigated by being thorough, defensive, paranoid, and obsessed with visualizations of basically every possible thing.”

I enjoyed the “Neural Network Essentials” part from Karpathy’s Tesla “Autonomy Day” presentation.

Lots of practical advice.

suvash · April 27, 2019, 5:07pm

Finally had some time to watch it. This was def. well worth a watch.