Training ImageNet in 4 Minutes: what do you think?


(urmas pitsi) #1

Highly Scalable Deep Learning Training System with
Mixed-Precision: Training ImageNet in Four Minutes: ‘https://arxiv.org/pdf/1807.11205v1.pdf

What do you think is the most impressive in this work?

  1. insane training speed for Imagenet, 4min!
  2. ability to get hands on 2048GPU-s simultaneously? :slight_smile:
  3. something else?

(Matthijs) #2

When training AlexNet with 95 epochs, our system can achieve 58.7% top-1 test accuracy within 4 minutes, which also outperforms all other existing systems.

I can train AlexNet in 10 seconds (to 0% test accuracy). :wink: