Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes: https://arxiv.org/pdf/1807.11205v1.pdf
What do you think is the most impressive thing about this work?
- the insane training speed for ImageNet, 4 minutes!
- the ability to get their hands on 2048 GPUs simultaneously?
- something else?