I am trying to use an Azure NC6 machine and training is pretty slow (800+ seconds for the lesson 1 notebook).
I’ve moved to an NC12 (two GPUs) and it is still very slow, I guess because it only uses one GPU anyway: when I run nvidia-smi, only one GPU shows any activity (Volatile GPU-Util).
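For anyone who wants to repeat the nvidia-smi check from inside a notebook, here is a minimal sketch that asks nvidia-smi for per-GPU utilization in CSV form and parses it. The helper names (`parse_gpu_csv`, `query_gpus`) are just made up for illustration, and the sample values in the usage note below are invented, not measurements from my machines.

```python
import csv
import io
import shutil
import subprocess

# Fields requested from nvidia-smi, in this order.
QUERY = "index,name,utilization.gpu,memory.used"


def _to_int(value):
    """Return an int, or None for non-numeric fields such as '[N/A]'."""
    try:
        return int(value)
    except ValueError:
        return None


def parse_gpu_csv(text):
    """Parse `--format=csv,noheader,nounits` output into a list of dicts."""
    rows = []
    for fields in csv.reader(io.StringIO(text), skipinitialspace=True):
        if len(fields) != 4:
            continue  # skip blank or unexpected lines
        index, name, util, mem = (f.strip() for f in fields)
        rows.append({
            "index": _to_int(index),
            "name": name,
            "util_pct": _to_int(util),
            "mem_mib": _to_int(mem),
        })
    return rows


def query_gpus():
    """Run nvidia-smi once; return [] if it is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=" + QUERY,
             "--format=csv,noheader,nounits"],
            stdout=subprocess.PIPE, universal_newlines=True, check=True,
        )
    except subprocess.CalledProcessError:
        return []
    return parse_gpu_csv(result.stdout)


if __name__ == "__main__":
    for gpu in query_gpus():
        print(gpu)
```

Running this while the model is fitting should show whether the second K80 sits near 0% utilization. As a made-up example, `parse_gpu_csv("0, Tesla K80, 97, 10954\n1, Tesla K80, 0, 2\n")` returns two dicts, the second with `util_pct` equal to 0.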
Both NC6 and NC12 use K80 GPUs. Here are the details on these machines.
I am using Python 3.6 and Keras 2, as described here.
I am using TensorFlow as the backend, with keras.json set to:
I also tried the Theano backend, but I still get around 800 seconds elapsed time.
I see people getting ~200 seconds for fitting, while on these “strong” servers I get about 4x slower performance.
Any suggestions? Maybe @dradientgescent, who reported 229 seconds, has an idea.
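Since I switched backends, one thing worth ruling out is that Keras is still picking up the old one. Below is a rough sketch of how to check which backend will take effect, assuming the usual Keras 2 behaviour: the `KERAS_BACKEND` environment variable wins over the `"backend"` key in `~/.keras/keras.json`, and TensorFlow is the default. The function name `effective_backend` is hypothetical, and this only mirrors the lookup order, it does not call Keras itself.

```python
import json
import os


def effective_backend(config_text, env):
    """Guess the backend Keras 2 would select.

    config_text: contents of keras.json ('' if the file does not exist).
    env: a mapping such as os.environ.
    """
    # Assumption: a non-empty KERAS_BACKEND overrides keras.json.
    if env.get("KERAS_BACKEND"):
        return env["KERAS_BACKEND"]
    config = json.loads(config_text) if config_text.strip() else {}
    # Assumption: Keras 2 falls back to TensorFlow when nothing is set.
    return config.get("backend", "tensorflow")


if __name__ == "__main__":
    # Default location of the config file; adjust if yours lives elsewhere.
    path = os.path.join(os.path.expanduser("~"), ".keras", "keras.json")
    text = ""
    if os.path.exists(path):
        with open(path) as handle:
            text = handle.read()
    print("backend:", effective_backend(text, os.environ))
```

Keras also prints a "Using ... backend." line when it is first imported, so that message should agree with what this prints.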