Hi @kzuiderveld, I'm trying on the same nvidia 1080-ti gpu but my performance seems to be significantly slow even though gpu utilization is 100%
Below is the screenshot after running your code
The time it takes is 2190s compared to your 111s.
Below are my specs
anaconda for python 2.7
nvidia geforce gtx 1080-ti
dell precision tower 7910
Thread(s) per core: 2
Core(s) per socket: 10
Below is my .theanorc
root = /usr/local/cuda-8.0/include
I'm also getting this below exception when I import theano
Using Theano backend.
WARNING (theano.sandbox.cuda): The cuda backend is deprecated and will be removed in the next release (v0.10). Please switch to the gpuarray backend. You can get more information about how to switch at this URL:
Using gpu device 0: Graphics Device (CNMeM is disabled, cuDNN Mixed dnn version. The header is from one version, but we link with a different version (5110, 5105))
Is this some kind of CPU bottleneck issue
Has anyone faced this issue ?
Thanks in advance