my notebook shows this message under the 4th cell, the AWS instance is a p2.large configured with the script provided.
Am I missing some configuration?
Using Theano backend.
WARNING (theano.sandbox.cuda): CUDA is installed, but device gpu is not available (error: Unable to get the number of gpus available: no CUDA-capable device is detected)
(I donât know much about GPUs, but that command should give you some information about the recognized nvidia card, and might give you a helpful error message if thereâs something wrong.)
yes, there seems to be something wrong, the output is below:
ubuntu@ip-10-0-0-10:~/as/repos/DL/nbs/data/redux$ nvidia-smi
Failed to initialize NVML: Driver/library version mismatch
Glad itâs working now! I got that error last night, and then it worked fine in the morning for me (I stopped and re-started my instance in the meantime)
Generally speaking you should find that âsudo modprobe nvidiaâ fixes most problems that would otherwise need a reboot. Just a little shortcut - nothing wrong with rebooting, of course.
Just to add - I created a new p2 instance (Ireland) and nvidia-smi failed with Failed to initialize NVML: Driver/library version mismatch. However, the suggested simple fix sudo modprobe nvidia did not help. Rebooting the instance fixed the problems.
As this was rather confusing, and given that the ~tutorial~ setup video otherwise just âworked perfectlyâ, I hope this thread helps other newcomers get around this issue.