Do you know where the source code for nvidia-ml-py3-7.352.0-py_0 is? I would like to send a PR for an NVML loading bug in that library.
The issue is that on my machine nvml.dll is in $env:WinDir\system32 (installed by the NVIDIA installers). nvidia-ml-py3-7.352.0-py_0 should ideally search in both locations.
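To make the "search in both locations" idea concrete, here is a minimal sketch of a lookup that tries several candidate paths in order. The function names are hypothetical, and the Program Files\NVIDIA Corporation\NVSMI directory is my assumption about where the library looks by default; only the System32 location comes from this post. It uses ntpath so the paths are built with Windows rules on any OS.

```python
import ntpath
import os


def candidate_nvml_paths(env=None):
    """Return the places nvml.dll might live, in search order (hypothetical helper)."""
    env = os.environ if env is None else env
    program_files = env.get("ProgramFiles", r"C:\Program Files")
    win_dir = env.get("WinDir", r"C:\Windows")
    return [
        # Assumption: the location the library checks by default.
        ntpath.join(program_files, "NVIDIA Corporation", "NVSMI", "nvml.dll"),
        # Location reported in this thread for driver-installed copies.
        ntpath.join(win_dir, "System32", "nvml.dll"),
    ]


def find_nvml_dll(paths, exists=os.path.isfile):
    """Return the first candidate that exists, or None if none do."""
    for path in paths:
        if exists(path):
            return path
    return None
```

On a Windows machine, find_nvml_dll(candidate_nvml_paths()) would return whichever copy is present first, and None means the caller should raise the usual "library not found" error.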
The etiquette is to wait until someone replies. And only ping a specific person if you have a prior connection there. Imagine if every user here were to ping everybody they wish every time they have an issue. And there are thousands of users on this forum.
Also, if you find a bug, as in this situation, filing an issue (Issues · fastai/fastai · GitHub) will get it the fastest attention (and if it's not a bug you will be asked to go back to the forums, so please don't abuse that feature).
Also, you will get the fastest response if you find the correct thread to post in, e.g. in this particular case the install thread I linked to, which I monitor closely. Of course, please don't post there unless it's install related.
Does it work better if you remove nvidia-ml-py3 and install this version instead:
Excellent. Thank you, and once accepted please ask the maintainer to make a new release
However the package is from fastai channel. Once the PR is merged - what is the process to publish the package to the fastai channel?
I will make a new build. Just let me know when it's ready.
Also, since a few people have reported this problem by now, I split off the python workaround into its own module, so that a normal user won't need to have a working nvml to use fastai. So, please install the git version of fastai and let me know whether this removed the issue.
And then, if you'd like to contribute, and it sounds like you have the right know-how, please help me sort out pynvml on win10, so that you can use all the gpu profiling functions there (fastai.utils.mem). We have just sorted it out on OSX, so windows is the only area left to sort out. And perhaps your patch is all that's needed.
Also, if you get a chance to try this one on win10 that would be great, so that we'd know whether we have an alternative that works on windows. I only work on linux, so I rely on others for feedback on other OSes.
In any case that library has the same issue. So I sent the PR for the fix.
#2: I split off the python workaround in its own module, so that a normal user won't need to have a working nvml to use fastai. So, please install the git version of fastai and let me know whether this removed the issue.
I did the above and can confirm the following:
Just importing fastai.vision doesn't give the "unable to load nvml.dll" error
Importing fastai.utils.mem does give the error
With my fix (monkey patched at site-packages), running gpu_mem_get_all() gives the same result as nvidia-smi.exe
In short with my patch, both your objectives are achieved on Windows 10.
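For anyone who wants to repeat the comparison against nvidia-smi, here is a rough sketch that reads per-GPU memory figures from nvidia-smi's CSV query output. The helper names are hypothetical; the --query-gpu and --format flags are standard nvidia-smi options. With nounits the values are in MiB. Comparing against gpu_mem_get_all() is left out, since the shape of its return value isn't shown in this thread.

```python
import subprocess

# Ask nvidia-smi for total/used/free memory per GPU as bare CSV numbers (MiB).
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.total,memory.used,memory.free",
    "--format=csv,noheader,nounits",
]


def parse_smi_memory(csv_text):
    """Turn lines like '8192, 1024, 7168' into one dict per GPU."""
    rows = []
    for line in csv_text.strip().splitlines():
        total, used, free = (int(field) for field in line.split(","))
        rows.append({"total": total, "used": used, "free": free})
    return rows


def query_smi_memory():
    """Run nvidia-smi (must be on PATH) and parse its output."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_smi_memory(result.stdout)
```

Calling query_smi_memory() on a machine with a working driver returns one entry per GPU, which can then be checked by eye against what fastai reports.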
#3: Once nvidia-ml-py3 PR accepted please ask the maintainer to make a new release. I will make a new build. Just let me know when itâs ready.
I sent the PR. Though the maintainer is active on github, nvidia-ml-py3 itself was last updated ~2 years back, so I am not sure when this will get merged.
We can certainly wait a bit. However, since fastai v1 has a hard dependency on nvidia-ml-py3 and we are releasing nvidia-ml-py3 through the fastai conda channel, it would be prudent to pull the above into the fastai org and release it from there.
Loving your proficiency and communication style, @partho!
Thank you for submitting the PRs and verifying that the fastai core w/o attempting to use memory profiling functions works just fine on win10. That gives us some breathing space.
The problem is simple: it needs to work on pypi and conda, so if we just release the improved conda package, there will be a mismatch (we can't upload an alternative on pypi).
So we can wait a bit and see whether (1) the PR gets accepted and (2) a new release is made. A hotfix release would be fine.
Here is an alternative solution until things get sorted out upstream: go into fastai/utils/pynvml_gate.py, monkey patch nvml there, and PR it. That sounds like it'd be the fastest way to give users the best experience. And once upstream makes a new pypi release, we build a new conda package and remove the monkey patch.
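A rough sketch of what such a monkey patch could look like, not the actual contents of pynvml_gate.py. It assumes the module being patched exposes a loader function named _LoadNvmlLibrary and a module-level handle named nvmlLib; check both against the installed pynvml before relying on this. install_loader_patch, find_dll and load_library are hypothetical names, and a SimpleNamespace stands in for the real module so the snippet runs without a GPU.

```python
import types


def install_loader_patch(module, find_dll, load_library):
    """Replace module._LoadNvmlLibrary with a version that tries find_dll() first.

    Returns the original loader so the patch can be undone later.
    """
    original = module._LoadNvmlLibrary

    def patched():
        if getattr(module, "nvmlLib", None) is not None:
            return  # library already loaded, nothing to do
        path = find_dll()
        if path is None:
            return original()  # fall back to the unpatched search
        module.nvmlLib = load_library(path)

    module._LoadNvmlLibrary = patched
    return original


# Stand-in for the real module (assumption: same attribute names).
fake_pynvml = types.SimpleNamespace(nvmlLib=None, _LoadNvmlLibrary=lambda: None)
install_loader_patch(
    fake_pynvml,
    find_dll=lambda: r"C:\Windows\System32\nvml.dll",
    load_library=lambda path: "handle for " + path,  # real code would use ctypes.CDLL
)
fake_pynvml._LoadNvmlLibrary()
```

Keeping the original loader around means removing the patch later is a one-line assignment, which fits the plan of dropping it once upstream ships a release.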
If the original upstream fix isn't happening after some time, but its py3nvml fork does integrate your fix and makes a new release, we will probably just switch to that version, since that would mean it's actively maintained and a better version to rely on.
On it. For now let's go with your idea of patching the pynvml shim we have in fastai.
Also, feel free to send all Win10 issues and testing my way. That's my primary environment, and it is highly underrated IMHO for DL purposes.
I'll also send across a PR for Win10 "Server setup" & "Returning to work" guides, and if you are OK with that you can accept it for the course-v3 documentation.
Oh, I'm sure Jeremy or Sylvain will take care of it, since they both are at home on windows.
Hi,
you asked in the other thread where the dll is located. It is indeed in the System32 directory in my case so your proposed PR should fix the issue (Windows 10). My rather simple workaround was copying the DLL to the directory the original code is searching in.