Thank you for your reply and your patience.
Unfortunately, I only have a /usr/local/cuda-10.1/lib64
directory instead of a /usr/local/cuda-10.0/lib64
.
Therefore, I tried to set it up with using CUDA-10.1
echo /usr/local/cuda-10.1/lib64 | sudo tee -a /etc/ld.so.conf
sudo ldconfig
echo 'export PATH=/usr/local/cuda-10.1/bin:$PATH' >> ~/.bashrc
source ~/.bashrc
but still no luck.
My /etc/ld.so.conf
looks like this:
include /etc/ld.so.conf.d/*.conf
/usr/local/cuda-10.1/lib64
I thought with a working CUDA and cudnn installation I can directly start with the S4TF installation, but this seems to be not enough.
But running swift
in bash still results in the error:
swift: error while loading shared libraries: libcusparse.so.10.0: cannot open shared object file: No such file or directory
Therefore, I explored the output of python register.py --sys-prefix --swift-python-use-conda --use-conda-shared-libs --swift-toolchain ~/swift
:
kernel.json:
{
"argv": [
"/home/user/anaconda3/envs/swift/bin/python",
"/home/user/Documents/swift-jupyter/parent_kernel.py",
"-f",
"{connection_file}"
],
"display_name": "Swift",
"language": "swift",
"env": {
"PYTHONPATH": "/home/user/swift/usr/lib/python3.6/site-packages",
"LD_LIBRARY_PATH": "/home/user/swift/usr/lib/swift/linux:/home/user/anaconda3/envs/swift/lib",
"REPL_SWIFT_PATH": "/home/user/swift/usr/bin/repl_swift",
"SWIFT_BUILD_PATH": "/home/user/swift/usr/bin/swift-build",
"SWIFT_PACKAGE_PATH": "/home/user/swift/usr/bin/swift-package",
"PYTHON_LIBRARY": "/home/user/anaconda3/envs/swift/lib/libpython3.6m.so"
}
}
Registered kernel 'Swift' as 'swift'!
The 2nd LD_LIBRARY_PATH
points to the directory /home/user/anaconda3/envs/swift/lib
that includes:
/home/user/anaconda3/envs/swift/lib/libcusparse.so
/home/user/anaconda3/envs/swift/lib/libcusparse.so.9.2
/home/user/anaconda3/envs/swift/lib/libcusparse.so.9.2.148
Maybe that here is no libcusparse.so.10.0
is the problem?
With find / -name "libcublas.so.10.0"
I was able to located the file in several locations:
/home/user/anaconda3/pkgs/cudatoolkit-10.0.130-0/lib/libcublas.so.10.0
/home/user/anaconda3/envs/fastai-pytorch-nightly/lib/libcublas.so.10.0
/home/user/anaconda3/envs/fastai/lib/libcublas.so.10.0
/home/user/anaconda3/envs/fastai-dev/lib/libcublas.so.10.0
Then I tried to pass /home/user/anaconda3/pkgs/cudatoolkit-10.0.130-0/lib/
as the --use-conda-shared-libs
parameter in python register.py --sys-prefix --swift-python-use-conda --use-conda-shared-libs --swift-toolchain ~/swift
but I was not successful.
I am not sure how the syntax should look like to hand over this argument. Maybe somebody can point me in the right direction?
PS:
I am not sure if this part is also needed from the installation steps from the first post:
cd ~/download
wget http://files.fast.ai/files/cudnn-10.0-linux-x64-v7.5.0.56.tgz
tar xf cudnn-10.0-linux-x64-v7.5.0.56.tgz
sudo cp cuda/include/cudnn.h /usr/local/cuda/include
sudo cp -P cuda/lib64/libcudnn* /usr/local/cuda/lib64
sudo chmod a+r /usr/local/cuda/include/cudnn.h /usr/local/cuda/lib64/libcudnn*
sudo ldconfig
However, I am afraid that this is maybe breaking my current CUDA setup?