Personal DL box


(RobG) #126

I have 2x16GB DDR4 3000MHz RAM. It’s one area I wish I had more, but it costs. I will soon be adding a second GPU and hope to avoid buying more memory, but that is more hope than expectation. For all the DL-box threads online, many don’t emphasise the non-GPU aspects of memory and disk size and speed required.


(Andrea de Luca) #127

I asked because if you are able to load the entire dataset in ram (it’s done automatically if you use a bcolz array, for example), disk speed doesn’t matter anymore.

However, consider that for DL tasks, ram speed doesn’t matter: go for the cheapest.


(Thomas Chambon) #128

Currently waiting to buy my DL box.
I don’t know what is better, get one RTX2080 (850€) or one GTX1080 (~500€) knowing that for 150€ more than one RTX2080, I could buy 2 GTX1080 (is working with 2 GPU a real benefit to play with deep learning?).
RTX2070 could be a good bet but still no release date.


#129

I second that approach (waiting :slight_smile: ) and question. one difference will be in memory size, which may be important for large data sets, but that difference is not huge…

possibly prices of 1080s will drop when the 2080s are released…


#130

Hi @radek,

I built my own DL machine, to setup the course requirement I’m using your scripts I see that you are using cuda9. I’ve installed CUDA 10 as part of my installation. Is it required to have cuda 9 instead of cuda 10? Also I see that cuda-tool-kit 9 is being isntalled as part of conda installation.

cudatoolkit-9. 100% |##############################################################################################| Time: 0:03:46 1.63 MB/s

If I have to change it to work for CUDA 10, where should I change to install the cudatoolkit-10? Infact I already have the cudatoolkit-10 installed. Do you see any problems with this setup?


#131

You can install both Cuda 10 and 9.x on the same system. See writeup here:

With that said, currently it is more straightforward to build PyTorch with Cuda 9.2 than 10.0


#132

Thank you @redturtle

Here is my simple 5 step guide.

Hope somebody will find it useful.


#133

Hi @redturtle @radek,

I wanted to follow the steps so, instead of running your script, I ran most of the instructions manually to get to know what all the dependencies is being resolved.

In the script I see that instead of doing “conda env update” you are suggesting to install pytorch by git clone. I followed the same procedure, everything works fine.

Now if I got to fastai’s git root directory and try to run the “conda evn update” it trys to install the pytorch again.

 conda env update
Solving environment: done

Downloading and Extracting Packages
pytorch-0.3.1        | 486.5 MB  | 1                                                                                                   |   0% pytorch-0.3.1        | 486.5 MB  | 1                                                                                                   |   0% 

How to inform the anaconda to stop looking for pytorch? I tried multiple times but it takes really long to download and most of the times http connection gets timedout and hence couldn’t install pytorch using “conda install pytorch” or “conda env update” .

Now that I’ve the working pytorch, I don’t want anaconda to look for pytorch again. How to disable that?


#134

@redturtle,
I didn’t face any problem what so ever, while building pytorch using cuda 10.0


#135

I should have worded this as “you will have to build PyTorch from source if you use cuda 10.0, whereas you can use the stable conda package with cuda 9.2.”

I also didn’t have any issues building PyTorch 1.0 with cuda 10.0, but using conda with 9.2 is easier and probably ok unless there are features in 10.0 that you need (eg tensor cores for turing).


#136

Thank you for your reply. Now I get that. Any inputs on how to inform/configure conda to use installed pytorch rather than trying to download one, when I do “conda env update” ?


#137

Hi All,

I got my system up and running. Here is my blog post which may help others about how to setup the machine, my h/w details. How to access it from outside world.