Platform: GCP ✅

sderuiter · November 16, 2018, 12:31pm

Thank you. Can you link to an install guide, or something similar?
My best guess now is to delete my instance and just start over (using the fast.ai GCP tutorial).

arunoda · November 16, 2018, 12:55pm

I use this: https://github.com/arunoda/fastai-shell

sderuiter · November 16, 2018, 1:33pm

Thanks, that works wonders. A question on this: where do you store your own work? On my other instance I stored it in the same dl1 folder, but that messes up the git pull command (I need to stash and then stash pop to get git pull working).

avatar · November 16, 2018, 4:50pm

I have not been able to start my GPU machine on GCP for the past couple of days.
It says “quota exceeded globally”. I have started the machine without a GPU to do some data pre-processing, but I did not do any model training.

Is anyone in the same situation as me?
Should I move to AWS?

arunoda · November 16, 2018, 4:56pm

You need to increase your GPU quota and upgrade your account.
Search this thread for more info.

arunoda · November 16, 2018, 4:57pm

You can save anywhere inside your instance.
But I recommend doing it in a different directory and pushing your changes to GitHub.
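
A minimal sketch of that layout, assuming the course repo is checked out at ~/course-v3 and using a hypothetical ~/my-work directory for your own notebooks (both paths are placeholders, so adjust them to your instance). Keeping your work in its own repository means git pull in the course repo never sees your changes, so no stash is needed:

```shell
# Assumed locations; adjust both to match your instance.
COURSE_DIR="${COURSE_DIR:-$HOME/course-v3}"   # assumed path of the course checkout
WORK_DIR="${WORK_DIR:-$HOME/my-work}"         # hypothetical directory for your own work

# Create the work directory outside the course repo and make it its own git repo.
mkdir -p "$WORK_DIR"
git -C "$WORK_DIR" init

# Copy a course notebook to experiment on, leaving the original untouched.
if [ -f "$COURSE_DIR/nbs/dl1/lesson1-pets.ipynb" ]; then
  cp "$COURSE_DIR/nbs/dl1/lesson1-pets.ipynb" "$WORK_DIR/"
fi

# To back the work up, commit it and push to your own GitHub repository:
#   git -C "$WORK_DIR" add . && git -C "$WORK_DIR" commit -m "my notebooks"
#   git -C "$WORK_DIR" remote add origin <your-repo-url>
#   git -C "$WORK_DIR" push -u origin master
```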

sderuiter · November 19, 2018, 9:31am

Question: sgugger has committed a fix to the master branch, that is currently not in the latest (tagged) release. As of writing, 1.0.27 is the latest tag, which I have, using the update_fastai.sh script.

Do you know of a quick way for me to download the master branch, while still maintaining the possibility to use update_fastai.sh if needed?

arunoda · November 19, 2018, 10:29am

Simply do this: https://github.com/fastai/fastai#developer-install

Do this after running update-fastai.sh (which updates PyTorch).

sderuiter · November 19, 2018, 11:45am

Perfect! After doing this, version shows 1.0.28.dev.
What would be the procedure to reverse this again?

arunoda · November 19, 2018, 1:16pm

Just install fastai via conda.
Or just run the update-fastai.sh script.

You can also check out a release tag in the repo instead of checking out master.
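
Roughly, the three options look like this. This is a sketch, not the exact steps from the fastai README: the function names and the ~/fastai clone location are made up for illustration, and the conda channels are an assumption based on the install instructions of the time.

```shell
# Hypothetical helpers; nothing runs until you call one of them.

# Developer install: clone master and install it in editable mode.
install_fastai_dev() {
  (
    git clone https://github.com/fastai/fastai "$HOME/fastai" &&
    cd "$HOME/fastai" &&
    pip install -e ".[dev]"
  )
}

# Switch the clone to a release tag instead of master.
# "$1" is the tag name; check the real names with: git tag --list
use_fastai_release() {
  ( cd "$HOME/fastai" && git fetch --tags && git checkout "$1" )
}

# Go back to the packaged release from conda.
# The channel list is an assumption; update-fastai.sh may use different options.
restore_fastai_conda() {
  conda install -c pytorch -c fastai fastai
}
```

After install_fastai_dev, python -c "import fastai; print(fastai.__version__)" should report a .dev version, as it did earlier in this thread.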

deepanshu2017 · November 27, 2018, 12:59pm

While trying to create a V100 instance I am receiving this error: Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally.

tillia · November 27, 2018, 1:23pm

Check your GPU quota settings (IAM & admin -> Quotas).
If you have to change the quota for GPUs, just write a ticket (choose the GPU quota -> Edit).
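
If you prefer the command line, the project-wide quota can probably be read with gcloud as well. A small sketch, assuming the gcloud SDK is installed and authenticated; show_gpu_quota is a made-up helper name and the grep pattern assumes the default YAML output, where each quota is printed as limit / metric / usage:

```shell
# Hypothetical helper: print project-level GPU quotas (e.g. GPUS_ALL_REGIONS).
# "$1" is your GCP project ID.
show_gpu_quota() {
  gcloud compute project-info describe --project "$1" \
    | grep -B1 -A1 'metric: GPUS'
}

# Example call (replace with your own project ID):
#   show_gpu_quota my-project-id
```

A limit of 0.0 for GPUS_ALL_REGIONS matches the error above and means the quota increase still has to be requested through the console.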

hwasiti · November 27, 2018, 4:00pm

If you get this error when you SSH into a GCP instance:

[Connection Refused]

The solution is to execute this line in bash on your PC:
gcloud compute routes create default-internet --destination-range 0.0.0.0/0 --next-hop-gateway default-internet-gateway

This is because the default route for non-local traffic (0.0.0.0/0) had been inadvertently deleted, which caused all external traffic to be lost on the return path.
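
Before creating the route, it may be worth checking whether a default route is really missing. A sketch, assuming the gcloud SDK is configured for the right project; check_default_route is a made-up helper name:

```shell
# Hypothetical helper: list routes whose destination range is 0.0.0.0/0.
# No rows in the output suggests the default internet route is missing.
check_default_route() {
  gcloud compute routes list --filter='destRange="0.0.0.0/0"'
}

# Example call:
#   check_default_route
```

If a route to default-internet-gateway already shows up, the connection problem most likely has a different cause.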

Source

deepanshu2017 · November 28, 2018, 2:18pm

@tillia I cannot see any quota related to GPU in my IAM & Admin -> Quotas

tillia · November 28, 2018, 2:46pm

In the Metrics dropdown, filter by ‘GPU’; you should then see all GPU-related quotas.

deepanshu2017 · November 28, 2018, 2:59pm

@tillia In the filters I see all the GPU quotas, and they are enabled (blue tick in front of them).

tillia · November 28, 2018, 3:53pm

You have to filter for GPU quotas only: in the dropdown select None, then filter by GPU and check the quotas you want to select. You should then have all GPU quotas listed, with the actual quota for each row on the right side of the table. Tick the checkbox of each quota you want to change, then select Edit and write a ticket.

marcmuc · December 1, 2018, 9:44pm

Just in case someone encounters this problem on GCP: training is very slow because PyTorch uses only one process, even though you specified num_workers = x (x > 1). (Normally fastai does this for you by default, with x = number of CPUs.) This seems to be a bug in older PyTorch versions, including the one that came preinstalled with the official image I used, following the fastai docs.

Upgrade PyTorch with conda install pytorch-nightly -c pytorch, not conda update (which will tell you it is already on the latest version). That solves the problem: you get multiple workers and faster training. (Works at least with version build pytorch.dev2018-11-30.)
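
A rough way to check whether you are affected, run on the instance while training is going on in another terminal. This is only a sketch: it assumes a Linux instance with nproc and pgrep available, and that the data loader workers show up as separate python processes.

```shell
# Number of CPU cores visible to the instance.
cpu_count="$(nproc)"
echo "CPU cores: $cpu_count"

# Installed PyTorch version (prints a note if PyTorch cannot be imported here).
python -c "import torch; print('PyTorch:', torch.__version__)" 2>/dev/null \
  || echo "PyTorch is not importable in this environment"

# Count running processes whose name contains "python".
# With several workers active you would expect more than one during training.
worker_count="$(pgrep -c python || true)"
echo "python processes: $worker_count"
```

If the count stays at one during training even though num_workers is greater than 1, the upgrade above is worth trying.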

Ephibob · December 2, 2018, 4:44pm

I tried updating the course but got this error. Help, please?

jupyter@instance-1:~$ cd tutorials/fastai/course-v3

-bash: cd: tutorials/fastai/course-v3: No such file or directory
jupyter@instance-1:~$

Ephibob · December 2, 2018, 6:44pm

Same here. Looks like it’s no longer available.