Since we all will be using the planet dataset for the Lesson 2, I thought it would be best to put down the steps to do this on AWS. I have done this and been able to run the note book successfully. Hope this helps.
Install Kaggle CLI (if done, Go to Step 2)
pip install kaggle-cli
Configure your kaggle account
kg config –u <your username (your email most likely)> -p <your password> -c <competition name>
a. Go to Kaggle Competition Website, Login and accept the rules of competition
b. If you’ve always signed into Kaggle using a linked social media account, you will get an error using the kaggle cli, which requires that you have a separate kaggle login. Fortunately, Kaggle has a solution: if you select Forgot Password?, you’ll receive an email with a few different options, the 3rd of which lets you set up your own Kaggle username/password and connects it to your original social media account
c. How to find Kaggle competition name – Go to Kaggle competition page in kaggle website and take the name. For ex – if page is https://www.kaggle.com/c/planet-understanding-the-amazon-from-space, then competition name is planet-understanding-the-amazon-from-space
Download the data
Extract data: zip files
unzip –q <filename.zip>
Extract data: tar files
7za x <filename.tar.7z>This extracts 7z format and delivers an output <filename.tar>
tar xf <filename.tar>
You only need the following files for running the notebook (as per my understanding for now. @jeremy will probably explain this in the next class)
I deleted the rest of the files as the device was running out of space, but if you have space you can keep it in a separate folder under data/planet.