How to download data for Lesson 2 from Kaggle for Planet Competition


(Vijay Narayanan Parakimeethal) #21

Can you install 7zip in the crestle instance using

sudo apt-get install p7zip-rar

Post that pls try

7za x <filename.tar.7z>


(Vijay Narayanan Parakimeethal) #22

@anurag Need your help here. Looks like in crestle you cannot install kaggle-cli and unzip tar.7z files. There is a error like lxml<4.1,>=4.0.0 distribution is needed. can you help @memetzgz in this as she is using crestle?


(Maureen Metzger) #23

Thanks Vijay for your help – yes, this is the error I’m getting


(Anurag Goel) #24

I’ll look into the test data.


(Anurag Goel) #25

Crestle does have the test-jpg, test-jpg-additional and test-tif-2 folders with ~40k/20k/61k images respectively. Are you running into issues with using them?


(Maureen Metzger) #26

Hi @anurag, problem may be then that I did not create the right symlinks? I will check when I next log on. Appreciate your looking into this on your end. Crestle is working very well otherwise!


(Vikrant Behal) #27

It shows 1.7G for me.


(Sudarsan Padmanabhan) #28

It is also possible to download specific file using kaggle-cli

$ kg download -u <username> -p <password> -c <competition> -f train.zip

(Jeremy Howard) #29

Thanks for the tip - I didn’t know that :slight_smile:


(Clark Updike) #30
for f in test-jpg-additional.tar.7z test-jpg.tar.7z test_v2_file_mapping.csv.zip train-jpg.tar.7z train_v2.csv.zip
do
   kg download -f $f
done

(aswin) #31

I followed the steps from the first post, and kg download gives me this error.
‘NoneType’ object has no attribute ‘find_all’

I installed the cli using the --upgrade option.