How to untar Chest X-rays data from Box using FastAI?

Hi All,

I am new to FastAI library. I just finished lecture 3 and wants to do something similar involving multi-label classification. To this end, I wanna retrieve data from Box here which is in several files available online.

Is there a way to download and unpack this data in images folder using FastAI untar function?

And by the way what would be the best approach to download this data from Python?

The platform is Google Cloud with lastest FastAI library.

Any guidance will be much appreciated.

I think fastai’s untar_data() function is just for fastai’s available datasets and allows .tgz file extension only so I think the best approach is to :

  • download your data simply by wget and you give it the link :
    wget https://www.-----.tar.gz
  • extract it manually to your destination folder :
    tar -xvzf dataset.tar.gz
  1. There’s already a python script in the link you posted, to download all files

Hey Zowlex, thanks for the reply.

I followed the step 1 and was able to download all the data on Google Cloud. Now trying various architectures.

1 Like

I’m glad you figured it out, best luck !