Lesson 2 official topic

I’m seeing this issue when trying to use download_images.

It appears that search_images_ddg returns a list of None values; the error seems to come from results.attrgot('contentUrl')

My setup is:

=== Software === 
python        : 3.7.11
fastai        : 2.6.3
fastcore      : 1.4.2
fastprogress  : 0.2.7
torch         : 1.10.0
nvidia driver : 470.103
torch cuda    : 11.3 / is available
torch cudnn   : 8200 / is enabled

=== Hardware === 
nvidia gpus   : 1
torch devices : 1
  - gpu0      : NVIDIA GeForce GTX 1070 Ti

=== Environment === 
platform      : Linux-5.4.0-109-generic-x86_64-with-debian-buster-sid
distro        : #123-Ubuntu SMP Fri Apr 8 09:10:54 UTC 2022

You probably want to use the code and the search_images function listed in this notebook. Note that it’s part of step 1 and you might need to ‘unhide’ the specific code for the function.

1 Like

Thanks @strickvl ! I was just trying the vanilla fastbook notebook. I should’ve looked at the birds notebook (in fact I did that whole notebook just the other day :blush:)

1 Like

That looks awesome! Love the idea of an ml marketplace. Let me know when you’re ready for some open source contributions.

Yup you’re reading too much into it. :slight_smile: One is an object of type torch.Tensor and the other is of TensorBase (which is part of fastai).
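
A minimal sketch of that difference, assuming a recent fastai install (the tensor values are arbitrary):

import torch
from fastai.torch_core import TensorBase

t = torch.tensor([1., 2., 3.])       # plain PyTorch tensor
tb = TensorBase(t)                   # fastai's TensorBase subclasses torch.Tensor

print(type(t))                       # <class 'torch.Tensor'>
print(type(tb))                      # <class 'fastai.torch_core.TensorBase'>
print(isinstance(tb, torch.Tensor))  # True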

1 Like

Sometimes, because Kaggle has so many packages installed, they can clobber each other. In this case it looks like typing-extensions is expected to be there but isn’t, so try pip install -Uqq typing-extensions.

1 Like

Hi Imran,
What is the way to install fastai and fastbook on a laptop?

Remove
.attrgot('contentUrl')
from that line; results on its own will suffice.
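
Something like the following, assuming the search_images_ddg helper from fastbook, which already returns a plain list of URLs (the search term and destination folder are just examples):

from fastbook import search_images_ddg
from fastai.vision.all import download_images

urls = search_images_ddg('grizzly bear', max_images=50)  # already plain URLs
download_images('bears/grizzly', urls=urls)              # no .attrgot('contentUrl') needed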

I’m having trouble getting my model to train properly. The training loss is low but the validation loss is high (meaning it’s overfitting, if I remember correctly). Would love some advice!

In the meantime, I’ll try adding more data, as each mineral species can have many different shapes and colors.

2 Likes

I go with option b.
During training, for dog images the snake and OTHER responses are driven down, and similarly for snake images the dog and OTHER responses are driven down. For a house image, both snake and dog have a low response, while OTHER, which has never been trained to be high or low for such images, will also have a low response. Three low responses result in roughly [0.3, 0.3, 0.3].
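
A quick numeric sketch of that intuition (the activation values are made up; three similarly low activations give roughly equal probabilities after softmax):

import torch

acts = torch.tensor([0.10, 0.20, 0.15])  # dog, snake, OTHER: all low for a house image
print(torch.softmax(acts, dim=0))        # roughly [0.32, 0.35, 0.33]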
Would love to see the actual answer revealed.

Hi,
I am running the lesson 2 notebook on Colab. I am facing an issue after downloading the images.

fns = get_image_files(path)
fns

is showing zero files.

3 Likes

@VishnuSubramanian and @jpc awesome, thanks for the help, that worked and I was able to train.

I started at half size, then went up to full size in a second learner, reloading the weights.

It then looks like it started to overfit.
I ran it with a lower learning rate range and unfrozen while I was away from the computer, as an experiment.


I guess that LR landscape isn’t great, and the epochs just look to be getting worse.

So I guess I’m getting around 0.87 Dice on the validation set, which isn’t bad. Does anyone know how I can run this against the test images and calculate a Dice score across them all?

Also, I am guessing this leaderboard is scored against the test set masks / ground truth, which isn’t available?
https://uwm-bigdata.github.io/wound-segmentation/

3 Likes

Awesome! That did it. Thanks Jeremy.
I can confirm that this works…

!pip install -Uqq gradio 
!pip install -Uqq fastai
!pip install -Uqq typing-extensions

although only having a localhost URL (127.0.0.1) is not much use.
At least Colab provides a public URL (see below).
[Edit:] From poking around I discovered that on Kaggle adding share=True to launch() caused a public URL to be generated. Interesting that that flag wasn’t required for Colab.
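
For reference, a minimal sketch of what that looks like (the predict function and labels here are placeholders, not my actual app):

import gradio as gr

def predict(img):
    # placeholder: run your fastai learner here and return {label: probability}
    return {'cat': 0.9, 'dog': 0.1}

demo = gr.Interface(fn=predict, inputs='image', outputs='label')
demo.launch(share=True)  # share=True generates a public URL (needed on Kaggle)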

P.S. for others following, the order matters; this order doesn’t work…

!pip install -Uqq typing-extensions
!pip install -Uqq gradio 
!pip install -Uqq fastai
2 Likes

Question:

Is there any resource where we can see all the pre-trained models that can be passed to the fastai model API?
For example, here I am trying to pass inception_v3 and it throws an error.

Is there any list of all the pre-trained models, with their names, that we can use? Thanks

1 Like

Hi all, I have a question. When should we use learn.save() and learn.export()? It seems to me that the latter is more convenient to use than the former.

Thanks in advance.

1 Like

You need to enclose the name of the model in quotes. You can also use a wildcard to find the model, using the following command from timm: timm.list_models('*densenet*')

3 Likes

I know about learn.export.

learn.export() is used to save the trained model in pickle (.pkl) format.
The saved model can then be used for inference on new/unseen test data with the predict method.
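
Roughly, that export-then-predict flow looks like this (the file name and image path are just examples):

from fastai.vision.all import load_learner, PILImage

# after training: learn.export('export.pkl') serializes the whole Learner (model + transforms)
learn_inf = load_learner('export.pkl')   # reload the exported Learner for inference
pred, pred_idx, probs = learn_inf.predict(PILImage.create('test_image.jpg'))
print(pred, probs[pred_idx])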


I am not very clear on learn.save(). Can someone please explain the difference?

2 Likes

It wasn’t clear to me exactly how to run that, so pulling on that thread and working out from here, the following works well enough in a fresh notebook on Kaggle…

!pip install -Uqq timm
import timm; print(timm.__version__)
import pprint
pretrained_models = timm.list_models(pretrained=True)
print(len(pretrained_models))
pprint.pprint(pretrained_models)

0.5.4
592
['adv_inception_v3',
 'bat_resnext26ts',
 'beit_base_patch16_224',

Yay! Here is my Cats v Dogs Gradio app, launched from Kaggle.
But I can’t quickly determine why the Flag button is shown (i.e. where it is defined),
or why my two example photos don’t show.

Having a low training loss but high validation loss does not necessarily mean that your model is already overfitting. This is a point of confusion for most of us when starting out and trying to understand these ideas.

As mentioned in the book: when the validation set accuracy stops getting better, and instead goes in the opposite direction and gets worse, that is when you can start thinking that the model is memorizing the training set rather than generalizing, and has started to overfit.
Taken straight from the book, Chapter 1:

In other words, as you progress further with training, once the validation loss starts getting consistently worse than it was before (meaning the model is not generalizing as well to unseen data, even while it keeps improving on the training data), the model has started to overfit.
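
To make that concrete, here is a made-up loss trajectory (purely illustrative numbers) where the training loss keeps falling but the validation loss turns around after a few epochs, which is the point where overfitting has started:

train_loss = [0.90, 0.60, 0.45, 0.35, 0.28, 0.22, 0.17, 0.13]
valid_loss = [0.85, 0.62, 0.50, 0.44, 0.42, 0.45, 0.49, 0.55]

# the validation loss is lowest at epoch 4 (0-indexed) and then gets consistently worse
best_epoch = min(range(len(valid_loss)), key=lambda i: valid_loss[i])
print(f'validation loss starts rising after epoch {best_epoch}')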

Not to be taken as an exact example, but I’ve tried to quickly sketch out what this could mean in practice.

10 Likes