Lesson 3 In-Class Discussion

I found TTA made my results slightly worse on the Dog Breeds experiment. Is there a way to get the probabilities without using TTA?

learn.predict(is_test=True)

2 Likes

Jeremy mentioned the fastai machine learning course? Iā€™m having trouble finding it on the interwebs. Does someone have a link?

2 Likes

The files are in the ml1 folder in the github repo

3 Likes

@yinterian Thanks. Does the order of the files matter in the submission? In the dogs vs. cats one, I thought I was getting the files out of order.

Thank you :slight_smile:

They are awesome. I am only through lesson 2, but there is a ton of really good content.

1 Like

What matters is that you match the prediction with the right ā€œidā€.

why model is giving different predictions for single image?
Initial 37 and then 33?

Thank you. I think I remembered the order looking more straightforward in the version 1 keras version, but I think it was because it was pulling from the test directory sorted by the file system. I thought I had specified something incorrectly in the fastai libraryā€™s predict method.

Is Octavioā€™s video publicly available? (if so, link please <3)

5 Likes

I donā€™t think so, It was unlisted.

whereā€™s the Octavio video? Is it available? Tried searching and couldnā€™t find it.

3 Likes

@yinterian
How do we arrive at the filters?
Are these optimised as well (via gradient descent or other methods?)

1 Like

Yes, the filters are optimized with Stochastic Gradient Descent (SGD) or a version of it.

2 Likes

I have a Keras implementation that looks a lot more like the fast ai lib. forum thread is here and github here.

Right now it gets about 98.5% acc on dogs and cats. I am still looking for the places where fastai (pytorch) and my lib differ to get it up to 99%

6 Likes

If @jeremy would have 3channel input in his excel spreadsheet, then the filter in first hidden layer would have dimension [3,3,3]?

Why using a simple sum of previous convolutions instead of weighted one?

Intuitively every layer captures either edges, shapes of the image. As information passes through these kernels and activations, isnā€™t there progressive information loss? Is information preserved or does this not matter?

2 Likes

Are there any situations where a non-square maxpool might be useful?

1 Like