I had the same error when trying to run this line from the lesson 3 notebook: load_fc_weights_from_vgg16bn(bn_model)
After some searching, I (re)discovered that vgg16_bn.h5 was stored in my ~/.keras/models directory. An ls -lh revealed that it was only 63K in size, whereas the oft-used vgg16.h5 (i.e. sans BN) is 528M.
ubuntu@ip-10-0-0-9:~/.keras/models$ ls -lh
total 528M
-rw-rw-r-- 1 ubuntu ubuntu 35K Jun 18 18:26 imagenet_class_index.json
-rw-rw-r-- 1 ubuntu ubuntu 63K Jun 18 18:04 vgg16_bn.h5
-rw-rw-r-- 1 ubuntu ubuntu 528M Jun 18 18:26 vgg16.h5
I therefore downloaded the file again using the link you provided, and that line from lesson 3 now runs without error for me.
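A quick way to guard against this kind of truncated download in future is a simple size check before loading. This is just a sketch; the 100 MB threshold and the stand-in file path are my own guesses, not anything from the lesson:

```python
import os

def weights_look_complete(path, min_bytes=100 * 1024 * 1024):
    """Heuristic: a full VGG16 weights file is hundreds of MB,
    so anything tiny is almost certainly a truncated download."""
    return os.path.exists(path) and os.path.getsize(path) >= min_bytes

# Example with a deliberately tiny stand-in file:
with open('/tmp/fake_vgg16_bn.h5', 'wb') as f:
    f.write(b'\x00' * 63 * 1024)  # 63K, like the broken file above

print(weights_look_complete('/tmp/fake_vgg16_bn.h5'))  # → False
```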
Exception: Error when checking model input: expected input_4 to have shape (None, 3, 244, 244) but got array with shape (64, 3, 224, 224)
(As if the generator isn’t recognising the batch dimension?)
Any help on this would be massively appreciated. (The capacity to identify cats from dogs has become rather central to my sense of self worth over the past month…)
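For what it's worth, the None in the expected shape is the batch dimension and matches any batch size, so the generator's 64 is fine; the disagreement in the error above is actually 244 vs 224 in the spatial dimensions. A tiny helper to pinpoint which axes disagree (my own sketch, not a Keras API):

```python
def mismatched_axes(expected, got):
    """Compare a Keras-style expected input shape (None = any size)
    against an actual array shape; return the axes that disagree."""
    return [i for i, (e, g) in enumerate(zip(expected, got))
            if e is not None and e != g]

expected = (None, 3, 244, 244)   # what the model was built with
got = (64, 3, 224, 224)          # what the generator yields

print(mismatched_axes(expected, got))  # → [2, 3]: the image axes, not the batch axis
```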
Lesson 5 is an amazing introduction to sentiment analysis. I tried to game the system by predicting the sentiment of a sarcastic review. Of course it failed (in fact, it got a better score than a genuinely honest positive review). Has anyone tried to train against sarcasm? Is it even possible, or is that A.I. 2.0?
phrase = np.array([], dtype="int64")
phrase = np.append(phrase, [1])  # start-of-sequence token; note np.append returns a new array, it does not modify in place
textphrase = 'yeah sure, you should trust the reviews, by all means, this is an amazing movie, come and enjoy :/ NOOOOT'
for o in textphrase.split(' '):
    if o in ids:
        phrase = np.append(phrase, ids[o])
padded_phrase = sequence.pad_sequences([phrase], maxlen=seq_len, value=0)
conv1.predict(padded_phrase)
output ---> array([[ 0.942]], dtype=float32)
The Embedding layer produces an output tensor of size
(None, 500, 50)
as you pointed out. This fits the normalised sequence length (500) and the embedding dimension (50) attached to each word.
The size of the vocabulary matters to the Embedding layer only as the range of integers used to represent the inputs fed to it (i.e. the result of word2idx).
The input size of the convolutional layers, however, should match the length of each sentence. The filters (of sizes 3, 4 and 5) are applied to the tensor of latent factors (500×50).
While the resulting matrices in the class notebook are tenfold the size they are supposed to be, the number of resulting parameters does not change: it depends primarily on the size of the filters.
Bottom line: it still works, but it's probably not as efficient (it takes longer to train per epoch) and the results are probably slightly more noise-prone (therefore taking more epochs to reach the same accuracy).
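That parameter claim is easy to verify with the standard 1D-convolution parameter formula, filters × (kernel_width × input_channels + 1 bias). A quick sketch, assuming the lesson's 50-dim embeddings and 64 filters of width 5:

```python
def conv1d_params(n_filters, kernel_width, in_channels):
    """Weights + biases for a 1D convolution: independent of sequence length."""
    return n_filters * (kernel_width * in_channels + 1)

# 64 filters of width 5 over 50-dim embeddings, for a ten-fold range of sequence lengths:
for seq_len in (500, 5000):
    print(seq_len, conv1d_params(64, 5, 50))  # → 16064 either way
```

Only the activation maps grow with sequence length (hence the slower epochs); the weight count does not.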
@Jeremy / @Rachel: if you agree with @idano's assessment, what's the best way to correct this? A pull request?
Does anyone get results different from the lecture notes when running the model yourself? I often get lower accuracy compared with the results in the lecture notes. For example, I trained the "Single conv layer with max pooling" model exactly as in the notes, but my result is much lower even though I followed the same steps.
Based on the following confusion matrices, is there anything I might have missed when I ran vgg16BN? The confusion matrix after using vgg16BN shows 1000 cats classified as cats and 947 dogs classified as dogs:
Are there any good rules about using multiple embeddings? For example, using word2vec or GloVe as a general embedding but also adding a domain-specific embedding for something like medical terminology.
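One common pattern is to look each token up in both embedding tables and concatenate the vectors, so downstream layers see the general and domain-specific features side by side. A minimal numpy sketch; the table sizes, the random vectors, and the per-token concatenation are illustrative assumptions, not from any particular library or paper:

```python
import numpy as np

vocab, d_general, d_domain = 1000, 50, 20
general = np.random.randn(vocab, d_general)   # e.g. GloVe-style general vectors
domain = np.random.randn(vocab, d_domain)     # e.g. medical-terminology vectors

def embed(token_ids):
    """Concatenate the two lookups per token → (seq_len, d_general + d_domain)."""
    return np.concatenate([general[token_ids], domain[token_ids]], axis=-1)

sentence = np.array([3, 17, 256])
print(embed(sentence).shape)  # → (3, 70)
```

The alternative is to sum or average the two (which requires matching dimensions); concatenation keeps the sources separable and lets the model learn how much weight to give each.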
I got it. Thanks! I previously thought the vgg16() method created the file named vgg16.h5, but now I know I was wrong. (I set up the running environment on my own computer, not AWS, so I didn't have the model files beforehand.)