Lesson 4 In-Class Discussion

Any suggestions on Linear - ReLU - BN vs. Linear - BN - ReLU (the order the BN paper suggests)? Is it because one works better than the other?
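For anyone comparing, here is a minimal PyTorch sketch of the two orderings (the layer sizes are invented). The BN paper normalizes the pre-activation, i.e. puts BN before the nonlinearity, but which order trains better seems to be mostly an empirical question:

```python
import torch.nn as nn

# Ordering discussed in the lecture: Linear -> ReLU -> BN
linear_relu_bn = nn.Sequential(
    nn.Linear(128, 64),
    nn.ReLU(),
    nn.BatchNorm1d(64),
)

# Ordering from the BN paper (Ioffe & Szegedy, 2015): Linear -> BN -> ReLU
linear_bn_relu = nn.Sequential(
    nn.Linear(128, 64),
    nn.BatchNorm1d(64),
    nn.ReLU(),
)
```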


Yeah, I agree. Basically you have to make assumptions about what you can change, and by how much, for the labels to still stay the same.

My guess is that school holiday and promo are already one-hot encoded because they are Boolean; the other categorical variables are going to be one-hot encoded.


That log rule is not right: log(a/b) = log(a) - log(b), which is not equal to log(a)/log(b).
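A quick numeric check (base-10 logs) makes the difference obvious:

```python
import math

a, b = 100, 10

# log(a/b) = log(a) - log(b)
print(math.log10(a / b))              # 1.0
print(math.log10(a) - math.log10(b))  # 2.0 - 1.0 = 1.0

# ...which is not the same as log(a)/log(b)
print(math.log10(a) / math.log10(b))  # 2.0 / 1.0 = 2.0
```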


Are you installing feather-format? This works for me on Crestle:

pip3 install feather-format

Wow, thanks for the insight!

True, I was just typing that. Great catch :slight_smile:

I owe you a beer :slight_smile:


It’s not really the same. For normalization you use only the data, but for binning you choose arbitrary values; most of the time these are driven by domain-specific knowledge.
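To make that concrete, here is a small pandas sketch (the values and bin edges are invented for illustration):

```python
import pandas as pd

prices = pd.Series([3.0, 12.0, 47.0, 150.0, 890.0])

# Normalization: the parameters (mean, std) come from the data itself.
normalized = (prices - prices.mean()) / prices.std()

# Binning: the edges are chosen by hand, usually from domain knowledge.
binned = pd.cut(prices, bins=[0, 10, 100, 1000],
                labels=["low", "mid", "high"])
```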

Would dropout in this case set certain columns to 0, or would it take out the entire row?

certain columns
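A quick PyTorch demo of what that looks like (the tensor shape is arbitrary):

```python
import torch
import torch.nn.functional as F

x = torch.ones(4, 6)  # 4 samples (rows), 6 activations (columns)
y = F.dropout(x, p=0.5, training=True)
print(y)
# Elements are zeroed independently, so within each row some columns
# become 0 (survivors are scaled by 1/(1-p)); no row is ever removed.
```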


¯\_(ツ)_/¯ sounds right to me.

Why does 2014 get converted to 2? Is it because the data starts at 2012, so 2012 is 0, 2013 is 1, and 2014 is 2?

Is he describing one-hot encoding currently?

Embeddings.


He is explaining embeddings.


@yinterian Just a note to ask Jeremy to tell us the trick (at the 53:00 mark) towards the end of the class. :sweat_smile:


2012 is the starting year in the dataset.
2013 is the next one, and so on…
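That matches how pandas assigns categorical codes, e.g. with made-up data:

```python
import pandas as pd

years = pd.Series([2012, 2013, 2014, 2013, 2012])

# Codes start at 0 for the first (sorted) category:
# 2012 -> 0, 2013 -> 1, 2014 -> 2
print(years.astype("category").cat.codes.tolist())  # [0, 1, 2, 1, 0]
```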

Sunday was a rank-1 matrix of length 4 (4x1). How could it fit (get appended) into the original input, which was nx1? I did not understand the dimensions.


So do you not use one-hot encoding when doing this at all? It just uses the embedding matrix instead?
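As I understand it, yes: each category index is used to look up a row of the embedding matrix directly, so no one-hot vector is ever materialized. A minimal PyTorch sketch (the sizes are hypothetical: 7 days, 4-dimensional embeddings), which also shows how an nx1 column of indices becomes nx4:

```python
import torch
import torch.nn as nn

# Hypothetical sizes: 7 categories (days of week), 4-dim embeddings.
emb = nn.Embedding(num_embeddings=7, embedding_dim=4)

# n = 3 samples, each holding a single day index (the n x 1 column).
days = torch.tensor([6, 0, 6])

# The lookup replaces each index with its 4-dim row, so the output
# is n x 4; mathematically this equals multiplying a one-hot vector
# by the embedding matrix, but the one-hot vector is never built.
vectors = emb(days)
print(vectors.shape)  # torch.Size([3, 4])
```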
