Lesson 4 In-Class Discussion

creviera · November 21, 2017, 2:58am

Jeremy just mentioned during the live stream that you can adjust dropout per layer and how you do this by passing an array in for ps, like so: learn = ConvLearner.pretrained(arch, data, ps=[0.,0.2], ...)

pete.condon · November 21, 2017, 2:59am

Has anyone managed to get the Rossman code working in Crestle? I’m getting an SSL error installing feather.

charlielee · November 21, 2017, 3:02am

Is binning a common practice to force continuous values into categorical groups?

pete.condon · November 21, 2017, 3:03am

Yep, sometimes there are so many categories that it’s not practical to use them all … especially when there isn’t a huge amount of variation between them.

jenna · November 21, 2017, 3:03am

Is it normal to set a column as continuous because it’d be more costly to set it as categorical?

jenna · November 21, 2017, 3:03am

Can you double-count a column by using it categorically and continuously?

yinterian · November 21, 2017, 3:03am

Just wait …

stathis · November 21, 2017, 3:04am

Isn’t binning a form of feature engineering? If so isn’t that kind of counterintuitive to use with deep learning?

charlielee · November 21, 2017, 3:04am

I would’ve considered it normalization and/or part of the process in data prep.

Avhirup · November 21, 2017, 3:05am

probabilities

rishubhkhurana · November 21, 2017, 3:06am

Will we capture the trend along the years when we code the year column as categorical?

elfrank · November 21, 2017, 3:06am

This note helped me a lot:

training, validation, accuracy
0.3, 0.2, 0.92 = under fitting, cycle is too short (cycle_mult=2?)
0.2, 0.3, 0.92 = over fitting

lgvaz · November 21, 2017, 3:07am

Maybe if you want to do a more general approach yeah, it is counter-intuitive… But if you want to win at kaggle competitions… haha

arjunrajkumar · November 21, 2017, 3:07am

Guessing we cant do data augmentation with structured data?

guthl · November 21, 2017, 3:07am

Is there a difference between using the type Category vs using LabelEncoder from sk-learn ?

aymenim · November 21, 2017, 3:08am

@yinterian Why are school holiday and promo in continous var not catagorical ?

atreides · November 21, 2017, 3:09am

@jeremy Can you add a link to the new paper about binning continuous features you discussed briefly

lgvaz · November 21, 2017, 3:10am

@arjunrajkumar, I would guess we can, for example, we could change the temperature of the day by a little bit, since it probably wouldn’t affect the sales of the day.

pete.condon · November 21, 2017, 3:11am

That’s a big assumption

yinterian · November 21, 2017, 3:11am

Note sure, I would have to look at this data.