Lesson 12 (2019) discussion and wiki

probably using embeddings, but not sure…

Along the same lines of what Jeremy is talking about right now - is it possible to have a TDD approach when doing deep learning?

1 Like

Say more…
Linear combinations of numerics, datetimes, and one-hot encoded categorical variables seem pretty straightforward, no?
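
E.g. something like this for a batch of already-numeric features — just a rough sketch, the α value and shapes are placeholders, not what Jeremy used:

```python
import torch

def mixup_batch(x, y, alpha=0.4):
    "Mix each row of x (and its target) with a randomly paired row, mixup-style."
    lam = torch.distributions.Beta(alpha, alpha).sample()
    idx = torch.randperm(x.size(0))
    x_mix = lam * x + (1 - lam) * x[idx]
    y_mix = lam * y + (1 - lam) * y[idx]
    return x_mix, y_mix

# e.g. a batch of 32 rows with 10 continuous features
x = torch.randn(32, 10)
y = torch.randn(32, 1)
x_mix, y_mix = mixup_batch(x, y)
```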

3 Likes

Is there any work on building open source simulators/debuggers that can help debug the kinds of issues Jeremy just described, without having to spend $$$?

I don’t even know if this is possible! Just curious.

1 Like

Quote of the day (month? year?):
“Training models sucks. And deep learning is a miserable experience and you shouldn’t do it.” - Jeremy Howard

8 Likes

Reminder to upvote & ask questions here for the last 10 minutes of class:

For what it’s worth, I’ve heard Jeremy say this several times now: he’s worked a lot with TDD in the past, but in DL he seems to prefer working with notebooks, which is in itself a kind of micro-TDD, if you have the discipline to check everything as you progress, as he just mentioned.
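
In practice those “micro-TDD” checks can just be asserts sprinkled through the notebook, something like this (the model, shapes, and number of steps here are made up):

```python
import torch
import torch.nn.functional as F

model = torch.nn.Linear(10, 2)                        # stand-in for whatever you're building
xb, yb = torch.randn(64, 10), torch.randint(0, 2, (64,))

# check the forward pass produces what you expect
out = model(xb)
assert out.shape == (64, 2)
assert torch.isfinite(out).all()

# check the model can actually learn: loss on one batch should drop
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_before = F.cross_entropy(model(xb), yb).item()
for _ in range(20):
    opt.zero_grad()
    loss = F.cross_entropy(model(xb), yb)
    loss.backward()
    opt.step()
assert F.cross_entropy(model(xb), yb).item() < loss_before
```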

6 Likes

What does Jeremy mean by scientific journal? Is it a file where all code goes by date? What is the best way to keep that?

5 Likes

What does Jeremy mean when he says “lab notes”? Is it a physical paper notebook he records stuff on? Or is there some software he’s using to record experiments/results/progress?

Edit: @maxim.pechyonkin same question at the same time! :raised_hands:

2 Likes

I guess it is more about using some digital notebook, e.g. OneNote or Evernote, so you can easily dump files, attach logs, diagrams, etc.

2 Likes

I think it’s just a traditional text file with the results of the experiments copy-pasted in.

3 Likes

Not really. I just see it as combining 0.3 of the Feb 2nd and 0.7 of the Feb 15th for Feb 11th… it’s probably just the two combined images that are confusing me.

Any sort of version control system is essential, even for teams of 1. That way you can go back to any point in the history of your project. And sometimes you need to.

I use Evernote for this exact purpose. Every research project has a notebook. I keep records of changes, plans for future changes, and try to record results along the way.

3 Likes

Notion is also a fabulous tool for this.

5 Likes

What about just cloning notebooks after each experiment and keeping the .ipynb files in a version control system? I am going with this strategy and it seems to be working so far. Is there a reason why copying results into a text file would be better?

“That we’ll transfer in a future lesson” could become a meme of this course :wink:

As I understand it, it’s a linear combination of things in vector space. Images are already tensors, so that’s easy. For categorical data you would need to convert all categoricals to embeddings and whatnot.

So if you have day_of_month and month categoricals, you would need to pass them through embeddings first.
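
Roughly, the mixing would then happen on the embedding outputs rather than on the raw category indices — something like this (the embedding sizes and λ here are made up):

```python
import torch
import torch.nn as nn

emb_day   = nn.Embedding(31, 8)   # day_of_month -> 8-d vector
emb_month = nn.Embedding(12, 4)   # month        -> 4-d vector

def embed(day, month):
    "Look up and concatenate the categorical embeddings for a batch."
    return torch.cat([emb_day(day), emb_month(month)], dim=1)

day   = torch.randint(0, 31, (16,))
month = torch.randint(0, 12, (16,))

lam = 0.7                                   # would come from a Beta distribution in practice
idx = torch.randperm(day.size(0))
x = embed(day, month)
x_mix = lam * x + (1 - lam) * x[idx]        # linear combination in embedding space
```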

2 Likes

Transformer-XL has state.

Regarding applying mixup to NLP or other fields where you have categorical inputs, is it necessary to have pretrained embeddings first to do the mixups on? It doesn’t seem obvious to me how to do mixup when you’re dynamically learning the embedding weights as well. And you certainly don’t wanna do mixup on one-hot encoded inputs before embedding them.
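
One thing that seems possible (just a sketch of the idea, not something from the lesson): do the mixing after the embedding layer inside the forward pass, so the embedding weights still get gradients from the mixed examples even though they’re being learned from scratch. All the names and sizes below are placeholders:

```python
import torch
import torch.nn as nn

class MixupClassifier(nn.Module):
    "Toy model that mixes *after* the (jointly learned) embedding layer."
    def __init__(self, n_tokens, emb_dim, n_classes):
        super().__init__()
        self.emb  = nn.Embedding(n_tokens, emb_dim)   # no pretraining assumed
        self.head = nn.Linear(emb_dim, n_classes)

    def forward(self, tokens, lam=None, idx=None):
        x = self.emb(tokens).mean(dim=1)              # crude bag-of-embeddings pooling
        if lam is not None:                           # mixup on the embedded representation
            x = lam * x + (1 - lam) * x[idx]
        return self.head(x)

model  = MixupClassifier(n_tokens=1000, emb_dim=32, n_classes=2)
tokens = torch.randint(0, 1000, (8, 20))              # batch of 8 sequences, length 20
lam    = torch.distributions.Beta(0.4, 0.4).sample()
idx    = torch.randperm(tokens.size(0))
logits = model(tokens, lam, idx)                      # targets get mixed with the same lam/idx
```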

2 Likes