I guess it is more about using some digital notebook, e.g. OneNote or Evernote. It makes it easy to dump files, attach logs, diagrams, etc.
I think it's just a traditional text file with the results of the experiments copy-pasted.
Not really. I just see combining 0.3 of the Feb 2nd and 0.7 of the Feb 15th for Feb 11th… it's probably just the two combined images that are confusing me.
Any sort of version control system is essential, even for teams of 1. That way you can go back to any point in the history of your project. And sometimes you need to.
I use Evernote for this exact purpose. Every research project has a notebook. I keep records of changes, plans for future changes, and try to record results along the way.
What about just cloning notebooks after each experiment and keeping .ipynb files in version control system? I am going with this strategy and it seems to be working so far. Is there a reason why copying results into a text file would be better?
"That we'll transfer in a future lesson" could become a meme of this course.
As I understand it, it's a linear combination of things in vector space. Images are already tensors, so that's easy. For categorical data you would need to convert all categoricals to embeddings first.
So if you have day_of_month and month categoricals, you would need to pass them through embeddings first.
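To make the "linear combination in vector space" idea concrete, here is a minimal sketch of mixup with NumPy (my own illustration, not the fastai implementation): sample a mixing weight from a Beta distribution and take the same convex combination of two inputs (e.g. concatenated embedding vectors) and their one-hot labels.

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.4):
    """Mix two examples: the same convex combination of inputs and one-hot labels."""
    lam = np.random.beta(alpha, alpha)       # mixing weight in (0, 1)
    x = lam * x1 + (1 - lam) * x2            # mixed input vector
    y = lam * y1 + (1 - lam) * y2            # mixed (soft) label
    return x, y

# Two inputs that are already continuous vectors, e.g. the day_of_month and
# month categoricals after an embedding layer, concatenated into one vector.
x1, x2 = np.random.randn(8), np.random.randn(8)
y1, y2 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
x, y = mixup(x1, y1, x2, y2)
```

The mixed label stays a valid probability distribution (it sums to 1), which is why mixup is usually paired with a soft-label loss rather than hard class indices.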
Transformer-XL has state.
Regarding applying mixup to NLP or other fields where you have categorical inputs: is it necessary to have pretrained embeddings first to do the mixup on? It doesn't seem obvious to me how to do mixup when you're dynamically learning the embedding weights as well. And you certainly don't wanna do mixup on one-hot encoded inputs before embedding them.
Side note: much respect for both Jeremy and Rachel for tonight - both look visibly unwell.
I tried doing that. I need some dropout with my notes.
So I use a physical notebook for "strategic non-code notes", then move on to OneNote for coding, separated by project. When I finish a project, I publish my findings on GitHub and in an article to myself, and the associated OneNote gets deleted.
If it was important enough to remember, it is important enough to publish.
What is required to use ULMFiT with SentencePiece?
Do we have to re-train the full model using SP for tokenization?
Jeremy is speaking about parameters of a script, so they are easy to copy and paste. A notebook should work too, as long as you save the commit/date of the corresponding libraries.
We are using ULMFiT in prod, and it performs great even with a small dataset.
Yes, you need a pretrained model with the same tokenization.
Any examples of ULMFiT being applied to multi-task learning problems (esp. seq2seq tasks)?
Thanks! That helps, but then wouldn't embedding dropout be similar?
I'm assuming Jeremy is about to discuss this, but if there are folks who have done it with SentencePiece, I'd be interested to know how they did it.