The language model we use is publicly available, though. No?
Yes, correct, but I was thinking of using the lib instead of writing a custom training loop and models =) Also, they provide some embeddings already.
Why not also train the language model on the unsupervised entries in the IMDB dataset?
If we don’t hold out a validation set, does that mean there is no possibility of overfitting when building the language model?
We do!
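For reference, a minimal sketch with the fastai v1 data block API that pulls in the unsup folder when building the language-model data (folder names follow the standard IMDB layout; bs=48 is just illustrative):

```python
from fastai.text import *

path = untar_data(URLs.IMDB)

# Build language-model data from train, test, AND the unlabeled reviews;
# LM training ignores labels, so every review helps.
data_lm = (TextList.from_folder(path)
           .filter_by_folder(include=['train', 'test', 'unsup'])
           .split_by_rand_pct(0.1)   # hold out a validation split
           .label_for_lm()           # target = the next word
           .databunch(bs=48))
```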
How do you expand the vocab from WikiText to medical records when using transfer learning? I'm assuming the vocab only covers high-frequency English words from Wikipedia.
A validation set is held out, just a smaller portion (10k reviews instead of 25k).
Is there a backwards pre-trained wiki103 model?
The new model we fine-tune will have new words in its vocab. That’s fine; it will learn their meanings during fine-tuning.
You create the vocab from your own dataset.
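If it helps, a sketch of how this looks in fastai v1 (drop_mult=0.3 is illustrative; data_lm is assumed to be built from your own corpus, e.g. the medical records):

```python
from fastai.text import *

# data_lm.vocab was created from YOUR dataset, not from Wikipedia.
learn = language_model_learner(data_lm, AWD_LSTM, drop_mult=0.3)

# The pretrained WikiText-103 weights are remapped onto the new vocab;
# tokens that never appeared in Wikipedia start from a generic
# initialization and pick up their meaning during fine-tuning.
learn.fit_one_cycle(1, 1e-2)
```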
Is there an analogue of language models for images? Unsupervised learning on an image corpus? E.g. masking part of the image and trying to predict it.
Does that also work for titlecase and mixed case?
TextLMDataBunch no longer lets us set bs or max_vocab. How do we set those now? I guess we should use the new DataBlock API, but how do we set them there?
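In case it's useful, a sketch with the fastai v1 data block API: max_vocab moves into NumericalizeProcessor and bs into databunch() (path, max_vocab=30000, and bs=32 are placeholders):

```python
from fastai.text import *

processors = [TokenizeProcessor(tokenizer=Tokenizer()),
              NumericalizeProcessor(max_vocab=30000)]  # caps the vocab size

data_lm = (TextList.from_folder(path, processor=processors)
           .split_by_rand_pct(0.1)
           .label_for_lm()
           .databunch(bs=32))  # batch size goes here now
```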
Can anyone find a source/citation for what Jeremy was talking about: SwiftKey(?) and the generated LaTeX proofs?
When fitting on Wikipedia there is no risk of overfitting, because that is not the task we are going to test the model on. With IMDB, as Sylvain said, there is a validation set to avoid overfitting.
what is moms?
If we use another language, where do I set lang='pt', for example? And do I need to tell it to use spaCy?
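If I understand the fastai v1 API correctly, you set it on the Tokenizer, which wraps spaCy by default, roughly:

```python
from fastai.text import *

# Tokenizer defaults to tok_func=SpacyTokenizer, so lang='pt' switches
# the spaCy language used for tokenization.
pt_proc = [TokenizeProcessor(tokenizer=Tokenizer(lang='pt')),
           NumericalizeProcessor()]
# then pass processor=pt_proc to TextList.from_folder, as in the snippet above
```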
Momentums
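i.e. the momentum range cycled by fit_one_cycle. A quick sketch (the (0.8, 0.7) pair is just an example; learn as above):

```python
# Momentum starts at 0.8, drops to 0.7 while the learning rate rises,
# then climbs back to 0.8 as the learning rate anneals.
learn.fit_one_cycle(1, 1e-2, moms=(0.8, 0.7))
```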
Where are the English language punctuation rules defined?
This competition is particularly strict: they’re limiting external data to a pre-selected set of embeddings (see the discussion at https://www.kaggle.com/c/quora-insincere-questions-classification/discussion/70978#418095, for example). But the untrained models and training techniques would still be of use, as @devforfu noted.