Lesson 7 In-Class Discussion

A_TF57 · December 12, 2017, 2:58am

Weren’t we taking the transpose of these chunks? (bptt x bs, versus bs x bptt)
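For anyone else puzzling over the two layouts, here is a minimal pure-Python sketch of the idea, not the library's actual code. It assumes the token stream is first cut into bs contiguous chunks (bs x n_steps) and then transposed, so that each mini-batch is bptt consecutive rows of width bs. The helper names `batchify` and `get_batch` are made up for illustration.

```python
def batchify(tokens, bs):
    """Split a flat token list into bs contiguous chunks, then transpose.

    Each returned row holds one time step across all bs chunks,
    so the result has shape (n_steps, bs).
    """
    n_steps = len(tokens) // bs  # drop the ragged tail
    chunks = [tokens[i * n_steps:(i + 1) * n_steps] for i in range(bs)]  # bs x n_steps
    return [list(step) for step in zip(*chunks)]  # n_steps x bs


def get_batch(data, start, bptt):
    """Return up to bptt rows as input and the same rows shifted by one as target."""
    seq_len = min(bptt, len(data) - 1 - start)
    x = data[start:start + seq_len]
    y = data[start + 1:start + 1 + seq_len]
    return x, y


tokens = list(range(20))
data = batchify(tokens, bs=4)            # 5 rows, 4 columns
x, y = get_batch(data, start=0, bptt=3)  # x is (bptt x bs)
print(x[0])  # first time step of every chunk
```

Reading down a column of `data` follows one chunk of the text in order, which is why the mini-batch ends up as bptt x bs rather than bs x bptt.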

vikbehal · December 12, 2017, 2:58am

So, we should try to increase bptt as much as possible?

KevinB · December 12, 2017, 3:00am

Just keep things below the explosion point.

memetzgz · December 12, 2017, 3:00am

@KevinB, you make it sound so simple

narvind2003 · December 12, 2017, 3:02am

Since the loop is dynamic, can we keep changing the BPTT value as we loop?

KevinB · December 12, 2017, 3:03am

“In theory there is no difference between theory and practice. In practice there is. - Yogi Berra” - Kevin Bird

vikbehal · December 12, 2017, 3:05am

It happens automatically for each mini-batch in PyTorch!

gerardo · December 12, 2017, 3:07am

What else can I include on that tokenize?

KevinB · December 12, 2017, 3:08am

Look at get_tokenizer:

github.com

pytorch/text/blob/master/torchtext/data/utils.py

def get_tokenizer(tokenizer):
    if callable(tokenizer):
        return tokenizer
    if tokenizer == "spacy":
        try:
            import spacy
            spacy_en = spacy.load('en')
            return lambda s: [tok.text for tok in spacy_en.tokenizer(s)]
        except ImportError:
            print("Please install SpaCy and the SpaCy English tokenizer. "
                  "See the docs at https://spacy.io for more information.")
            raise
        except AttributeError:
            print("Please install SpaCy and the SpaCy English tokenizer. "
                  "See the docs at https://spacy.io for more information.")
            raise
    elif tokenizer == "moses":
        try:
            from nltk.tokenize.moses import MosesTokenizer
            moses_tokenizer = MosesTokenizer()

This file has been truncated.
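Going by the first branch of the quoted function (a callable is returned unchanged), one option is to pass your own tokenizer function. The sketch below is only an illustration: `simple_tokenizer` and its regex are made up, and `resolve_tokenizer` is a hypothetical stand-in that mirrors just that first branch so the example runs without torchtext installed.

```python
import re

# Words (optionally with one apostrophe contraction) or single punctuation marks.
TOKEN_RE = re.compile(r"\w+(?:'\w+)?|[^\w\s]")


def simple_tokenizer(text):
    """Lowercase the text and split it into word and punctuation tokens."""
    return TOKEN_RE.findall(text.lower())


def resolve_tokenizer(tokenizer):
    """Hypothetical stand-in for the callable branch of the quoted get_tokenizer."""
    if callable(tokenizer):
        return tokenizer
    raise ValueError("this sketch only handles callables")


tok = resolve_tokenizer(simple_tokenizer)
tokens = tok("Don't stop, it's fine!")
print(tokens)  # ["don't", 'stop', ',', "it's", 'fine', '!']
```

Anything you want the tokenizer to do (lowercasing, special-casing URLs, splitting on extra symbols) can live inside that one function.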

narvind2003 · December 12, 2017, 3:08am

Are you sure BPTT is not a constant?

yinterian · December 12, 2017, 3:09am

For words it can be quite complicated
https://nlp.stanford.edu/IR-book/html/htmledition/tokenization-1.html

vikbehal · December 12, 2017, 3:09am

Jeremy explaining now.

narvind2003 · December 12, 2017, 3:11am

You’re right… last time we saw something approximately 70, not exactly 70. So if we wrote the Python RNN from scratch, we could approximately randomize it. Cool.
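A rough sketch of what randomizing it could look like in a from-scratch loop, using only the standard library. The specific choices here (drawing from a normal distribution around bptt with a standard deviation of 5, occasionally halving bptt, and a floor of 5) are assumptions for illustration, not a statement of what the library does.

```python
import random


def sample_seq_len(bptt=70, std=5, min_len=5, rng=random):
    """Draw a sequence length near bptt (assumed scheme, for illustration)."""
    # Assumption: use the full bptt most of the time, half of it occasionally.
    base = bptt if rng.random() < 0.95 else bptt / 2
    return max(min_len, int(rng.gauss(base, std)))


def iter_spans(n_rows, bptt=70, seed=0):
    """Yield (start, seq_len) pairs that cover rows 0 .. n_rows - 2 exactly once."""
    rng = random.Random(seed)
    start = 0
    while start < n_rows - 1:
        # Clamp so the target (shifted by one row) never runs past the data.
        seq_len = min(sample_seq_len(bptt, rng=rng), n_rows - 1 - start)
        yield start, seq_len
        start += seq_len


spans = list(iter_spans(n_rows=1000))
print(spans[:3])
```

Each mini-batch then gets a slightly different length, so the model does not always see the text cut at the same positions from one epoch to the next.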

vikbehal · December 12, 2017, 3:16am

If my Language model isn’t predicting well, what could be the reason?
Is all we have to play with bs, bptt, and some dropout?

bhollan · December 12, 2017, 3:17am

Do you have any details about what we email for part 2? Do we just say “yes, I’d like to do part 2” to the same email address? @yinterian / @jeremy?

pete.condon · December 12, 2017, 3:17am

How much training are you doing? I was a bit surprised when I kicked off the imdb code and found it did 70+ epochs of 4000+ batches.

vikbehal · December 12, 2017, 3:18am

I have less data than IMDB.

yinterian · December 12, 2017, 3:18am

International students have to contact fast.ai.
In-person enrollment would be through the Data Institute at USF.

pete.condon · December 12, 2017, 3:19am

How many words are you working with?

vikbehal · December 12, 2017, 3:22am

bs=64; bptt=70