Okay, so there have been a lot of complaints about the slowness of the lesson, so I did something about it.
I used Google’s
and messed around with it a lot.
I tried vocabulary sizes of 64, 32, 16 and 8 thousand, and it makes almost no difference in accuracy.
There are other optimizations/fine tunings I did too.
So, all in all, I reduced the language model (encoder) training time to 10 min per epoch,
and the classifier to 3 min per epoch.
But I didn’t beat @jeremy. His was 95.4, my best is 95.2. Sad.
I’ve been trying to get that last 0.2 percent for the last 2 weeks but I can’t.
I do want to write a blog post about it, but my result is not as good as his. So I don’t know if I should.
And it’s an NLP classification task, so what do I show in a blog post? Like, plots of the loss functions?
I just don’t know what there is to write.
Also, should I try to improve it? I’m pretty exhausted from trying to make it better. Is the last 0.2+% important/hard?
I need help… @rachel
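For a rough sense of how big the 95.4 vs 95.2 gap is, here is a quick sketch that turns the two accuracies into error rates and compares them. The function names are made up for illustration, and it only uses the two numbers quoted above, nothing from the actual runs.

```python
def error_rate(accuracy_pct):
    """Convert an accuracy in percent into an error rate in percent."""
    return 100.0 - accuracy_pct


def relative_error_increase(baseline_acc, other_acc):
    """Fraction by which the other model's error rate exceeds the baseline's."""
    base_err = error_rate(baseline_acc)
    other_err = error_rate(other_acc)
    return (other_err - base_err) / base_err


# 95.4 and 95.2 are the accuracies quoted in the post.
gap = relative_error_increase(95.4, 95.2)
print(f"Relative increase in error rate: {gap:.1%}")
```

So in terms of errors, the gap is a few percent relative, which might be one way to frame the comparison in a write-up next to the training-time numbers.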