Bidirectional model for IMDB: am I interpreting it right? (read inside for details)

So if I’m not mistaken, this is my understanding:

Use the same vocabulary and everything, but flip the data around, so that you have another, new dataset plus a new model and encoder. OK.

Then I train my classifier on top of the new model and get some predictions.

… and I combine the predictions of the original classifier and the new classifier to get a better result?

Am I getting this right or wrong?


That’s right. You just average the predictions of the two models.
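For concreteness, here's a minimal sketch of the whole procedure. Everything below is illustrative rather than the actual lesson code: the array names and toy numbers are made up, and the training step is elided.

```python
import numpy as np

# Step 1: flip the data around -- same vocabulary, same labels,
# only the token order is reversed. Toy reviews as token-id arrays:
trn_ids = [np.array([5, 12, 7, 3]), np.array([8, 2, 9])]
trn_ids_bwd = [ids[::-1] for ids in trn_ids]

# Step 2 (elided): train a backward language model and a backward
# classifier on the reversed data, exactly as you did forwards.

# Step 3: average the class probabilities of the two classifiers
# on the same examples. Stand-in outputs for two examples:
fwd_preds = np.array([[0.9, 0.1], [0.4, 0.6]])
bwd_preds = np.array([[0.7, 0.3], [0.2, 0.8]])
avg_preds = (fwd_preds + bwd_preds) / 2
print(avg_preds.argmax(axis=1))  # ensembled class per example
```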


Ensemble the forward and backward predictions.


What exactly do you mean by “the predictions”? (I’m getting a little confused…)

The probability vectors of size vocab_length for the next word (predicted word).


> You just average the predictions of the two models.

I think “predictions” here means the output of the classifiers, not

> The probability vectors of size vocab_length for the next word (predicted word).

(Is it possible to ensemble the results of the two language models?)
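To make the distinction concrete, here's a shape-only sketch; all the sizes are made up for illustration:

```python
import numpy as np

# Classifier "predictions": one probability vector per review, of size
# n_classes (2 for IMDB sentiment) -- these are what get averaged above.
n_docs, n_classes = 4, 2
clf_fwd = np.random.dirichlet(np.ones(n_classes), size=n_docs)
clf_bwd = np.random.dirichlet(np.ones(n_classes), size=n_docs)
clf_avg = (clf_fwd + clf_bwd) / 2   # shape (n_docs, n_classes)

# Language-model "predictions": one probability vector per token
# position, of size vocab_length. Note the forward model predicts the
# *next* token from the left context while the backward model predicts
# the *previous* token from the right context, so at a given position
# they aren't conditioning on the same inputs.
seq_len, vocab_length = 10, 30000
lm_fwd = np.random.dirichlet(np.ones(vocab_length), size=seq_len)
print(clf_avg.shape, lm_fwd.shape)  # (4, 2) (10, 30000)
```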

What do you mean by that? Please elaborate on “backward prediction”.