[Solved] Reproducibility: Where is the randomness coming in?

rpcoelho · June 27, 2019, 12:37am

Ok, finally got it to work. So just detailing the instructions a bit more:

You have to run random_seed(0) before the first fit;
You have to run it before creating the databunch;
And you have to call it again before every subsequent call to fit.

I was calling it before creating the databunch and assuming the seed would stay set. So besides the code above, this solved it for me:

    random_seed(0)  # Need to call this again here, right before calling fit
    # x = [three layer sizes, max learning rate, weight decay, div_factor]
    x = [500, 500, 100, 0.0005, 0.4, 8]
    learn3 = tabular_learner(data, layers=[x[0], x[1], x[2]], ps=[0.09, 0.5, 0.5], emb_drop=0.04,
                             y_range=y_range, metrics=mae)
    learn3.fit_one_cycle(1, x[3], wd=x[4], div_factor=x[5])
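
For anyone wondering why seeding once is not enough: my understanding is that every step that draws random numbers advances the generator's state, so a later fit starts from a different state unless you reset it first. Here is a minimal sketch of that effect using only Python's built-in random module. fake_fit is a hypothetical stand-in for a training call, not a fastai function, and the real random_seed presumably has to reset every generator the library draws from, not just this one.

```python
import random


def fake_fit(n_draws=3):
    """Hypothetical stand-in for a training call: it just consumes random numbers."""
    return [random.random() for _ in range(n_draws)]


# Seed once, then "fit" twice: the second call starts from the state
# left behind by the first one, so the two results differ.
random.seed(0)
first = fake_fit()
second = fake_fit()

# Re-seed immediately before each call: both runs start from the same state.
random.seed(0)
run_a = fake_fit()
random.seed(0)
run_b = fake_fit()

print("seeded once, results match:", first == second)
print("re-seeded each time, results match:", run_a == run_b)
```

The same reasoning would apply to anything that runs between the seed call and fit (creating the databunch, building the learner): if it draws random numbers, the state has already moved by the time fit starts.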