I saw learner.fit(3e-3, 1, wds=1e-6)
in lang_model-arxiv and I'm not sure what wds means.
I know! I asked about this earlier. wds is weight decay, which is just L2 regularization. Additional information where I got it from
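To make the "just L2 regularization" point concrete, here is a minimal sketch in plain Python. It is not fastai's implementation: the helper names sgd_step and l2_penalty_grad are made up for illustration, the weights and gradients are toy values, and it assumes plain SGD (with adaptive optimizers such as Adam, weight decay and an L2 penalty are no longer the same thing). Only lr=3e-3 and wd=1e-6 come from the call in the question.

```python
def sgd_step(w, grad, lr, wd=0.0):
    """One plain SGD step with weight decay: w <- w - lr * (grad + wd * w)."""
    return [wi - lr * (gi + wd * wi) for wi, gi in zip(w, grad)]


def l2_penalty_grad(w, wd):
    """Gradient of the penalty (wd / 2) * sum(w_i ** 2), which is wd * w_i."""
    return [wd * wi for wi in w]


# Toy weights and loss gradients (hypothetical values for illustration).
w = [1.0, -2.0]
grad = [0.5, 0.25]
lr, wd = 3e-3, 1e-6  # the values passed to learner.fit in the question

# Route 1: weight decay applied inside the update rule.
decayed = sgd_step(w, grad, lr, wd)

# Route 2: add the L2 penalty gradient to the loss gradient, then do plain SGD.
grad_with_l2 = [g + p for g, p in zip(grad, l2_penalty_grad(w, wd))]
penalized = sgd_step(w, grad_with_l2, lr)

print(decayed)
print(penalized)
```

Both routes give the same updated weights, which is why the two terms get used interchangeably for SGD. Each step shrinks every weight toward zero by a factor proportional to lr * wd, on top of the usual gradient update.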
Ohhhh, that makes sense! Thanks! I was racking my brain trying to figure out what that acronym stood for.