learner.fit(3e-3, 1, wds=1e-6) in lang_model-arxiv and not sure what it meant.
I know! Asked about this earlier It is just l2 regularization. Additional information where I got it from
Ohhhh that makes sense! Thanks! Was racking my brains out trying to figure out what that acronym is