Hey! Have you tried by tweaking the probability of dropout (‘ps’) and weight decay(‘wd’)? Try increasing them for a few epochs and tell us what you find!
Hey! Have you tried by tweaking the probability of dropout (‘ps’) and weight decay(‘wd’)? Try increasing them for a few epochs and tell us what you find!