I’m working with a small dataset on a multi-class problem and want to know how best to choose bptt, batch size, and other hyperparameters for this specific project. I’ve seen this question asked a couple of times in other threads but haven’t seen any answers.