Sorry to post this again, but I’m hoping someone will have an idea about what I might try to fix this. I’ve since found that if I restart the kernel and set the batch size to 1 (bs=1), I get a different error:
ValueError: Expected more than 1 value per channel when training, got input size torch.Size([1, 4096])
So that makes sense, and it shows that I’m successfully changing the batch size, but changing the batch size does not resolve the memory error.