[Solved] Seq2seq training cuda out of memory error

anz9990 · January 15, 2018, 9:37am

Anyone tried training a seq2seq model using pytorch? I’ve been trying to train a model for a while but keep getting RuntimeError: cuda runtime error (2) : out of memory no matter what I do. I’ve tried with smaller hidden size, layers, batch_size and limiting the max_sequence_length but still getting the same error. My next move is to try on a machine with more GPU memory, but was wondering if there’s anything I could do to check for memory leaks and/or other issues that might be making it run out of memory.

Interogativ · January 16, 2018, 11:43pm

This usually means you need a smaller batch size, How much memory does your GPU have and what is your batch size?

anz9990 · January 17, 2018, 12:51am

Using a batch size of 8 with 6GB of GPU memory.

anz9990 · January 19, 2018, 8:38am

Managed to work it out. The embedding I was using had too high of a dimensionality (300) and vocab size (around 70000). Reducing them both ended up using less memory.

egma · April 26, 2018, 9:45am

Thanks @Interogativ for your advice, it solved my problem like a charm !!