Thank you @sgugger. I did as you instructed and it worked. So just to recap (in case other people find it helpful), to train the `RNNLearner.language_model` with FastAI on multiple GPUs we do the following:
- Once we have our `learn` object, parallelize the model by executing `learn.model = torch.nn.DataParallel(learn.model)`
- Train as instructed in the docs
- Save the parallelized model with `learn.save('name')`
- As instructed by @sgugger here, load the saved model, strip the `module.` prefix from the keys of its dictionary, and create a new `state_dict`
- Load the newly created `state_dict` into the model, which will now be unparallelized
- Save both the model and the encoder by executing `learn.save('name-unpr')` and `learn.save_encoder('name')`.
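
The key-stripping step above can be sketched like this. It's a minimal sketch: the toy dictionary stands in for the saved `state_dict` (whose values are really tensors), and the helper name is my own, not part of fastai. Wrapping a model in `torch.nn.DataParallel` prefixes every parameter key with `module.`, so removing that prefix lets the plain, unwrapped model load the weights:

```python
from collections import OrderedDict

def strip_module_prefix(state_dict):
    """Remove the 'module.' prefix that DataParallel adds to parameter keys."""
    return OrderedDict(
        (k[len('module.'):] if k.startswith('module.') else k, v)
        for k, v in state_dict.items()
    )

# Keys as they appear after saving a DataParallel-wrapped model:
saved = {'module.encoder.weight': 1, 'module.decoder.bias': 2}
clean = strip_module_prefix(saved)
print(list(clean))  # ['encoder.weight', 'decoder.bias']
```

You would then call something like `learn.model.load_state_dict(clean)` before saving the unparallelized model and encoder.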