Pkl and pth

Stephen_F · April 26, 2019, 1:11am

Can someone help me understand the difference between the .pkl and the .pth files? I was looking at the repo for Render deployment and saw that one of the files has the message, “Download export.pkl in the current dir”, but when I open that file it instead says, “The .pth files should get downloaded here.”

I know the .pkl file has my full serialized model. So what’s the .pth file?

yeldarb · April 26, 2019, 3:55am

pkl stands for a “pickle” file which is a way of serializing objects in Python. Its contents can be almost anything; it just depends on what was serialized.

The pth file is your model’s weights (and optimizer state if saved with_opt).

Stephen_F · April 26, 2019, 7:06pm

Ah, great! So is there an advantage to using the pth file instead of the pkl file for my model deployment? For example, the pth file is more lightweight?

yeldarb · April 26, 2019, 10:18pm

Depending on the model you may need both.

For example, for the pre-trained language models this is what gets downloaded behind the scenes:

A pkl file containing the vocabulary (integer -> string mapping) and a pth file containing the weights of the network.

On the other hand, for computer vision models, only the weights need to be downloaded:

franva · December 29, 2019, 6:08am

Also, I somehow recall that
pkl is for deploying to production.

Am I correct?

kogam22 · May 9, 2020, 9:52am

Were you correct ? I have the same doubt.

RJSD3V · May 20, 2020, 8:02am

is there a way to convert a pth to a pkl file? cant load a pth file using load_learner()

ML4Noobs · August 1, 2020, 7:07am

How do we convert from .pkl to .pth? I’ve tried using this but it just returns a string of integers:

with file.open('export.pkl', 'rb') as fid:
  print(pickle.loads(fid.read())) # => 102846209128121496
  # what now?