Shape of an image input in resnet

What shape does a resnet convnet expect input images to have?
I’m using HxWxD (height times width x depth, depth being the three color channels), which is the way numpy encodes jpeg images. However the code in tye pytorch transfer learning tutorial (http://pytorch.org/tutorials/beginner/transfer_learning_tutorial.html) seems to expect depth to be the first axis.

In keras I use this HxWxD

1 Like