Input dimensions for modern networks

VGG net and Alex net only accepted fixed input size (I believe 224*224). Is it changed in Resnet? If yes, how?
What is the fundamental difference there?

Yes, resnet’s penultimate layer is a pooling layer that pools down to a 1x1 size. Pretty much all modern architectures do this, and fastai does a neat trick that converts all architectures (including VGG) to this approach!

4 Likes

Thank you, So that I understand better:
You are saying that the reason that Alex net cannot deal with different input size is that the layer before FC layers would end up having different spatial dimensions (Width * Height * filter_bank_size)? And they fix it by pooling to (1 * 1 * filter_bank_size)?

2 Likes

Exactly! And nicely described :slight_smile:

2 Likes