What does “Average Pool” do, it takes a Kernel Size and gives you the Average val, then move by the Stride value. Similar to Max Pool that we have used a few times (takes the max value in a given kernel size).
So, what does Global Average Pool do, the Kernel size is of dimensions H x W. So, it takes the Global Average across Height and Width and gives you a tensor with dimensions of 1xC for an input of H x W x C