BatchNormalization(axis = 1) when used on convolutional layers

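It's due to the different default dimension ordering. For Theano, the channel is the 2nd dimension (axis 1); for TensorFlow, it's the last dimension (axis -1, i.e. axis 3 for a 4D tensor). Batch normalization on convolutional feature maps should normalize per channel, so axis = 1 is the right choice only with the Theano ordering.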
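As a rough sketch of that rule, here is a small helper that picks the axis from the ordering. The name `channel_axis` is made up for illustration and is not part of Keras; the strings "channels_first"/"channels_last" are the Keras 2 names for the Theano and TensorFlow orderings, and "th"/"tf" are the older Keras 1 names.

```python
def channel_axis(data_format, ndim=4):
    """Return the axis holding channels for a conv feature map.

    Hypothetical helper, not a Keras function.
    ndim=4 assumes a 2D conv tensor: batch plus three feature dimensions.
    """
    if data_format in ("channels_first", "th"):
        # Theano ordering: (batch, channels, rows, cols)
        return 1
    if data_format in ("channels_last", "tf"):
        # TensorFlow ordering: (batch, rows, cols, channels)
        return ndim - 1
    raise ValueError("unknown data format: %r" % (data_format,))


if __name__ == "__main__":
    for fmt in ("channels_first", "channels_last"):
        print(fmt, "-> BatchNormalization(axis=%d)" % channel_axis(fmt))
```

In Keras 2 you could feed it the result of `keras.backend.image_data_format()` and pass the returned value as the `axis` argument of `BatchNormalization`.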

(Edited with correction from @skottapa)