Why does the conv2d in vision nets have no bias term?

(Malcolm McLean) #1

I am looking at the more recent architectures, which are structured as conv2d - activation - batchnorm.

It seems to me that a bias learned by the conv2d would have a major effect on the model’s learning capacity.
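
To make the question concrete, here is a minimal pure-Python sketch, not taken from any particular network. The values in `conv_out` and the `bias` are made up, the `batchnorm` helper is a simplified stand-in that only normalizes one channel to zero mean and unit variance (no learned scale or shift), and ReLU is assumed as the activation. It compares what a constant bias does when batchnorm follows the conv directly and when an activation sits in between.

```python
import math

def batchnorm(xs, eps=1e-5):
    """Normalize values to zero mean and unit variance (no learned scale/shift)."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return [(x - mean) / math.sqrt(var + eps) for x in xs]

def relu(xs):
    """Assumed activation: clip negative values to zero."""
    return [max(0.0, x) for x in xs]

def same(a, b):
    """True if two lists agree element-wise up to floating-point noise."""
    return all(math.isclose(x, y, abs_tol=1e-9) for x, y in zip(a, b))

# Hypothetical conv outputs for a single channel, and a hypothetical learned bias.
conv_out = [-1.5, -0.5, 0.25, 1.0, 2.0]
bias = 0.75
biased = [x + bias for x in conv_out]

# conv -> batchnorm: the mean subtraction removes a constant per-channel bias.
direct_same = same(batchnorm(conv_out), batchnorm(biased))

# conv -> activation -> batchnorm: the bias changes which values get clipped,
# so it can still change the normalized output.
act_same = same(batchnorm(relu(conv_out)), batchnorm(relu(biased)))

print("conv -> bn unaffected by bias:", direct_same)
print("conv -> relu -> bn unaffected by bias:", act_same)
```

In this toy setup `direct_same` comes out true and `act_same` comes out false, so whether the bias is redundant seems to depend on whether the normalization is applied directly to the conv output.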

Thanks for any insights.