Calculating the output-layer bias for an imbalanced categorical classification problem

Hi all,

I’m working on an imbalanced classification problem (9 classes) and would like to know what system of equations to use for calculating the bias term of the output layer. I’m not sure how much it matters, but I’m using either sigmoid with binary_crossentropy or softmax with categorical_crossentropy.
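To make the question concrete, here is a minimal sketch of what I think the equations might look like. It assumes the goal is for the untrained output layer to predict the observed class frequencies p_i, treating the contribution of the weights as roughly zero at initialization. Under that assumption, softmax needs exp(b_i) / sum_j exp(b_j) = p_i, which b_i = log(p_i) satisfies (up to an additive constant shared by all classes), and an independent sigmoid per class needs 1 / (1 + exp(-b_i)) = p_i, which gives b_i = log(p_i / (1 - p_i)). The class counts and function names below are made up for illustration and are not from any library.

```python
import numpy as np


def softmax_bias(class_counts):
    """Bias b with softmax(b) equal to the class frequencies (assumes all counts > 0)."""
    counts = np.asarray(class_counts, dtype=np.float64)
    priors = counts / counts.sum()
    # Any constant could be added to every entry; softmax is shift-invariant.
    return np.log(priors)


def sigmoid_bias(class_counts):
    """Per-class bias b_i with sigmoid(b_i) equal to the frequency of class i."""
    counts = np.asarray(class_counts, dtype=np.float64)
    priors = counts / counts.sum()
    # Log-odds of each class against all the others.
    return np.log(priors / (1.0 - priors))


def softmax(z):
    z = z - np.max(z)  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


# Hypothetical counts for 9 imbalanced classes (illustrative only).
counts = [5000, 2500, 1200, 600, 300, 200, 100, 60, 40]
priors = np.asarray(counts, dtype=np.float64) / sum(counts)

b_softmax = softmax_bias(counts)
b_sigmoid = sigmoid_bias(counts)

# Both activations should reproduce the class frequencies from the bias alone.
print(np.allclose(softmax(b_softmax), priors))
print(np.allclose(sigmoid(b_sigmoid), priors))
```

If this is the right idea, I assume the resulting vector could be passed to the output layer as a constant bias initializer, but I’d appreciate confirmation that these are the correct equations for the multi-class case.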

Best wishes,
Pej