For regression problem, should we normalize labels to a certain range before training?

sky1ove · March 11, 2021, 5:41pm

I’d like to train a neural network using MSE as loss function for a regression problem. However, my labels are log ratios of some gene change which are range [-5, 1]. To achieve a better accuracy, should I normalize labels to a balanced range like [-1,1]? Or it is totally fine to train a model from that range?

Pomo · March 12, 2021, 7:14am

Hi. Typically you would leave the labels as is, and tack on a k1+sigmoid()*k2 layer at the end of the model that can only generate [-5,1] regardless of the input. I don’t think there is any well-defined theory behind this practice. “It makes it easier for the model,” in the sense that the original model is allowed to produce any value and it will still map to the label space. In practice, the underlying function may be learned faster and with more accuracy.

IDK how the log ratio would matter. If I were doing it, I’d experiment with and without sigmoid, and predicting the log ratio vs, the actual ratio.

Just some ideas and HTH,

tcapelle · March 12, 2021, 11:07am

As @Pomo said, I regularly use the fastai layer called SigmoidRange([-5,1]) at the end of the network.