Can we somehow specify AUC as the loss criterion in tabular_learner? I want to train a deep model on a binary classification problem (tabular data with all numeric features) with high class imbalance. Augmenting the data set with copies of negative samples doesn’t seem to help. Using the AUC criterion in LightGBM gives high accuracy on the test set.
Do you mean using it as an evaluation metric? AUC is not yet implemented in the fastai library, but instead of accuracy you can, for example, use FBeta or Precision and specify the averaging parameter to account for class imbalance.
As I understand it, AUC cannot be used as a loss function in a Neural Network context for a couple of reasons.
The only thing that matters for an AUC score is the relative order of the predicted probabilities - the metric is invariant under any strictly increasing transformation of the scores, not just scaling. AUC is typically computed on the entire set of predictions - in other words on all training examples (or perhaps all validation examples). You could compute AUC on a mini-batch of examples, but I suspect you’d run into trouble: the per-batch value depends on how the mini-batches are sampled, and with high class imbalance a batch may contain no positive examples at all, in which case AUC is undefined.
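To make the two points above concrete, here is a small sketch in plain Python. The `auc` helper and the toy data are my own assumptions for illustration (they are not fastai or LightGBM code): AUC is computed as the fraction of (positive, negative) pairs that are ranked correctly, then compared on the full set, on transformed scores, and on mini-batches of 64.

```python
import math
import random

def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

random.seed(0)
# Toy imbalanced data (assumption): about 5% positives, which score higher on average.
labels = [1 if random.random() < 0.05 else 0 for _ in range(2000)]
scores = [random.gauss(1.0 if y else 0.0, 1.0) for y in labels]

full_auc = auc(scores, labels)

# Strictly increasing transforms keep the ranking, so AUC does not change.
squashed = [1.0 / (1.0 + math.exp(-s)) for s in scores]
scaled = [10.0 * s + 3.0 for s in scores]

# Per-batch AUC: noisy, and undefined for batches that contain only one class.
batch_aucs, skipped = [], 0
for start in range(0, len(labels), 64):
    try:
        batch_aucs.append(auc(scores[start:start + 64], labels[start:start + 64]))
    except ValueError:
        skipped += 1

print("full:", full_auc, "sigmoid:", auc(squashed, labels), "scaled:", auc(scaled, labels))
print("batch min/max:", min(batch_aucs), max(batch_aucs), "undefined batches:", skipped)
```

The pairwise loop is quadratic in the number of examples, which is fine for a demonstration but not something you would want inside a training loop.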
The more important issue is that AUC is not a usable function for gradient descent: it is a step function of the predicted scores that only changes when two predictions swap order, so its gradient is zero almost everywhere and undefined at the jumps. Consequently, back-propagation of errors into the network will not be possible. There have been various attempts to create differentiable approximations of the AUC metric to be used as a loss function with gradient descent - but they don’t seem to have gathered much momentum. I believe it is because they require changes to the gradient descent algorithm with respect to the way that training examples are fed in. See these two papers:
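Separately from those papers, here is a minimal sketch of why the exact metric gives no gradient signal, and what a smooth stand-in can look like. The pairwise sigmoid surrogate below (`soft_auc_loss`, with a `temperature` parameter) is one common idea that I am using for illustration; it is an assumption on my part and not necessarily the approach taken in the papers mentioned above.

```python
import math

def auc(scores, labels):
    """Exact AUC: fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)

def soft_auc_loss(scores, labels, temperature=1.0):
    """Smooth stand-in for 1 - AUC: each hard pair comparison becomes a sigmoid of (neg - pos)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    total = sum(_sigmoid((n - p) / temperature) for p in pos for n in neg)
    return total / (len(pos) * len(neg))

labels = [1, 0, 1, 0, 0]
scores = [0.9, 0.2, 0.4, 0.6, 0.1]

# Nudge one positive score slightly upward, without changing the ranking.
nudged = list(scores)
nudged[2] += 1e-3

auc_delta = auc(nudged, labels) - auc(scores, labels)                       # exactly zero
loss_delta = soft_auc_loss(nudged, labels) - soft_auc_loss(scores, labels)  # small, negative

print("AUC change:", auc_delta)
print("surrogate change:", loss_delta)
print("low-temperature surrogate:", soft_auc_loss(scores, labels, temperature=0.01))
```

The exact AUC does not move under the nudge, while the surrogate does, and as the temperature shrinks the surrogate approaches 1 - AUC. Note that the surrogate is defined over pairs of positives and negatives, so every batch needs examples of both classes, which ties back to the point about how training examples are fed in.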
If, as @SBecker suggests, you want to use AUC as an evaluation metric, that is possible. There is some neat code here that @joshfp posted which uses a callback function to calculate the AUC score at the end of each epoch of training:
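To be clear, what follows is not @joshfp's code, just a framework-agnostic sketch of the same idea: collect the predictions and targets from every batch, then compute AUC once on the whole epoch. The `EpochAUC` class and its `reset` / `update` / `compute` methods are hypothetical names of my own; in fastai you would call them from whatever epoch-begin, batch-end and epoch-end hooks your version's callback API provides.

```python
class EpochAUC:
    """Hypothetical helper: accumulate scores and targets per batch, compute AUC per epoch."""

    def __init__(self):
        self.reset()

    def reset(self):
        # Call at the start of each epoch.
        self.scores, self.targets = [], []

    def update(self, batch_scores, batch_targets):
        # Call after each validation batch with positive-class scores and 0/1 targets.
        self.scores.extend(float(s) for s in batch_scores)
        self.targets.extend(int(t) for t in batch_targets)

    def compute(self):
        # Call at the end of the epoch. Rank-based AUC (Mann-Whitney U), average ranks for ties.
        n = len(self.scores)
        n_pos = sum(self.targets)
        n_neg = n - n_pos
        if n_pos == 0 or n_neg == 0:
            return float("nan")  # undefined when only one class was seen
        order = sorted(range(n), key=lambda i: self.scores[i])
        ranks = [0.0] * n
        i = 0
        while i < n:
            j = i
            while j + 1 < n and self.scores[order[j + 1]] == self.scores[order[i]]:
                j += 1
            avg_rank = (i + j) / 2.0 + 1.0  # average 1-based rank of the tie group
            for k in range(i, j + 1):
                ranks[order[k]] = avg_rank
            i = j + 1
        rank_sum = sum(r for r, t in zip(ranks, self.targets) if t == 1)
        return (rank_sum - n_pos * (n_pos + 1) / 2.0) / (n_pos * n_neg)

# Usage: two "batches" from one epoch.
meter = EpochAUC()
meter.update([0.9, 0.2], [1, 0])
meter.update([0.4, 0.6, 0.1], [1, 0, 0])
epoch_auc = meter.compute()
print("epoch AUC:", epoch_auc)
```

Computing the metric once per epoch on the accumulated predictions avoids the per-batch noise discussed earlier, at the cost of keeping all scores in memory until the epoch ends.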