I'm working on integrating seq2seq model training into v2 and would love to be able to pass an ignore_index to LabelSmoothingCrossEntropy, telling it to skip certain token ids (e.g., any -1 token ids) when calculating the loss between the generated text and the actual text.
In v1, not knowing any better, I just wrote my own class to make this one modification in the forward pass:
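A minimal sketch of that kind of modification, in plain PyTorch, might look like the following. This is not the original v1 class: the name LabelSmoothingCEIgnore, the default eps=0.1, and the choice to average only over non-ignored positions are all assumptions made for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LabelSmoothingCEIgnore(nn.Module):
    """Label-smoothing cross entropy that skips targets equal to ignore_index.

    Hypothetical class name; a sketch, not fastai's implementation.
    """

    def __init__(self, eps=0.1, ignore_index=-1):
        super().__init__()
        self.eps = eps
        self.ignore_index = ignore_index

    def forward(self, output, target):
        # output: (..., n_classes) logits; target: (...) integer token ids
        n_classes = output.size(-1)
        output = output.reshape(-1, n_classes)
        target = target.reshape(-1)

        # Positions whose target is ignore_index contribute nothing to the loss
        keep = target != self.ignore_index
        n_kept = keep.sum().clamp(min=1)  # avoid dividing by zero

        log_preds = F.log_softmax(output, dim=-1)
        # Smoothing term: -sum of log-probs over classes, averaged over kept positions
        smooth = -(log_preds.sum(dim=-1) * keep).sum() / n_kept
        # Usual NLL term; nll_loss itself skips ignore_index targets
        nll = F.nll_loss(
            log_preds, target, ignore_index=self.ignore_index, reduction="sum"
        ) / n_kept
        return self.eps * smooth / n_classes + (1 - self.eps) * nll


# Example: positions 1 and 3 are padding (-1) and are left out of the loss
logits = torch.randn(4, 5)
targets = torch.tensor([1, -1, 3, -1])
loss = LabelSmoothingCEIgnore()(logits, targets)
```

With eps=0 this should reduce to ordinary cross entropy with the same ignore_index, which is a quick way to sanity-check it.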
And one last, somewhat related question: when I use splitter to create my own “layer groups”, how can I display those layer groups in v2? I tried len(learn.layer_groups), but that appears to be deprecated now.
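For context, my understanding is that a v2 splitter is just a function that takes the model and returns a list of parameter lists, so one way to look at the groups is to call it directly. The sketch below uses plain PyTorch; my_splitter and the toy model are hypothetical stand-ins. With a Learner, the equivalent would presumably be learn.splitter(learn.model), but that is an assumption worth checking against the docs.

```python
import torch.nn as nn


def params(module):
    """Return the module's parameters as a list."""
    return list(module.parameters())


def my_splitter(model):
    # Hypothetical splitter: first child is the "body", second is the "head"
    return [params(model[0]), params(model[1])]


# Toy stand-in for a real model
model = nn.Sequential(
    nn.Sequential(nn.Linear(10, 8), nn.ReLU()),
    nn.Sequential(nn.Linear(8, 2)),
)

groups = my_splitter(model)
n_groups = len(groups)

# Summarize each group: number of tensors and total parameter count
summary = [(len(g), sum(p.numel() for p in g)) for g in groups]
for i, (n_tensors, n_params) in enumerate(summary):
    print(f"group {i}: {n_tensors} tensors, {n_params} parameters")
```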