Errors using export and TensorBoard

feribg · December 12, 2019, 3:17am

I’m having the following errors when calling export:

learn.export('data/nn_bilstm/models/best.pkl')
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-15-46f99ea18b04> in <module>
----> 1 learn.export('data/nn_bilstm/models/best.pkl')

~/Dev/anaconda3/envs/kaggle/lib/python3.7/site-packages/fastai/basic_train.py in export(self, file, destroy)
    238             state['model'] = m
    239             xtra = dict(normalize=self.data.norm.keywords) if getattr(self.data, 'norm', False) else {}
--> 240             state['data'] = self.data.valid_ds.get_state(**xtra)
    241             state['cls'] = self.__class__
    242             try_save(state, self.path, file)

AttributeError: 'TensorDataset' object has no attribute 'get_state'

and when calling LearnerTensorBoardCallback,

I get RNN backwards method can only be called during training. When I remove the TB callback it works just fine.

Can you please help. The export is particularly cumbersome. Data is created like so, where the sets are TensorDataset pytorch objects

project_path = 'data/nn_bilstm'
db = DataBunch.create(train_ds=train_ds, valid_ds=valid_ds, path=project_path)
b1 = db.one_batch()
print(f'Train size: {b1[0].shape}, valid size {b1[1].shape}')

sgugger · December 12, 2019, 8:07pm

Learner.export is not intended to work if you didn’t build your data using the fastai library (it wants to remember the transforms/preprocessing you applied to be able to do it for you at inference).
You can try to patch a method get_state to TensorDataset that takes some args/kwargs and return an empty dictionary, but I don’t guarantee it will work.

feribg · December 13, 2019, 6:57pm

Got it thanks! Actually it looks like every single method in the predict and inference API fails when using pytorch datasets, so maybe that needs to be noted in the docs.

Do you have any clue why the TB callback breaks with RNN’s. Here’s the full model, loss and metric:

class CustomLoss(torch.nn.Module):
    def __init__(self):
        super(CustomLoss, self).__init__()
        self.loss = nn.CrossEntropyLoss()

    def forward(self, input, target):
        #reshape input to (batch x RNN seq len, cat_count), target to [RNN_seq_len] long tensor 0-N cat
        score = self.loss.forward(input.view(-1,input.shape[2]), target.view(-1))
        return score
    
def accuracy(input:Tensor, targs:Tensor)->Rank0Tensor:
    "Computes accuracy with `targs` when `input` is bs * n_classes."
    n = targs.shape[0]
    input = input.argmax(dim=-1).view(n,-1)
    targs = targs.view(n,-1)
    return (input==targs).float().mean()    

# Bidirectional recurrent neural network (many-to-one)
class BiRNN(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers, num_classes):
        super(BiRNN, self).__init__()
        self.hidden_size = hidden_size
        self.num_layers = num_layers
        self.lstm = nn.LSTM(input_size, hidden_size, num_layers, batch_first=True, bidirectional=True)
        self.fc = nn.Linear(hidden_size*2, num_classes)  # 2 for bidirection
    
    def forward(self, x):
        # Set initial states
        h0 = torch.zeros(self.num_layers*2, x.size(0), self.hidden_size).to(device) # 2 for bidirection 
        c0 = torch.zeros(self.num_layers*2, x.size(0), self.hidden_size).to(device)
        
        # Forward propagate LSTM
        out, _ = self.lstm(x.unsqueeze(-1), (h0, c0))  # out: tensor of shape (batch_size, seq_length, hidden_size*2)
        # Decode the hidden state of the last time step
        out = self.fc(out.view(x.shape[0], x.shape[1], -1))
        out = F.log_softmax(out, dim=1)
        return out