Time series/sequential data study group

@oguiza First of all, thank you for your valuable contributions!

I am looking for some guidance on a time series classification project.
I have a dataset with 1500 samples (meters), 1 feature, and 1440 time intervals of data. The twist is that I am also given a target variable for each time interval.
So y is not the typical shape of (1500,). I need to predict a target for each time interval in the test set as well. I could not find a way to prepare my data using the df2xy function mentioned in tsai. What is the best way to approach this problem?
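In case it helps, here is a rough numpy sketch of the shapes I believe tsai expects; the column layout of the dataframe here is just an assumption for illustration:

```python
# Rough sketch (column layout assumed): first 1440 columns hold the readings,
# the next 1440 hold the per-interval targets.
import numpy as np
import pandas as pd

df = pd.DataFrame(np.random.rand(1500, 2880))        # synthetic stand-in

values = df.iloc[:, :1440].to_numpy()                # (1500, 1440) readings
targets = (df.iloc[:, 1440:].to_numpy() > 0.5) * 1   # (1500, 1440) per-interval labels

X = values.reshape(1500, 1, 1440)                    # tsai expects (samples, variables, steps)
y = targets                                          # one label per time interval
print(X.shape, y.shape)
```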

This is what the data looks like, if it helps for visualization:
[image]

Maybe you want to approach it as a segmentation problem? There is a discussion around that topic in the tsai Discussions section:

Thanks @vrodriguezf. I see the situation is the same. I just need to figure out how to get my input data into a form that tsai/MiniRocket can use.

I'm not sure what to share since the data itself can be created in so many ways. E.g. I can either predict price movement up or down (categorical) or the target price in the future. I can also use up to 14,000 variables or as little as 1, but I've tried 1 to 8 variables and it's the same result: 50/50. My problem is broad. Could it be that the way I have upsampled the data is the problem? Or do I need to include more variables, maybe 50 to 100? Or do I need to use sliding windows? Or do all my predictions need to be the same number of steps in the future, or can they vary by sample? Or should I persist with image classification or tabular versions instead? Or… there are a million questions and different things I can change, but what I am asking is how I can figure out for myself what I need to learn/change/do differently. All the discussions here are so high level and beyond my current understanding that the leap from where I am to where you guys are seems like walking on the moon.

@oguiza also, thanks for being patient, even though I was rude.

I just thought of something looking at your regression notebook. Would there likely be a material difference in accuracy if one model was set to predict a 50/50 increase/decrease in stock price (categorisation) vs predicting the stock price itself (regression), using the exact same set of data and all else otherwise the same? In other words, are some problems likely to be more accurate as a regression problem vs a categorisation problem? My initial intuition is that categorisation would be easier, but as I think about it, maybe regression helps the algorithm learn quicker?
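For what it's worth, here is a minimal sketch of the two framings side by side in tsai (API names taken from the tsai docs; exact signatures may vary by version, and all data here is made up):

```python
from tsai.all import *
import numpy as np

X = np.random.rand(500, 1, 100).astype(np.float32)   # made-up price windows
y_cls = np.random.choice(["up", "down"], 500)        # direction labels
y_reg = np.random.rand(500).astype(np.float32)       # future price targets
splits = get_splits(y_cls, valid_size=0.2)

# Same inputs, two framings
clf = TSClassifier(X, y_cls, splits=splits, metrics=accuracy)
reg = TSRegressor(X, y_reg, splits=splits, metrics=mae)
clf.fit_one_cycle(10)
reg.fit_one_cycle(10)
```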

Dear Dr. Ignacio,

I have tried this latest tutorial for Multi-label classification. I only have a problem with testing on a new set of data.
I tried doing this:

```python
# Reuse the validation pipeline (transforms, vocab) for the new test set
valid_dl = dls.valid
test_ds = valid_dl.dataset.add_test(X_test, y_test.values)
test_dl = valid_dl.new(test_ds)

# with_decoded=True makes get_preds also return the decoded predictions
_, temp_targets, temp_preds = learn.get_preds(dl=test_dl, with_decoded=True,
                                              save_preds=None, save_targs=None)
```

but I get predictions as:
[image]

So is this how I should be getting the predictions? If yes, how would I be able to specify the labels' names and their corresponding columns?

Thank you for all your help!

Update:
I was able to map the integers to their corresponding labels by manually investigating the test instances of the original test set. Then, I computed metrics for each label:
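Something along these lines (a sketch with placeholder label names and random stand-in arrays):

```python
# Per-label metrics sketch: temp_targets / temp_preds assumed to be
# (n_samples, n_labels) arrays of 0s and 1s.
import numpy as np
from sklearn.metrics import precision_score, recall_score, f1_score

labels = ["label_a", "label_b", "label_c"]        # placeholder names
temp_targets = np.random.randint(0, 2, (100, 3))  # stand-in for the real targets
temp_preds = np.random.randint(0, 2, (100, 3))    # stand-in for the decoded preds

for i, name in enumerate(labels):
    p = precision_score(temp_targets[:, i], temp_preds[:, i], zero_division=0)
    r = recall_score(temp_targets[:, i], temp_preds[:, i], zero_division=0)
    f = f1_score(temp_targets[:, i], temp_preds[:, i], zero_division=0)
    print(f"{name}: precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```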

Hi everyone,
This looks like an interesting group. I need to do time series forecasting and am hoping deep learning can work.

This might be a stupid question, but when you talk about the number of samples, does this mean one long time series split into multiple parts, similar to a sliding window (sketched below)?

What if I have multiple datasets from different sources? Can the samples be from different sources and not split up? I.e., use each whole series as one sample and have many different samples?

Hope that makes sense.

Thanks in advance!
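For concreteness, this is roughly what I mean by the sliding-window framing, using what I understand to be tsai's SlidingWindow utility (argument names guessed from the docs and may vary by version):

```python
from tsai.all import *
import numpy as np

t = np.arange(1000)                                # one long univariate series
X, y = SlidingWindow(window_len=60, horizon=1)(t)  # 60-step windows, predict 1 step ahead
print(X.shape, y.shape)                            # roughly (n_windows, 1, 60) and (n_windows,)
```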


Hi @remapears. I've created an updated version of the 01a_MultiClass_MultiLabel_TSClassification.ipynb tutorial notebook to help answer your questions. Examples of label mapping are shown in cells 16-25 for multi-class and cells 44-51 for multi-label.

The updated tutorial is currently available as a gist at either:

Let me know if this addresses your questions. If so, I'll submit this update to tsai as well.


Oh yes please! It helps tremendously!
Although I don't think L is defined prior to its use in:

```python
decoded_preds = L(vocab[p] for p in temp_preds)
```

Thanks again!


Hi @remapears. Glad the updated tutorial was of value. Sorry for any confusion around L(): it's an enhanced list class defined in fastcore and is included via the statement `from tsai.all import *` in cell 2.


oh I see! great! :smiley:

Hi. If I have 1000 different cake stalls that each run for only 3 weeks, is it better to train a new model for every single stall, or can I train a model that generalises across all stalls?

Training a new model for every stall sounds like a big task, considering you have 1000 stalls! Probably train one model that ingests the time series data from all the stalls. You will then have 1000 sequences (one for each stall), each with input and label (x, y) pairs.
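A minimal sketch of that framing (shapes and features made up for illustration):

```python
import numpy as np

# One (n_steps, n_features) array per stall; 3 weeks of daily data assumed
stall_series = [np.random.rand(21, 4) for _ in range(1000)]

X = np.stack([s.T for s in stall_series])   # (1000 samples, 4 vars, 21 steps)
y = np.random.randint(0, 2, 1000)           # placeholder: one label per stall
print(X.shape, y.shape)
```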


Thank you. I tried one model for the whole lot but it didn't predict anything.

Has anyone tried this new loss on torch 1.8: `torch.nn.GaussianNLLLoss`?
It looks cool for probabilistic forecasting. Maybe we can do something like here
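A minimal sketch of how it can be used (shapes made up; in practice the network would predict both a mean and a positive variance per step):

```python
import torch
import torch.nn as nn

loss_fn = nn.GaussianNLLLoss()

mean = torch.randn(32, 10)          # predicted means (batch, horizon)
var = torch.ones(32, 10)            # predicted variances, must be positive
target = torch.randn(32, 10)        # observed values

loss = loss_fn(mean, target, var)   # argument order: input, target, var
print(loss)
```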


I have a question regarding batch size:

https://towardsdatascience.com/checklist-for-debugging-neural-networks-d8b2a9434f21, for example, shares this advice:

> large-batch methods tend to converge to sharp minimizers of the training and testing functions, and as is well known, sharp minima lead to poorer generalization

I am using 2048/4096 as the batch size with tsai, as it simply trains much, much faster!
Is this way too large?

I have observed that when using the image preprocessing I have to go down to a couple of hundred, otherwise my GPU OOMs.

Hi @geoHeil,
I don't think you can say a priori that it's too large a batch size. It all depends on your dataset and task. I normally use large batch sizes (especially with large datasets) and then adjust the rest of the parameters (like lr, regularization, etc.). But if you have the time, it's always a good idea to test different batch sizes to see if there's any performance degradation. If your dataset is too large, you can always use a fraction of the training set to test the impact of batch size.


I normally follow the rule of using the largest batch size possible for the GPU I am using. There's also the bs finder that OpenAI proposed, which is implemented here for fastai, but I've never tried it.


That makes sense @vrodriguezf.

There's something else I forgot to mention when benchmarking different batch sizes: you will need to adjust the learning rate accordingly. Usually larger batch sizes require larger lrs, so if you change the batch size, it's always a good idea to run lr_find again.
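Something like this (a sketch using tsai's high-level API; names are from the docs and exact signatures may vary by version, with made-up data):

```python
from tsai.all import *
import numpy as np

X = np.random.rand(1000, 1, 100).astype(np.float32)  # made-up series
y = np.random.randint(0, 2, 1000)                    # made-up labels
splits = get_splits(y, valid_size=0.2)

dls = get_ts_dls(X, y, splits=splits, tfms=[None, TSClassification()], bs=2048)
learn = ts_learner(dls, metrics=accuracy)
learn.lr_find()   # re-check the learning rate whenever the batch size changes
```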