I’m trying to predict 4 classes of video dataset. my videos are composed from 32 frames saved as images on my hard disk.
The dataset is structured as follows:
- Label 1:
- Video 1 Folder:
I tried writing my own VideoClassifierData which is implementing all the methods in BaseDataset. (if needed i will post my entire code of the VideoClassifierData)
My dataset works, it returns a list of numpy images of the frames as a video.
When i’m passing my VideoClassifierData to ConvLearner and fitting the model, i’m getting an error that the output is not 1 dimensional array when trying to create a variable in model.py (loss = stepper.step(V(x),V(y)))
Anyone has insight regarding this issue? or maybe direct me to what i’m doing wrong?
Thanks for the reply.
First, i’m using the fastai framework to train my network so I built my own VideoClassifierData similar to ImageClassifierData that just loading video instead of single image, that’s why the code above not really helps me in this current situation.
Second, ere you can see the problem i’m having: https://github.com/yana25/video_classification
just install the fast.ai library using the link in the readme and run “fastai_video_train.py” and see the error i’m