Regression with multiple Pictures

Hey, guys,

I want to use multiple frames of a video to predict a numerical result.
I wonder if there is already a template somewhere where they use multiple pictures for the regression.

Any help would be very welcome, thank you very much!