So in short, we have..
Code to generate images from noise (can probably be extended to sketches?)
Code to colorize sketches
Code to go from sketches to images with Pix2Pix and DiscoGAN
Code to take colorized sketches and generate images matching those sketches
Code To predict the next sequence in a cartoon game -- PacMan
Code to generate images from text with StackGAN
It seems like we can combine these to build something interesting. Text --> Sketch --> Animation --> Video?
Charades is dataset composed of 9848 videos of daily indoors activities collected through Amazon Mechanical Turk. 267 different users were presented with a sentence, that includes objects and actions from a fixed vocabulary, and they recorded a video acting out the sentence (like in a game of Charades). The dataset contains 66,500 temporal annotations for 157 action classes, 41,104 labels for 46 object classes, and 27,847 textual descriptions of the videos.
UCF101 is an action recognition data set of realistic action videos, collected from YouTube, having 101 action categories. This data set is an extension of UCF50 data set which has 50 action categories.
Human Face Sketches and Images
3D Semantic Segmentation of Cities
Vintage Cartoons Dataset
Optical Flow Dataset
Celebrity Cartoon Faces
Github Download Script