Halfway through the first course, I’m trying to find the right words to describe what I’m trying to do.
I’d like to take a video and replace a certain aspect of it with its nearest approximation in a separate corpus of video.
For example, I’d like to take a video of someone, identify their arm, and replace it with a gorilla’s arm drawn from many videos of gorillas.
I understand the first step would be an image classifier that identifies the subject to be replaced, but I’m not sure about the next steps. Is this concatenative synthesis, or something else? Has anyone seen this kind of thing implemented before?
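To make the "nearest approximation" step a bit more concrete, here is a minimal sketch of it as a nearest-neighbour lookup. It assumes each cropped region (the arm in the source video, and every candidate arm from the gorilla corpus) has already been turned into a feature vector by some model; the toy vectors and the names `cosine_similarity` and `nearest_in_corpus` are made up for illustration and are not from the course or any library.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_a * norm_b)

def nearest_in_corpus(query, corpus):
    """Return (index, similarity) of the corpus vector most similar to query."""
    best_index, best_score = -1, -math.inf
    for i, candidate in enumerate(corpus):
        score = cosine_similarity(query, candidate)
        if score > best_score:
            best_index, best_score = i, score
    return best_index, best_score

# Hypothetical descriptors: one per gorilla-arm crop in the corpus.
gorilla_arm_features = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
]

# Hypothetical descriptor for the arm detected in one frame of the source video.
human_arm_feature = [0.9, 0.1, 0.0]

index, score = nearest_in_corpus(human_arm_feature, gorilla_arm_features)
print(f"best match: corpus item {index} (similarity {score:.3f})")
```

In a real pipeline the lookup would run once per frame, and the matched crop would still need to be warped and blended into the frame, which this sketch does not attempt.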
I am halfway through the course as well. Although I have watched all the videos a few times, making an app for each lesson means that some of the lessons take me months, not weeks. I am currently trying to deploy 7 of my apps on Docker for Mac.
I plan to create an app like the one you describe as my final project, to celebrate completing part 1 of the fast.ai course Practical Deep Learning for Coders.