I tried a few different ways of comparing the diffused images against the original. Comparing the latent embeddings of the SD-generated images against those of the original image seemed to work as well as or better than the other methods I tried, and it doesn’t require another network to embed.
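For anyone who wants to try the latent comparison, here is a minimal sketch. It assumes the latents are already available as arrays (for example, VAE-encoder outputs of shape (4, 64, 64)). The function names are hypothetical, random arrays stand in for real latents, and cosine similarity is an assumed metric, since the post above does not say which one was used.

```python
import numpy as np


def latent_cosine_similarity(a, b):
    """Cosine similarity between two latent tensors, flattened to vectors."""
    a = np.asarray(a, dtype=np.float64).ravel()
    b = np.asarray(b, dtype=np.float64).ravel()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        return 0.0  # avoid division by zero for all-zero latents
    return float(np.dot(a, b) / denom)


def rank_by_latent_similarity(original, candidates):
    """Return candidate indices sorted from most to least similar, plus scores."""
    scores = [latent_cosine_similarity(original, c) for c in candidates]
    order = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return order, scores


# Stand-in data: random "latents" instead of real VAE outputs (assumption).
rng = np.random.default_rng(0)
original = rng.normal(size=(4, 64, 64))
# Three fake "generated" latents that drift from the original by different amounts.
candidates = [
    original + rng.normal(scale=s, size=original.shape) for s in (2.0, 0.1, 0.5)
]

order, scores = rank_by_latent_similarity(original, candidates)
print(order)  # indices of candidates, closest to the original first
```

With real images, `original` and `candidates` would be replaced by the latents produced by the Stable Diffusion VAE encoder; the ranking logic stays the same.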
Oh wow! Sounds like we more or less took the same direction, and that you did some amazing research here! Sorry I missed it (there is so much amazing work here that it’s hard to keep track of it all ;)). I will look at your notebook in more depth when I have more time. Thanks!
Guys, I was able to put my DiffEdit version on Gradio spaces, Diffedit - a Hugging Face Space by aayushmnit. Hugging Face Spaces graciously granted me a free GPU to run my app. Give it a spin.
Maybe I'm late to the party, but I was exploring redoing my portfolio website because I will take a sabbatical soon. I tried a GitHub profile README with workflows to pull tweets, and I like how it turned out. It’s free, clean, and feature-rich. If anybody wants to build a portfolio website, give a GitHub profile a try before moving to more complex options like GitHub Pages, Ghost, etc.
The aim of the method is to mix two different concepts in a semantic manner to synthesize a new concept while preserving the spatial layout and geometry.
The method takes an image that provides the layout semantics and a prompt that provides the content semantics for the mixing process.
Here are some examples I reproduced from the paper:
The method sometimes needs a bit of fiddling with the parameters to get the best result, but overall it was fun implementing it and reproducing the examples from the paper.
Here is the notebook of the implementation for anyone interested in trying it out.
First preprint! The first in a series of planned papers, explaining visualisations built on the logits from a vision encoder trained using self-supervised learning. The challenge/dataset was this one, and the project repo is here. Here is the paper:
Last weekend, my teammate and I won second place in a $50k AI Hackathon organized by AssemblyAI.
Our project was a web app that creates a toy story and its illustrations for kids based on photos of their favorite toy. More about the project here, and you can try the app here. The app is not in its best form yet, but we are planning to keep working on it, and I would be glad to get some feedback from you guys.
I am very grateful for the opportunities that attending the fast.ai course has given me so far. Thank you very much @jeremy!