Hello everyone, I was wondering, has anyone had any experience with record linkage (entity deduplication) using fastai?
In my case I am working on detecting duplicate products, a common way of doing this is via text similarity using siamese networks, but im sure if I managed to combine the product image + the text it would yield better results.
This paper, Merging Datasets through Deep Learning, could be a good starting point. It sounds like they pursued the implementation (not using fastai) that you’re describing