Entity Resolution(Record Linkage) - Deep Learning

Need to clean up the master data - Address and Customer. Address majorly text, there are many duplicate records available corresponding to a customer with similar looking address. Can these texts be cleaned using ML/DL technique, i have looked through papers in this space but couldn’t find very relevant papers. In case any one has any experience of cleaning master data, please share some thoughts and if possible guide to some relevant research papers.

Many thanks for reading it.

Have you seen the Dedupe Python library? It sounds like it may be a good fit for your situation.

Dedupe

Hi - have you found a good solution for your problem?