Need to clean up the master data - Address and Customer. Address majorly text, there are many duplicate records available corresponding to a customer with similar looking address. Can these texts be cleaned using ML/DL technique, i have looked through papers in this space but couldn’t find very relevant papers. In case any one has any experience of cleaning master data, please share some thoughts and if possible guide to some relevant research papers.
Many thanks for reading it.