Doing my first real project after completing the Fast.ai 2 book through CH 11
I have some company data which includes a column of tags.
I’m struggling to find a good way to compile a distinct list of all tags
so that I can 1 hot encode for each company by adding (X new columns)
Does Fast.ai, postgres, or pandas have an efficient way to do this?
This is a single column that contains a CSV list per row. I would like to expand it out for 100K+ rows
categories
Publishing,Social Media,Social Media Management
Cloud Computing,Collaboration,CRM,Developer Tools,Enterprise Software,Information Services,Information Technology,Network Security,Project Management,Software
Internet,Social Media,Social Network