Hi I’m confused about the process in the data block API. does the data block API transform the categorical variables into One-Hot-Encoded vectors? when I print: data.show_batch it looks like the categorical variables are now pandas categories. Is there another step in the data block API that …

One Hot Encoding

Part 1 (2019)

knesgood (Kyle Nesgood) December 9, 2019, 4:12pm 2

No - nothing is being one-hot encoded. Instead, each categorical column is given it’s own embedding matrix in which it learns robust representations of each item. Check out lesson five or look at the lesson notes here: