Trying to create a model based on csv data using a the process that Jeremy uses in Intro to Machine learning: Lesson 1. Appears that the newer version of Fast.ai doesn’t include the structured import, and therefore doesn’t have the proc_df method for processing a data frame to convert categorical items into numbers.
I had this problem too, but figured it out. Copy paste the structured.py code below from original Fastai into one of your cells and run. It has all the functions that no longer are supported in DL1 course from the ML course like proc_df, train_cats, apply_cats, etc.
@petulla Instead we pass in ‘Categorify’ as a proc in the fastai tabular modules. This does the above and generates an embedding matrix for the categorical variables.
FillMissing and Normalize are the other two yes. I’d recommend lesson 4 of the Practical Deep Learning for Coders for a refresher on what’s new in that regards in the library.
Sorry to reactivate this.
What if you want to use the preprocessing chain provided by fastai with another model like a random forest which does not take databunches as input? All examples I found online used proc_df and then random forest from sklearn.
Try changing the model, you will find them with sklearn. But all in all, you have to first process the data with fastai, if you’re using fastai. And later, the model will anyway be followed with sklearn so try anymodel.
Also if I have interpreted your question correctly, proc_df is with the fastai only! You can use whatever model you wish depending upon what is the way it takes in the input.