I created a tabular_learner like this.
learn = tabular_learner(data, layers=, metrics=[accuracy], use_bn=False)
The input of this model has 200 columns, and I am trying to predict a single target variable (which is a probability). When I do learn.model, I get the output below.
The question is: why do I have out_features as 2 in the last layer? It should be 1 (since I am predicting one target value), shouldn't it?
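A sketch may help frame the question. fastai sizes the head of a tabular model from how the dependent variable is labeled: a target treated as categorical gets one output per class, while a target treated as continuous gets a single output. The helper below is purely illustrative (it is not fastai's actual code), but it captures that rule:

```python
# Illustrative sketch (NOT fastai's actual implementation): how the size of
# the final Linear layer follows from the dependent variable's type.

def head_out_features(targets, treat_as_continuous=False):
    """Hypothetical helper mimicking how the head size is chosen."""
    if treat_as_continuous:
        return 1                      # regression: one raw output
    return len(set(targets))          # classification: one logit per class

# a 0/1 target read as categorical gives two outputs...
print(head_out_features([0, 1, 1, 0]))                               # 2
# ...while a float target read as continuous gives one
print(head_out_features([0.1, 0.7, 0.3], treat_as_continuous=True))  # 1
```

So an integer 0/1 column that fastai labels as a category produces out_features=2, even though conceptually you are predicting "one value."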
Editing the code in lesson4-tabular.ipynb to have a continuous variable as output, everything looks normal:
df.age = df.age.astype('float')
dep_var = 'age'
cat_names = ['salary', 'workclass', 'education', 'marital-status', 'occupation', 'relationship', 'race']
cont_names = ['fnlwgt', 'education-num']
procs = [FillMissing, Categorify, Normalize]
test = TabularList.from_df(df.iloc[800:1000].copy(), path=path,
                           cat_names=cat_names, cont_names=cont_names)

data = (TabularList.from_df(df, path=path, cat_names=cat_names,
                            cont_names=cont_names, procs=procs)
                   .split_by_idx(list(range(800, 1000)))
                   .label_from_df(cols=dep_var)
                   .add_test(test)
                   .databunch())
learn = tabular_learner(data, layers=[200,100])
row = df.iloc[0]
>>> (<fastai.core.FloatItem at 0x1e802bda6d8>,
And looking at the layers:
(0): Embedding(3, 3)
(15): Linear(in_features=100, out_features=1, bias=True)
I suggest checking that the output you selected is actually a continuous variable, because if it is, you normally shouldn't be allowed to use accuracy as a metric for the tabular_learner.
My output is actually a probability. Could that be the reason the output feature is 2 (one for the probability of true and the other for the probability of false)?
Why do you say that for a continuous variable "you shouldn't be allowed to have accuracy as a metric"?
I would say that a probability is a continuous variable from 0.0 to 1.0. If your output is either 0 or 1, it's a categorical variable, and therefore it's normal to have 2 outputs, one for each possible class.
And the output contains the predicted class and the probability of each of the possible classes.
It's giving you the probabilities of '0' and '1', for example. So when you call learn.get_preds, it gives you a tensor where the first value is the probability of '0' and the second value is the probability of '1'.
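Those two numbers are just a softmax over the two raw outputs of the final Linear layer. A minimal plain-Python sketch (not fastai code; the logits are made up):

```python
import math

def softmax(logits):
    """Standard softmax: exponentiate each logit, then normalize to sum to 1."""
    exps = [math.exp(x) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# two raw outputs from a head with out_features=2 (made-up numbers)
probs = softmax([0.2, -0.3])
print(probs)        # first value ~ P('0'), second ~ P('1')
print(sum(probs))   # sums to 1 (up to floating-point error)
```

This is why a two-output head can still represent "one" probability: each column is the probability of one of the two classes, and the columns are constrained to sum to 1.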
@rpicatoste, @travis Thanks for the inputs. A few further questions.
I thought the probability of zero is (1 - the probability of one).
Consider a scenario where Y = 1. How does the model compute the loss between Ŷ and Y with these 2 output features?
If you can also point me to the code, that would be great.
I think that softmax is automatically applied, so both probabilities should add up to 1. Do you see something different?
You can run doc(BCEFlat) in a notebook and it will take you to the fastai docs for the loss function. BCEFlat is built on PyTorch's nn.BCELoss. The fastai docs are wonderful, with links directly to the source code.
It calculates the loss between Ŷ and Y for each class (or output feature). So if your target is '0' or '1', you have two classes. As your model trains, it works to minimize that loss for each prediction for each class.
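To make the "loss per class" idea concrete: with two softmaxed outputs, cross entropy simply penalizes the probability the model assigned to the true class. A plain-Python sketch with made-up probabilities (not the thread's actual loss code):

```python
import math

def cross_entropy(probs, target_class):
    """Negative log-likelihood of the true class."""
    return -math.log(probs[target_class])

probs = [0.4, 0.6]                  # P('0'), P('1') after softmax (made up)
print(cross_entropy(probs, 1))      # smaller loss: model favors class 1
print(cross_entropy(probs, 0))      # larger loss: model disfavors class 0
```

So even with Y = 1, the two output features are not wasted: pushing the probability of class '1' up automatically pushes the probability of class '0' down, and the loss reflects both.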
Yes. They add up to 1. Thanks.
I was having trouble running this model against a different CSV file, so I went back to the original. When I run a copy of that, I get results identical to what was shown in the lesson, except for learn.predict. The prediction is wrong, and the tensor values seem to be the reverse of what they should be. I added the row command to make sure.
row = df.iloc[0]
Name: 0, dtype: object
(Category <50k, tensor(0), tensor([0.5364, 0.4636]))
Running environment is Win10, CPU-only, with fastai 1.0 and Anaconda Python 3.7, all freshly installed and updated.
Question: What is Tabular Learner?
I want to ask a more general question: what is fastai's tabular learner? What actually goes in, and what comes out of it? How, and where in the model, do categorical and continuous variables combine, and how are they treated individually in the earlier layers?
If someone can provide an explanation, or a link to one, that would be great!
@PalaashAgrawal Disregarding the word engineering, here is the tabular model in a nutshell.
Now we can see that our continuous variables go through batch normalization, while the categorical ones go first into an embedding matrix (a lookup table of sorts) before any dropout is applied. Afterwards they get concatenated and fed through 2-3 fully connected LinBnDrop layers. This example has my layer selection as [1000, 500]. The visualization was made with the fastdot library.
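The shape of that pipeline can be sketched in plain Python. This is a heavily simplified stand-in (dropout omitted, batchnorm replaced by plain normalization, all weights and table sizes made up), not fastai's TabularModel itself:

```python
import math

def embed(index, table):
    # an embedding is just a row lookup in a learned table
    return table[index]

def normalize(xs):
    # stand-in for BatchNorm1d on the continuous features
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return [(x - mean) / math.sqrt(var + 1e-5) for x in xs]

def linear(x, weights, bias):
    # fully connected layer: one dot product per output unit
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

# one row: two categorical features and two continuous features (made up)
emb_tables = {
    'workclass': [[0.1, -0.2], [0.3, 0.0]],   # vocab 2, embedding size 2
    'education': [[0.5], [-0.5], [0.2]],      # vocab 3, embedding size 1
}
cat_idx = {'workclass': 1, 'education': 2}
cont = [39.0, 13.0]

emb_out = embed(cat_idx['workclass'], emb_tables['workclass']) + \
          embed(cat_idx['education'], emb_tables['education'])
x = emb_out + normalize(cont)       # concatenate categorical and continuous

h = linear(x, [[0.1] * 5, [-0.1] * 5], [0.0, 0.0])     # tiny hidden layer
out = linear(h, [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])  # 2-class head
print(len(out))                     # 2: one raw output per class
```

The key structural point survives the simplification: categoricals become dense vectors via lookup, continuous features are normalized, and only after concatenation does the shared fully connected body see them together.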
Thank you so much @muellerzr. It's very helpful, and just what I needed.