This paper seems to have flown a bit under the radar in August. It was put out by two authors from Google Cloud, and it proposes an attention-based network design for tabular data that appears to outperform tree-based models AND provide interpretability.
Seems too good to be true, but worth examining.
There are TensorFlow and PyTorch implementations of this network out there. Given how much tabular data is out there, I'm surprised this hasn't gotten more attention.
Yes I did, and I taught it in my course, see here:
If you're attention-focused and can't be convinced by FI (though I'd argue it's a pretty good idea if it makes sense to you), then use TabNet. Keep in mind that it may not be as accurate as the straight fastai model, and it will probably take much longer to train (in terms of epochs, though not necessarily seconds).
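For context, the FI being compared against TabNet's attention-based interpretability is tree-model feature importance. A minimal sketch of that baseline, using scikit-learn's `RandomForestClassifier` on a toy dataset (the dataset and parameters here are illustrative, not from the thread):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Toy tabular dataset, purely illustrative
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=3, random_state=0
)

rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X, y)

# Impurity-based feature importances: one value per column, summing to 1.
# This is the kind of FI signal the post suggests may already be "good enough"
# if it makes sense for your problem.
importances = rf.feature_importances_
for i, imp in enumerate(importances):
    print(f"feature {i}: {imp:.3f}")
```

TabNet instead produces per-sample attention masks over features, which is the interpretability claim that distinguishes it from this global, model-wide importance vector.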