Multi-label classification by Category?

caro · October 27, 2019, 8:40am

Hi all,

first of all, sorry if my question is trivial.

I’ve to guess different labels (around 50 labels in total) in different categories (around 10 categories) for a picture.

Is a way to configure The model with labels split by category?

Example with a Dress:

Let’s say I’ve to predict lable on a dress picture:

Category A:"Length"

Label 1: "Short"

Label 2: "Medium"

Label 3: "Long"

Category B:"Color Type"

Label 1: "Printed"

Label 2:"Unicolor"

If I put mix all labels together (like in Planet Classification),

I get sometimes “Short” and “Medium” as predictions for the same picture and nothing for Color Type, that it’s not my goal

So my question is, should I

1/ stay with what i did: create only one Model, mixing all labels, but my model is not accurate

2/ create one model for each category, one model A "Length" one model B "Color Type" but could be complicated and time consuming

3/ find another way to explain that labels are split by category ?

Any help ?

thanks a lot

Daniel.R.Armstrong · October 27, 2019, 12:08pm

I don’t know if this will help you, but if you look at this example: https://www.kaggle.com/nikkisharma536/fastai-toxic
At the bottom there is a table of predictions, you could group them based on your different categories, and return the one with the highest prediction value per group. I am thinking that would be the easiest way.

When we did it we trained different classifiers, and we trained and ran the prediction in a loop, my question for you is if you really need to do it as a multi-label classification problem. I think you can get a better understanding by creating 10 different models.

maral · October 27, 2019, 11:10pm

If you are primarily interested in classifying based on a combinations of properties then you can combine features and treat them as 0 = not all present, 1 = all present.

For example:

feature 1: cata_short_catb_printed
feature 2: cata_short_catb_unicolor
feature 3: cata_medium_catb_printed
…
feature N: cata_long_catb_unicolor

caro · October 28, 2019, 4:52pm

Thanks Daniel, thanks for your input, I’m just affraid that 10 models can be very slow for predictions

caro · October 28, 2019, 4:53pm

thanks but seems that we will have a lot of features combination without connection beetween

kushty · October 28, 2019, 6:53pm

Why do you think you will get more accuracy with different models? Wouldn’t you get the same accuracy with one model, where you just ignore predictions that aren’t relevant, as someone suggested?

amritv · October 28, 2019, 7:57pm

There is another way if I get what you are trying to accomplish. Using more labels inadvertently means there are more errors possible within the model. Here is an example I have been working on that could relate to what you want. This may point you in the right direction.

I have about 92 different overall labels and each of those 92 labels have about 5 sub labels. The data consisted of 92 folders each with their corresponding labels in this case are numbers corresponding to what I was working on. So for example folder 1 would be ‘000024463’, folder 2 ‘000024464’…etc (you can really set this to what is relevant to you)

I train the model and then test the model on an image (like normal):

And get a prediction:

Rightly, I get a prediction that corresponds to the label - in this case a number. I then load a json file (this file has all the additional characteristics of the images in each folder - in your case the json file will contain the length, color type etc) which corresponds to this number.

Load the file as before but also load the json file and note the additional info in the file

Now get a prediction for the same image:

but this time you get the additional characteristics. Hope that helps

github.com

asvcode/fastai_resources/blob/master/Using_Json_example.ipynb

{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import fastai\n",
    "from fastai.vision import *\n",
    "from fastai.widgets import *\n",
    "from fastai.callbacks import *\n",
    "from fastai.vision.gan import *\n",
    "\n",
    "from fastai.metrics import accuracy_thresh, top_k_accuracy, error_rate, FBeta, root_mean_squared_error, mean_squared_error, mean_absolute_error\n",
    "\n",
    "from torchvision.models import vgg16_bn"
   ]
  },
  {

This file has been truncated. show original

caro · October 29, 2019, 1:31pm

Thanks Amrit,
if i understand well, you predict a features A and you got all differents information about A
it’s not exactly what i’m looking for.
I’ve A-1 A-2 or A-3, so if i predict A, I’ve to predict also 1 or 2 or 3