Other datasets to predict bounding boxes on

wdhorton · March 23, 2018, 12:19am

I know that a good way to practice what we’ve learned is applying it to other datasets. I remembered that in the old part 1, Jeremy had done a notebook about the Kaggle fisheries competition, which someone labeled with bounding boxes to aid in classification (https://github.com/fastai/courses/blob/master/deeplearning1/nbs/lesson7.ipynb) Might be interesting to try to apply what we’re learning now to that same task!

jeremy · March 23, 2018, 12:23am

The most popular one is COCO: http://cocodataset.org/#home

jamesrequa · March 23, 2018, 12:43am

If you’re feeling up to it, there’s also ImageNet Object Localization Challenge which just opened up for submissions on Kaggle.

There’s also Kaggle’s Data Science Bowl. Although you’ll need to generate masks as the final output (something we haven’t covered as of yet).

suvash · March 23, 2018, 3:23pm

I’m hoping Jeremy will touch on this at some point, as it seems to me like this (generating masks) involves classification of every pixel ( semantic segmentation perhaps ? )

wdhorton · March 23, 2018, 3:34pm

I’m pretty sure we’re going to be covering segmentation, since there’s a notebook for the Carvana competition. Although from what I’ve read about the Data Science Bowl 2018, the problem is instance segmentation rather than semantic segmentation.

suvash · March 23, 2018, 3:40pm

That’s true. I should be looking at other notebooks as well

tensoralex · March 24, 2018, 3:34pm

lesson 8 on COCO 2017 dataset: https://github.com/tensoralex/misc_notebooks/blob/master/fastai_dl2_L8_coco_2017.ipynb

radek · March 24, 2018, 3:38pm

Nice @tensoralex! You beat me to it I am also planning on redoing lesson 8 on the coco 2017 dataset

tensoralex · March 24, 2018, 3:48pm

i also wanted to try run it on resnext50, but couldn’t make it work yesterday night.

suvash · March 24, 2018, 5:09pm

You mention the datasets links at the top. I’m assuming you only used the validation dataset zip for both validation and testing (as done in the lecture), or did you make use of the other downloads as well ?

tensoralex · March 24, 2018, 5:12pm

yeah, i haven’t used “test2017.zip”

suvash · March 24, 2018, 5:17pm

okies. i was thinking of giving it a go with the other pascal voc dataset. maybe this evening.

suvash · March 24, 2018, 6:05pm

btw, if I understood the online docs correctly, tqdm.monitor_interval = 0 disables the monitor thread.(Does this have a side effect of disabling the RuntimeError: Set changed size during iteration warning.

Seems to me like multiple threads are updating the tqdm counter for the warning to pop up so often(at least on my notebooks), but I’m not really sure.

tensoralex · March 24, 2018, 6:08pm

that what i was trying to fix “RuntimeError: Set changed size during iteration warning.” by setting tqdm.monitor_interval = 0
Did not help.

suvash · March 24, 2018, 6:12pm

Agreed. Doesn’t help.

jeremy · March 24, 2018, 10:22pm

Excellent!

Moody · March 28, 2018, 6:18am

OpenImages dataset is mentioned in YOLOv3 paper.
https://github.com/openimages

surmenok · March 28, 2018, 5:46pm

I try to use this one to detect cars and other object on the road: http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=2d

adrian · April 2, 2018, 11:31pm

Lesson-8 applied to the tiny imagenet dataset here:

github.com

adriangrepo/my_fastai/blob/master/dl2/pascal_tiny_imagenet.ipynb

{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Lesson8 and 9 pascal object detecion applied to Tiny ImageNet Visual Recognition Challenge\n",
    "https://tiny-imagenet.herokuapp.com/\n",
    "\n",
    "\"*Tiny Imagenet has 200 classes. Each class has 500 training images, 50 validation images, and 50 test images. We have released the training and validation sets with images and annotations. We provide both class labels and bounding boxes as annotations; however, you are asked only to predict the class label of each image without localizing the objects. The test set is released without labels*\"\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {
    "ExecuteTime": {
     "end_time": "2018-03-30T13:04:00.727976Z",
     "start_time": "2018-03-30T13:04:00.545471Z"
    }

This file has been truncated. show original

I had a few issues with bounding boxes and the geometry of the Linear layer in the new head, is working fairly well but needs a bit more qc.