Quick question: resizing of the target image?

I am building an autoencoder with the ImageImageList class.
I am able to create the data, but I don’t get any resizing of the output.

pct = 0.1
sample = 5000
src = (ImageImageList.from_df(df=df_.sample(sample), path=PATH, folder=TRAIN_AUG, suffix='.png')
       .random_split_by_pct(pct, seed=42)
      )

def get_data(sz=64, bs=64):
    data = (src.label_from_func(lambda x: x)
            .transform(tfms, size=sz, tfms_y=True)
            .databunch(bs=bs).normalize(stats, do_y=True)
           )
    data.c = 3
    return data

This only applies the tfms; the resize is not applied to the output.

data = get_data(64, 2)
x,y = next(iter(data.train_dl))
>>(torch.Size([2, 3, 64, 64]), torch.Size([2, 3, 512, 512]))

Could you share how you defined your src? Just tested it and I got the right sizes.

Thanks for looking, you can check the full nb here:


I noticed the problem when I looked at the images with:

data.show_batch()

and then checking the tensor shapes confirms the issue.

Any ideas @sgugger?? I am on 1.34 btw.

There is no s in tfm_y=True. You just made a typo :wink:
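
For reference, this is the same get_data as above with just the typo fixed (tfm_y instead of tfms_y), nothing else changed:

def get_data(sz=64, bs=64):
    data = (src.label_from_func(lambda x: x)
            .transform(tfms, size=sz, tfm_y=True)   # tfm_y (no s), so the resize is applied to the target too
            .databunch(bs=bs).normalize(stats, do_y=True)
           )
    data.c = 3
    return data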

LOL! Why is there no warning or error? There is no autocomplete or shift+tab magic with the data block API. That’s something I miss. Actually, this line works:

    data = (src.label_from_func(lambda x: x)
#             .transform(tfms, size=sz, tfms_y=True)
            .transform(superluly, trump_donald=sz, sgugger=True)

Note that you get tab completion if you do it bit by bit (instead of in one line of code). As for transform, it accepts any kwargs for now, which is something I’m not sure how to fix (since there are several things that can be passed along).

Edit: It’s actually easier than I thought, preparing a fix for this.
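
A rough sketch of the same pipeline built step by step (reusing the names from the first post), so you can shift+tab each call and inspect the intermediate results:

src = ImageImageList.from_df(df=df_.sample(sample), path=PATH, folder=TRAIN_AUG, suffix='.png')
src = src.random_split_by_pct(pct, seed=42)
ll = src.label_from_func(lambda x: x)           # label each image with itself (autoencoder setup)
ll = ll.transform(tfms, size=64, tfm_y=True)    # transforms + resize applied to both x and y
data = ll.databunch(bs=2).normalize(stats, do_y=True)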

Tacking on to this: with tfm_y=True, are the transforms only applied to the training data during the training loop, and not to the validation data?

Thanks.

You specify two lists of transforms, one for the training set, one for the validation/test set.
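
In case it helps, this is roughly what that looks like with get_transforms(), which already returns such a pair (a loose sketch reusing the names from the first post):

tfms = get_transforms(max_rotate=10.)   # a tuple: (training transforms, validation/test transforms)
train_tfms, valid_tfms = tfms
data = (src.label_from_func(lambda x: x)
        .transform((train_tfms, valid_tfms), size=64, tfm_y=True)   # one list per split
        .databunch(bs=2).normalize(stats, do_y=True))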

I will do more tests tomorrow.

How can I resize x to size, and y to 2*size? I’m doing that for super resolution and I’m not using the unet, so I don’t need an upscaled input.

That’s a good question btw. I don’t know if this can be done on the fly; in the superres lesson they built a low_res folder and used the high_res images as labels.

Yup, and the low res gets upscaled to your target size, while the high res gets downscaled to it. Say I have a folder of 96x96-ish images, and a folder of the originals at 3000x3000-ish or something.
If I set size=256, the low-res folder will be upscaled to 256 and the high-res downscaled to 256. But what if I need the low res to stay at 96-ish, and the high res to be downscaled to 256?
I just don’t want to write a dataloader from scratch, especially if I’m going to use all the data augmentation transforms.
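
One thing that might be worth trying (untested, so treat the method name and behaviour as an assumption about the fastai v1 data block API): the label lists also expose a transform_y method that takes its own kwargs, which could let x and y get different sizes. A rough sketch, with path_lr/path_hr standing in as placeholders for your 96px and 3000px folders:

sz_lr, scale = 96, 2
src = ImageImageList.from_folder(path_lr).split_by_rand_pct(0.1, seed=42)   # path_lr/path_hr are placeholders
ll = src.label_from_func(lambda x: path_hr/x.name)
ll = ll.transform(get_transforms(), size=sz_lr)   # inputs transformed and resized to 96
ll = ll.transform_y(size=sz_lr * scale)           # targets resized to 192 (assuming transform_y works this way)
data = ll.databunch(bs=16).normalize(imagenet_stats, do_y=True)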

Hi, I’m having trouble with ImageImageList. It is setting the inputs and outputs to the same size.
@sgugger

src = ImageImageList.from_folder(path_lr).split_by_rand_pct(0.1, seed=42)
def get_data(bs,size):
    data = (src.label_from_func(lambda x: path_hr/x.name)
           .transform(get_transforms(max_zoom=2.), size=size, tfm_y=True)
           .databunch(bs=bs).normalize(imagenet_stats, do_y=True))

    data.c = 3
    return data

data = get_data(bs, sz_lr*scale)

data

ImageDataBunch;

Train: LabelList (6651 items)
x: ImageImageList
Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192)
y: ImageList
Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192)
Path: /root/.fastai/data/oxford-iiit-pet/small-96;

Valid: LabelList (739 items)
x: ImageImageList
Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192)
y: ImageList
Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192),Image (3, 192, 192)
Path: /root/.fastai/data/oxford-iiit-pet/small-96;

Test: None

Because ImageImageList now ties the size of x and y together, my SuperRes model output is scaled by a factor of two, so there is a mismatch:

Using a target size (torch.Size([16, 3, 96, 96])) that is different to the input size (torch.Size([16, 3, 192, 192])). This will likely lead to incorrect results due to broadcasting. Please ensure they have the same size.

@Prince if you notice, you’re using 200% zoom (and 2*96 is exactly 192). You should probably not use it, or make sure the sizes work out so they’re the same in and out (just as an idea of where to start debugging). You can also check the sizes quickly by looking at one batch (i.e. databunch.one_batch()[0].shape).
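
Spelled out, that check is just (assuming data is what get_data returned above):

xb, yb = data.one_batch()   # one_batch() returns an (x, y) pair of tensors
print(xb.shape, yb.shape)   # compare the input and target sizes directly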

I did that, it is still the same.

I even ran the 2019 course notebook on Super Resolution; it has the same bug.

Something is wrong inside the library.

ImageImageList is behaving the wrong way.
It is resizing both the input and output images to the same size in transform(),
and that’s the big problem, because it is supposed to keep the sizes of the images loaded in src.

If I don’t pass a size for the high-resolution images it sort of behaves as expected, but then the original images are too big compared to what my model will output.

You specify tfm_y=True, so yes, it applies the transforms to both the inputs and the targets. Remove the tfm_y=True to not apply them to the target.
And if you don’t want transforms applied to your inputs, you need to write a custom class, since your inputs are not traditional images as far as data augmentation is concerned. Just because ImageImageList does not behave the way you want doesn’t mean it behaves the wrong way.
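
To make that concrete, here is the get_data from above with tfm_y=True removed (just a sketch of what the suggestion means; note that the targets then keep whatever size they have on disk, so they need to already be a uniform size for batching to work):

def get_data(bs, size):
    data = (src.label_from_func(lambda x: path_hr/x.name)
            .transform(get_transforms(max_zoom=2.), size=size)   # no tfm_y: only the inputs are transformed/resized
            .databunch(bs=bs).normalize(imagenet_stats, do_y=True))
    data.c = 3
    return data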