Resizing image segmentation masks introduces new classes because of pixel value interpolation!

I was working on a pixel-level segmentation project with multiple classes using fastai. So in my example, there are 4 classes, and the information is encoded as integers 0 to 3.

I recently discovered that when I resize my image in OpenCV, it interpolates pixel values. So for example, my original mask may only have 0s and 3s, but after resizing I'm getting a small number of 1s and 2s. For a regular image this would be fine, but this is a segmentation mask, where a different integer means a different class entirely, and I'm wondering if this is messing up my model.
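To make the effect concrete, here's a quick synthetic check (the sizes and values are just an example, not my real data):

import numpy as np
import cv2

# Synthetic mask that only contains classes 0 and 3
mask = np.zeros((100, 100), dtype=np.uint8)
mask[40:60, 40:60] = 3

# Default (bilinear) interpolation blends neighbouring pixels, so the
# resized mask can contain values that were never valid classes
bad = cv2.resize(mask, (224, 224), interpolation=cv2.INTER_LINEAR)
print(np.unique(bad))    # e.g. [0 1 2 3] -- phantom classes 1 and 2

# Nearest-neighbour only copies existing pixel values, so the label set is preserved
good = cv2.resize(mask, (224, 224), interpolation=cv2.INTER_NEAREST)
print(np.unique(good))   # [0 3]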

Has anyone run into this issue? FYI, this resizing was done outside fastai using my own scripts, and now I’m wondering if the fastai resizing operations also introduce this artifact???

Fastai applies transformations to the ImageSegment class differently to prevent those issues. You can see in the implementation that it changes the interpolation mode to nearest for masks.
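Roughly, the effect is as if the resample filter were picked based on the item type; a minimal sketch of that behaviour (not the actual fastai source):

from PIL import Image
from fastai.vision.core import PILMask

def resize_item(o, size):
    # Masks hold integer class labels, so they must use NEAREST;
    # ordinary images can be interpolated smoothly with BILINEAR
    resample = Image.NEAREST if isinstance(o, PILMask) else Image.BILINEAR
    return o.resize(size, resample=resample)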

I think I'm having this issue right now, even though my encodes() function says that the unique values in that specific mask are only [0, 2]. The major suspect at this point is the resize function.

Can you post your DataBlock/DataLoader code?

Obviously it was my fault (as usual). I had to do a little trick to load RGB segmentation masks and convert them on the fly to grayscale. I wanted this so I could look at the masks on disk "with the naked eye": if you keep them in grayscale… well, you just see black. Anyway, I was doing this:

[screenshot of the custom encodes() transform that loads RGB masks and converts them to grayscale]

but it was not sufficient: Resize() in fastai/vision/augment.py did not consider those PILMasks to be true fastai.vision.core.PILMasks, so from the tuple resamples=(Image.BILINEAR, Image.NEAREST) it applied Image.BILINEAR to my "false PILMasks". I just added a cast to fastai.vision.core.PILMask at the end of my encodes() function, and now it seems that Resize() behaves correctly.
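For anyone hitting the same thing, the fix looks roughly like this (the class name and the grayscale conversion are made up for illustration, not my exact code):

from pathlib import Path
from PIL import Image
from fastcore.transform import Transform
from fastai.vision.core import PILMask

class GrayMaskFromRGB(Transform):
    "Hypothetical sketch: open an RGB-coded mask and hand fastai a true PILMask"
    def encodes(self, fn: Path):
        rgb = Image.open(fn).convert('RGB')
        m = rgb.convert('L')  # stand-in for whatever maps RGB colours back to class ints
        # The important bit: without the cast, Resize() sees a plain PIL image and
        # applies Image.BILINEAR; as a PILMask it gets Image.NEAREST instead
        return PILMask(m)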

Don't know if it will help, but there is a custom resizer in the segmentation example:

from fastai.vision.all import *   # Transform, PILImage, PILMask, is_listy
from PIL import Image

class CustomResizer(Transform):
    order = 1
    "CustomResizer that resizes images with BILINEAR and masks with NEAREST"
    def __init__(self, size, resample=Image.BILINEAR):
        # Accept a single int or a (height, width) pair; PIL wants (width, height)
        if not is_listy(size): size = (size, size)
        self.size, self.resample = (size[1], size[0]), resample

    # Transform dispatches on the type annotation: images get the smooth filter...
    def encodes(self, o: PILImage): return o.resize(size=self.size, resample=self.resample)
    # ...while masks always get NEAREST so no new class values are interpolated in
    def encodes(self, o: PILMask):  return o.resize(size=self.size, resample=Image.NEAREST)

I got this from here: Tutorial - Custom transforms | fastai
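For context, a transform like this slots into the item_tfms of a DataBlock. A rough usage sketch with the small CamVid sample that ships with fastai (the dataset choice and batch size are just for illustration):

from fastai.vision.all import *

path = untar_data(URLs.CAMVID_TINY)                  # tiny public segmentation dataset
codes = np.loadtxt(path/'codes.txt', dtype=str)      # class names for MaskBlock

camvid = DataBlock(
    blocks=(ImageBlock, MaskBlock(codes)),
    get_items=get_image_files,
    get_y=lambda o: path/'labels'/f'{o.stem}_P{o.suffix}',  # CamVid mask-naming scheme
    item_tfms=CustomResizer(224))                    # images -> BILINEAR, masks -> NEAREST
dls = camvid.dataloaders(path/'images', bs=8)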