What's the difference between training on jpeg and png?

xiaoxin_yi · July 10, 2017, 4:30pm

I trained a CNN for classification using JPEG images. The accuracy dropped by 20% when predicting on PNG images. Has anyone come across this problem?

sibnick · July 11, 2017, 8:54am

JPEG compression adds artefacts to an image, while PNG is lossless and adds none. Your model was trained on images with those artefacts and expects them.
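A quick way to see the size of the effect is to push the same pixels through both formats and compare what comes back. This is only a sketch, assuming Pillow and NumPy are installed; the helper name `roundtrip`, the random test image, and `quality=90` are illustrative choices, not anything taken from the original training setup.

```python
import io

import numpy as np
from PIL import Image


def roundtrip(pixels: np.ndarray, fmt: str, **save_kwargs) -> np.ndarray:
    """Encode an RGB uint8 array to `fmt` in memory and decode it again."""
    buffer = io.BytesIO()
    Image.fromarray(pixels).save(buffer, format=fmt, **save_kwargs)
    buffer.seek(0)
    return np.asarray(Image.open(buffer).convert("RGB"))


# Synthetic test image: random noise is hard for JPEG to compress cleanly.
rng = np.random.default_rng(0)
original = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)

png_pixels = roundtrip(original, "PNG")
jpeg_pixels = roundtrip(original, "JPEG", quality=90)  # assumed quality setting

# Mean absolute difference per channel value, on the 0..255 scale.
png_error = np.abs(png_pixels.astype(np.int16) - original.astype(np.int16)).mean()
jpeg_error = np.abs(jpeg_pixels.astype(np.int16) - original.astype(np.int16)).mean()

print(f"PNG  mean abs error: {png_error:.2f}")
print(f"JPEG mean abs error: {jpeg_error:.2f}")
```

The PNG round trip returns the pixels unchanged, while the JPEG round trip does not. Natural photos will usually show a smaller JPEG error than random noise does.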

xiaoxin_yi · July 11, 2017, 9:20am

Thanks for your reply. How should I deal with it?

sibnick · July 13, 2017, 5:41am

You can either convert the PNG images to JPEG or retrain the network on PNG images (but do not try to convert the images from JPEG to PNG: the artefacts are already baked into the pixels and stay there).
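If the conversion should happen in the inference code rather than on disk, something like the following could work. It is a minimal sketch assuming Pillow; `reencode_as_jpeg` is a hypothetical helper name, and `quality=90` is a guess that should really be matched to whatever quality the training JPEGs were saved with.

```python
import io

from PIL import Image


def reencode_as_jpeg(image: Image.Image, quality: int = 90) -> Image.Image:
    """Return a copy of `image` after one lossy JPEG encode/decode cycle."""
    buffer = io.BytesIO()
    # JPEG has no alpha channel, so drop transparency before saving.
    image.convert("RGB").save(buffer, format="JPEG", quality=quality)
    buffer.seek(0)
    decoded = Image.open(buffer)
    decoded.load()  # read the pixel data now instead of lazily
    return decoded


# Stand-in for a decoded PNG; in practice this would come from Image.open(...).
png_like = Image.new("RGBA", (32, 32), (200, 30, 30, 255))
jpeg_like = reencode_as_jpeg(png_like)
print(jpeg_like.mode, jpeg_like.size)
```

The returned image can then go through the same resizing and normalisation as the training data.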

pietz · July 13, 2017, 12:16pm

JPEG is a lossy compression format. Although you may not be able to see the difference, the network might when applying hundreds of filters. A 20% drop in accuracy is still pretty extreme.

Does anybody have first-hand experience with how float16 and float32 compare? I usually use float16 to save space, but it may be a good idea to up my game.
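For the storage side of that question, a small NumPy check shows how much rounding each type adds to 8-bit pixel values scaled to [0, 1]. This is a sketch only, and it assumes that scaling convention; it says nothing about what happens to weights or gradients when training in float16.

```python
import numpy as np

# The 256 possible 8-bit intensities, scaled to [0, 1].
pixels = np.arange(256, dtype=np.float64) / 255.0

as_float32 = pixels.astype(np.float32)
as_float16 = pixels.astype(np.float16)

# Largest rounding error introduced by each storage type.
err32 = np.abs(as_float32.astype(np.float64) - pixels).max()
err16 = np.abs(as_float16.astype(np.float64) - pixels).max()

# Are all 256 intensity levels still distinguishable after the cast?
distinct16 = np.unique(as_float16).size

print(f"float32 max rounding error: {err32:.2e}")
print(f"float16 max rounding error: {err16:.2e}")
print(f"distinct float16 levels:    {distinct16} of 256")
```

float16 rounds more coarsely than float32, but every 8-bit level remains distinct, so storing input images in float16 loses very little.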

xiaoxin_yi · July 14, 2017, 1:21am

Is there any way to design a Keras layer that converts PNG to JPEG? Both kinds of images show up at inference time, so I could put such a layer on top of the trained model for the PNG images.

sibnick · July 14, 2017, 4:13am

That is not part of Keras. I would suggest converting all the PNG files to JPG from the command line: https://superuser.com/questions/71028/batch-converting-png-to-jpg-in-linux

pietz · July 14, 2017, 7:52am

Seems like you're suffering from neural-networks-will-solve-all-my-problems syndrome.

JPG and PNG are just image formats. They vanish once you read the data; all images are uncompressed tensors by then. Why would you convert PNGs to JPGs anyway, if we just told you this will hurt accuracy?

sibnick · July 14, 2017, 10:08am

It is necessary because the NN was trained on images with JPEG artefacts.

pietz · July 14, 2017, 11:06am

While in theory you could be right, I don't think that will make a difference.

sibnick · July 14, 2017, 2:40pm

Yes, most likely you are right. I just re-read the original question. I think PNG vs JPG was the wrong idea. Most likely it is over-fitting or an incorrect training set.

ZhangLi:
Did you try predicting on JPEG, or did you use JPEG only for training? In other words, is the 20% difference between JPG and PNG predictions, or between a JPG test set and PNG real-world data?
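One way to separate the two cases is to score the same images decoded from both formats, so that the only thing that changes is the encoding. The sketch below assumes Pillow and NumPy; `decode_via`, `accuracy_by_format`, and `toy_predict` are hypothetical names, and `toy_predict` is only a stand-in for a call to the real model.

```python
import io

import numpy as np
from PIL import Image


def decode_via(pixels: np.ndarray, fmt: str, **save_kwargs) -> np.ndarray:
    """Encode an RGB uint8 array to `fmt` in memory and decode it again."""
    buffer = io.BytesIO()
    Image.fromarray(pixels).save(buffer, format=fmt, **save_kwargs)
    buffer.seek(0)
    return np.asarray(Image.open(buffer).convert("RGB"))


def accuracy_by_format(images, labels, predict, jpeg_quality=90):
    """Score identical images after a PNG and after a JPEG round trip."""
    hits = {"PNG": 0, "JPEG": 0}
    for pixels, label in zip(images, labels):
        hits["PNG"] += int(predict(decode_via(pixels, "PNG")) == label)
        hits["JPEG"] += int(
            predict(decode_via(pixels, "JPEG", quality=jpeg_quality)) == label
        )
    return {fmt: count / len(labels) for fmt, count in hits.items()}


def toy_predict(pixels: np.ndarray) -> int:
    """Placeholder classifier: 1 for a bright image, 0 for a dark one."""
    return int(pixels.mean() > 127)


dark = np.full((16, 16, 3), 20, dtype=np.uint8)
bright = np.full((16, 16, 3), 230, dtype=np.uint8)
result = accuracy_by_format([dark, bright], [0, 1], toy_predict)
print(result)
```

If the real model scores about the same on both versions of the same images, the drop is more likely caused by the two test datasets differing than by the file format.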

xiaoxin_yi · July 14, 2017, 2:53pm

I only use JPEG for training and validation. I then test the model on JPG and PNG images from two different test datasets.

juan-widyaya · October 28, 2020, 9:08am

Do you have any reference about this one?