Training 4.5 million parameters takes 30 minutes per epoch?

I don’t know. Try a simpler loss and see if that helps. VGGPerceptualLoss is quite complicated.
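
If it helps, here is a rough way to check whether the loss is what's slowing things down: time one training step with each loss and compare. This is only a sketch. `mean_seconds_per_call`, `step_with_simple_loss`, and `step_with_heavy_loss` are made-up names, and the two step functions are stand-ins (the "heavy" one just sleeps to imitate extra work), not your actual model or the real VGGPerceptualLoss. Replace them with your own training step using each loss.

```python
import time


def mean_seconds_per_call(step_fn, n_calls=5, warmup=1):
    """Return the average wall-clock seconds taken by one call of step_fn."""
    # Warm-up calls are excluded so one-time setup cost does not skew the result.
    for _ in range(warmup):
        step_fn()
    start = time.perf_counter()
    for _ in range(n_calls):
        step_fn()
    return (time.perf_counter() - start) / n_calls


# Hypothetical stand-in: a step that uses a cheap L1-style loss on toy data.
def step_with_simple_loss():
    pred, target = [0.1, 0.5, 0.9], [0.0, 0.5, 1.0]
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


# Hypothetical stand-in: the same step plus a sleep that imitates the extra
# work a more complicated loss might add. The 10 ms figure is arbitrary.
def step_with_heavy_loss():
    time.sleep(0.01)
    return step_with_simple_loss()


simple_t = mean_seconds_per_call(step_with_simple_loss)
heavy_t = mean_seconds_per_call(step_with_heavy_loss)
print(f"simple loss: {simple_t:.6f} s/step")
print(f"heavy loss:  {heavy_t:.6f} s/step")
```

If the two timings come out about the same with your real steps, the loss is probably not the bottleneck and it's worth looking at data loading or the model itself. One caveat if you are on a GPU with PyTorch: kernels run asynchronously, so call `torch.cuda.synchronize()` before reading the clock or the numbers will be misleading.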