Hi @dipam7,
I’ve take a look at you post: It’s not a good idea to use default image data augmentation on spectrograms (especially big rotations and vertical translations):
If you want some suggestion on sound data augmentation on see: Deep Learning with Audio Thread .