I have a question about training time (maybe it's a bit silly). Why does the first epoch take 4-5 times longer than the others (please see the attached screenshot)?


We can’t answer that question without knowing your code and hardware. Sometimes the first epoch is slower (though you are showing an extreme example of that).
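If it helps to narrow this down, here is a minimal sketch for logging per-epoch wall-clock time so the slowdown can be put into numbers. It is framework-agnostic: `time_epochs` and `dummy_epoch` are hypothetical names, and `dummy_epoch` only simulates a one-time setup cost in the first epoch with `time.sleep`. Swap in your own training step.

```python
import time


def time_epochs(train_one_epoch, num_epochs):
    """Run train_one_epoch(epoch) num_epochs times and return seconds per epoch."""
    durations = []
    for epoch in range(num_epochs):
        start = time.perf_counter()
        train_one_epoch(epoch)
        durations.append(time.perf_counter() - start)
    return durations


def dummy_epoch(epoch):
    # Hypothetical stand-in for a real training epoch: the first call
    # sleeps longer to imitate a one-time warm-up cost.
    time.sleep(0.1 if epoch == 0 else 0.01)


if __name__ == "__main__":
    for i, seconds in enumerate(time_epochs(dummy_epoch, 3)):
        print(f"epoch {i}: {seconds:.3f}s")
```

If only the first epoch stands out, a one-time cost (for example GPU initialization or data caching) is a likely suspect. If every epoch is slow, look at the data pipeline or the hardware setup instead.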

I fixed the error, my CUDA Driver was not 9.2… =)
Everything looks smooth now =)