I found this thread discussing the same issue:
The problem is attributed to underfitting. But I am still unconvinced as underfitting means that the model has high bias, systematically gives non-correct predictions. Instead here the model gives good prediction on the validation set.