SGD with warm restarts - should I use momentum?

I’m just wondering if it’s a good idea to use (Nesterov) momentum with cosine lr annealing. Thanks