Does overshooting really happen in Gradient Descent, or does it just circle around the minima?

Most of us have seen this video from Andrew Ng's Machine Learning course (Gradient Descent in Practice II: Learning Rate by Andrew Ng):

Figure: slide from Andrew Ng's lecture on learning rates.
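Before digging into the question, it helps to see both behaviors numerically. The sketch below (my own toy example, not from the lecture) runs plain gradient descent on f(x) = x², whose gradient is 2x, with three learning rates: one that converges smoothly, one that overshoots past the minimum every step yet still converges, and one that overshoots so badly the iterates diverge:

```python
def gradient_descent(x0, lr, steps):
    """Run gradient descent on f(x) = x**2 (so f'(x) = 2*x) and return all iterates."""
    xs = [x0]
    x = x0
    for _ in range(steps):
        x = x - lr * 2 * x  # update: x <- x * (1 - 2*lr)
        xs.append(x)
    return xs

# lr = 0.1: each step multiplies x by 0.8 -> smooth, monotone convergence to 0
smooth = gradient_descent(1.0, 0.1, 50)

# lr = 0.8: each step multiplies x by -0.6 -> overshoots past 0 every step
# (sign flips), but the magnitude still shrinks, so it converges
oscillating = gradient_descent(1.0, 0.8, 50)

# lr = 1.1: each step multiplies x by -1.2 -> overshoots further each time
# and diverges
diverging = gradient_descent(1.0, 1.1, 50)
```

On this quadratic the update is exactly x ← x·(1 − 2·lr), so the three regimes are easy to read off: |1 − 2·lr| < 1 converges, and overshooting (a negative factor) is compatible with either convergence or divergence depending on the magnitude.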