Implemented SGD from scratch for 5 features like in Lesson 2. How do I optimize?

As Jeremy does with a single feature in Lesson 2. I choose more features and tried to implement SGD from scratch but as the o/p post SGD compares with my initial parameters, there is still a lot of be desired.
I tweaked the number of loops and the LR but it didn’t change much.
Can someone please tell me what I’m missing here and why I can’t make my regression come up with better solutions?

Thank You!

1 Like