Following what Jeremy does with a single feature in Lesson 2, I chose more features and tried to implement SGD from scratch. But when I compare the output after SGD with what my initial parameters gave, it still leaves a lot to be desired.
I tweaked the number of iterations and the learning rate (LR), but it didn’t change much.
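For reference, here is a minimal sketch of the kind of setup described above: mini-batch SGD on a linear regression with several features. It is not the actual notebook code. The data is synthetic, the names (`predict`, `mse`, `sgd`, `true_weights`) are made up for illustration, and it assumes all features are on a similar scale, with a constant 1.0 column standing in for the bias.

```python
import random

def predict(weights, row):
    # Linear model: dot product of weights and features.
    return sum(w * x for w, x in zip(weights, row))

def mse(weights, xs, ys):
    # Mean squared error over the whole dataset.
    return sum((predict(weights, r) - y) ** 2 for r, y in zip(xs, ys)) / len(ys)

def sgd(xs, ys, weights, lr=0.1, epochs=100, batch_size=10, seed=0):
    rng = random.Random(seed)
    weights = list(weights)
    indices = list(range(len(ys)))
    for _ in range(epochs):
        rng.shuffle(indices)  # new random mini-batches every epoch
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            errors = [predict(weights, xs[i]) - ys[i] for i in batch]
            # Gradient of the batch MSE with respect to each weight.
            grads = [
                2 / len(batch) * sum(e * xs[i][j] for e, i in zip(errors, batch))
                for j in range(len(weights))
            ]
            weights = [w - lr * g for w, g in zip(weights, grads)]
    return weights

# Synthetic data (assumption): two features in [-1, 1] plus a bias column.
data_rng = random.Random(42)
true_weights = [3.0, -2.0, 0.5]
xs = [[data_rng.uniform(-1, 1), data_rng.uniform(-1, 1), 1.0] for _ in range(100)]
ys = [predict(true_weights, r) + data_rng.gauss(0, 0.1) for r in xs]

initial_weights = [0.0, 0.0, 0.0]
initial_loss = mse(initial_weights, xs, ys)
final_weights = sgd(xs, ys, initial_weights)
final_loss = mse(final_weights, xs, ys)

print("loss before:", initial_loss)
print("loss after: ", final_loss)
print("weights:    ", final_weights)
```

Printing the loss before and after, as in the last lines, gives a direct check of whether the loop is improving on the initial parameters.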
Can someone please tell me what I’m missing here and why I can’t make my regression come up with better solutions?