
Simple Linear Regression not converging

In my attempt to dig deeper into the math behind machine learning models, I'm implementing an Ordinary Least Squares algorithm in Python, using vectorization. My references are:

This is what I have now:

import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
from sklearn import datasets
from sklearn.preprocessing import StandardScaler

%matplotlib inline

X, y = datasets.load_diabetes(return_X_y=True)

# We only take the first feature (for visualization purposes).
X = X[:, np.newaxis, 2]

# Split the data into training/testing sets
X_train = X[:-20]
X_test = X[-20:]
y_train = y[:-20]
y_test = y[-20:]

# Input data
sns.scatterplot(
    x=X_train[:, 0],
    y=y_train,
    label="train",
    edgecolor=None,
    color="blue"
)
# To predict
sns.scatterplot(
    x=X_test[:, 0],
    y=y_test,
    label="test",
    edgecolor=None,
    marker="*",
    color="red",
);

class LinearRegression:
    """
    Ordinary least squares Linear Regression.

    Args:
 
    """

    def __init__(self, learning_rate: float = 0.01, tolerance: float = 1e4, standardize: bool = True):
        # TODO: standardize if required
        self._learning_rate: float = learning_rate
        self._tolerance: float = tolerance
        self._standardize: bool = standardize
        self._fitted: bool = False
        

    def fit(self, X: np.ndarray, y: np.ndarray) -> None:
        """Fit linear model."""
        self._X: np.ndarray = X
        self._y: np.ndarray = y[:, np.newaxis]
        self._m, self._n = self._X.shape  # rows, features
        self._weights: np.ndarray = np.zeros((self._n, 1))
            
        self._train()

    def predict(self, X: np.ndarray, add_bias: bool = True) -> np.ndarray:
        """Predict using the linear model."""
        assert self._fitted, "Model not fitted."
        if add_bias:
            X = np.c_[np.ones((X.shape[0], 1)), X]
        
        predictions = np.dot(X, self._weights)
        return predictions

    def _train(self) -> None:
        """
        Generate the clusters from the traning data.

        Algorithm:
            1. Initiliaze weights.
            2. Compute the cost.
            3. Calculate the gradient.
            4. Update weights.
            4. Repeat from 2 until convergence.
        """
        # Add bias term
        self._X = np.c_[np.ones((self._m, 1)), self._X]
        self._weights = np.r_[np.ones((1, 1)), self._weights]
        
        self._fitted = True
        
        converged = False
        iterations = 0
        while not converged:
            iterations += 1
            y_hat = self.predict(self._X, add_bias=False)
            residuals = self._residuals(self._y, y_hat)
            gradients = self._gradients(self._X, residuals)
            self._weights -= self._learning_rate * gradients
                                       
            gradient_magnitude = np.linalg.norm(gradients)
            print(gradient_magnitude)
            if gradient_magnitude < self._tolerance:
                converged = True
                
            print(self._weights)
            print(iterations)
    
    def _residuals(self, y: np.ndarray, y_hat: np.ndarray) -> np.ndarray:
        residuals = y - y_hat
        return residuals
    
    def _gradients(self, X: np.ndarray, residuals: np.ndarray) -> np.ndarray:
        gradients = -2 * np.dot(X.T, residuals)
        return gradients

scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)

clf = LinearRegression()
clf.fit(X_train, y_train)

The problem I’m facing is that my weights keep increasing until I end up with a bunch of NaNs. I’ve been trying to find out what I’m missing, but so far no luck. I also tried tweaking the tolerance threshold, but I don’t think that’s the issue; I suspect something is wrong with my math.


Answer

Your code actually works fine; the only problem is the learning rate, really! Just reduce it from 0.01 to e.g. 0.0001 and everything works (well, I would also reduce the tolerance to something much, much smaller, like 1e-5, to make sure it actually converges to the right solution).

Small image showing that it works:

clf = LinearRegression(learning_rate=0.0001)
clf.fit(X_train, y_train)
b, m = clf._weights[:, 0]
plt.scatter(X_train[:, 0], y_train)
plt.plot([-2, 4], [-2 * m + b, 4 * m + b])

gives

[plot: scatter of the training data with the fitted regression line]
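
If you also want predictions on the held-out points, here is a minimal sketch (assuming the scaler, X_test, and y_test from the question are still in scope) that additionally lowers the tolerance as suggested above; note that the test features must be transformed with the same scaler that was fitted on the training set:

clf = LinearRegression(learning_rate=0.0001, tolerance=1e-5)
clf.fit(X_train, y_train)

# Reuse the scaler fitted on X_train; do not fit a new one on the test set.
X_test_scaled = scaler.transform(X_test)

# predict() with add_bias=True prepends the column of ones for the intercept.
y_pred = clf.predict(X_test_scaled)

plt.scatter(X_test_scaled[:, 0], y_test, color="red", marker="*", label="test")
plt.scatter(X_test_scaled[:, 0], y_pred[:, 0], color="green", label="predicted")
plt.legend()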

Linear regression is a convex optimization problem, so you can imagine it like putting a ball on a parabola and then moving it towards the bottom by a fixed step size multiplied by the slope at your current position. If that step size is small enough, you get closer and closer to the bottom until you find the optimum position. But if it is too large, you jump from one side of the parabola to the other, and if it’s large enough you land in a place which is actually higher than where you started from. Iterate this a few times and you end up in exactly the situation you describe…
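
To make the overshooting concrete, here is a toy sketch of gradient descent on the one-dimensional parabola f(x) = x**2 (the descend helper is just for illustration): any step size below 1 moves the ball towards the minimum, while a step size above 1 makes every step land higher than the previous one.

def descend(step_size, x=3.0, steps=10):
    # Gradient descent on f(x) = x**2; the gradient at x is 2*x.
    for _ in range(steps):
        x -= step_size * 2 * x
    return x

print(descend(step_size=0.1))  # x shrinks geometrically towards the minimum at 0
print(descend(step_size=1.1))  # x overshoots and its magnitude grows every step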
