python divide by zero encountered in log – logistic regression

I’m trying to implement a multiclass logistic regression classifier that distinguishes between k different classes.

This is my code:

import numpy as np
from scipy.special import expit


def cost(X,y,theta,regTerm):
    (m,n) = X.shape
    J = (np.dot(-(y.T),np.log(expit(np.dot(X,theta))))-np.dot((np.ones((m,1))-y).T,np.log(np.ones((m,1)) - (expit(np.dot(X,theta))).reshape((m,1))))) / m + (regTerm / (2 * m)) * np.linalg.norm(theta[1:])
    return J

def gradient(X,y,theta,regTerm):
    (m,n) = X.shape
    grad = np.dot(((expit(np.dot(X,theta))).reshape(m,1) - y).T,X)/m + (np.concatenate(([0],theta[1:].T),axis=0)).reshape(1,n)
    return np.asarray(grad)

def train(X,y,regTerm,learnRate,epsilon,k):
    (m,n) = X.shape
    theta = np.zeros((k,n))
    for i in range(0,k):
        previousCost = 0;
        currentCost = cost(X,y,theta[i,:],regTerm)
        while(np.abs(currentCost-previousCost) > epsilon):
            print(theta[i,:])
            theta[i,:] = theta[i,:] - learnRate*gradient(X,y,theta[i,:],regTerm)
            print(theta[i,:])
            previousCost = currentCost
            currentCost = cost(X,y,theta[i,:],regTerm)
    return theta

trX = np.load('trX.npy')
trY = np.load('trY.npy')
theta = train(trX,trY,2,0.1,0.1,4)

I can verify that cost and gradient return values of the right dimensions (cost returns a scalar, and gradient returns a 1-by-n row vector), but I get this warning:

RuntimeWarning: divide by zero encountered in log
  J = (np.dot(-(y.T),np.log(expit(np.dot(X,theta))))-np.dot((np.ones((m,1))-y).T,np.log(np.ones((m,1)) - (expit(np.dot(X,theta))).reshape((m,1))))) / m + (regTerm / (2 * m)) * np.linalg.norm(theta[1:])

Why is this happening, and how can I avoid it?

Answer

You can clean up the formula by using broadcasting, the element-wise operator * for products of vectors, and the operator @ for matrix multiplication, and by breaking the expression into intermediate steps, as suggested in the comments.

Here is your cost function, rewritten that way:

def cost(X, y, theta, regTerm):
    m = X.shape[0]  # number of training examples (equivalently y.shape[0] or p.shape[0])
    p = expit(X @ theta)  # predicted probabilities
    log_loss = -np.average(y * np.log(p) + (1 - y) * np.log(1 - p))
    J = log_loss + regTerm * np.linalg.norm(theta[1:]) / (2 * m)
    return J
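
Note that this rewrite alone does not silence the warning: for large |X @ theta|, expit saturates to exactly 0.0 or 1.0 in floating point, and np.log(0) is what triggers "divide by zero encountered in log". A common safeguard, sketched here with an illustrative constant eps, is to clip the probabilities away from the endpoints:

eps = 1e-15  # illustrative tolerance; any tiny positive constant works
p = np.clip(expit(X @ theta), eps, 1 - eps)  # keep p strictly inside (0, 1)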

You can clean up your gradient function along the same lines.
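
A minimal sketch of what that could look like, assuming y is a 1-D array of shape (m,) to match p, and using the squared-norm penalty discussed below:

def gradient(X, y, theta, regTerm):
    m = X.shape[0]
    p = expit(X @ theta)                 # predicted probabilities, shape (m,)
    grad = X.T @ (p - y) / m             # unregularized gradient, shape (n,)
    grad[1:] += regTerm * theta[1:] / m  # L2 penalty gradient, bias weight excluded
    return grad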

By the way, are you sure you want np.linalg.norm(theta[1:])? If you’re trying to do L2 regularization, the term should be np.linalg.norm(theta[1:]) ** 2.
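
In that case, the penalty term in the cost function above might look something like this (a sketch; l2_penalty is just an illustrative helper name, and the 1/(2m) scaling and the exclusion of the bias weight theta[0] follow your original code):

def l2_penalty(theta, regTerm, m):
    # Squared L2 norm of the non-bias weights: equivalent to np.linalg.norm(theta[1:]) ** 2.
    return regTerm * np.sum(theta[1:] ** 2) / (2 * m)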
