How to deal with “ValueError: array must not contain infs or NaNs” while running regressions in python

Question

I have a df with growth variables and often some initial values are 0, in which case it produces infinite values when the value moves from zero to non-zeros. i.e. when i run PanelOLS, i get an error message Is there a way to ignore these entries to continue with the regression without having to drop them and …

Accepted Answer

No, you can&#8217;t ignore these entries. This issue need to be handle before training the model, if not, you can not train it.Depending on your data and application a different method is preferred to handle these NaN and inf. One example of code that is posted in this SO question:df.replace([np.inf, -np.inf], np.nan).dropna(axis=1) # You can replace inf and -inf with NaN, and then select non-null rows.In this case, we are removing all rows that have inf or NaN values.

Advertisement

Answer