Confuse why my KNN code is throwing a ValueError

Question

I am using sklearn for KNN regressor: I get this error message: Could someone please explain this? My data is in the hundred thousands for target and the thousands for input. And there is no blanks in the data. Answer Before answering the question, Let me refactor the code. You are using a dataframe so you can index single or

Accepted Answer

Before answering the question, Let me refactor the code. You are using a dataframe so you can index single or muliple fields of the dataframe without going through the extra steps you&#8217;ve used:#importing libraries and dataimport pandas as pdfrom sklearn.neighbors import KNeighborsRegressor as KNRtheta = pd.read_csv("train.csv") # pandas dataframe#getting data wanted from theta and putting it in a new dataframex = theta[["YearBuilt", "YrSold"]] # index multiple fields#getting target datay = theta["SalePrice"] # index single field#using KNNhorses = KNR(n_neighbors = 3)horses.fit(x,y) # fit KNNRegarding your error, it indicates that you have some NaN, Inf, large values in your data. You can ensure these doesnt occur by filtering out the NaN and inf values using this:theta = theta.replace([np.inf, -np.inf], np.nan)theta.dropna(inplace=True)

Advertisement

Answer