How do I create a linear regression model for a file that has about 500 columns as y variables? Working with Python

Question

This code manually selects a column from the y table and then joins it to the X table. The program then performs linear regression. Any idea how to do this for every single column from the y table? Answer You can regress multiple y's on the same X's at the same time. Something like this should work produces The first

Accepted Answer

You can regress multiple y&#8217;s on the same X&#8217;s at the same time. Something like this should workimport numpy as npfrom sklearn.linear_model import LinearRegressiondf_X = pd.DataFrame(columns = ['x1','x2','x3'], data = np.random.normal(size = (10,3)))df_y = pd.DataFrame(columns = ['y1','y2'], data = np.random.normal(size = (10,2)))X = df_X.iloc[:,:]y = df_y.iloc[:,:]lm = LinearRegression().fit(X,y)print(lm.coef_)produces[[ 0.16115884  0.08471495  0.39169592] [-0.51929011  0.29160846 -0.62106353]]The first row here ([ 0.16115884  0.08471495  0.39169592]) are the regression coefs of y1 on xs and the second are the regression coefs of y2 on xs.

Advertisement

Answer