Convert rec.array to dataframe

Question

I've been trying to convert a numpy rec.array into a dataframe. The current array looks like: The result should be a five-column dataframe like the following: Weights v_1 v_2 v_3 v_4 0.2 1.76405235 0.40015721 0.97873798 2.2408932 0.2 1.86755799 -0.97727788 0.95008842 -0.15135721 .... .... ... ... ... 0.05882353 0.17742614 -0.40178094 -1.63019835 0.46278226 and so on.. However, as I do pd.DataFrame(my_list), the

Accepted Answer

I assume your recarray is stored in a variable called data. You can convert the array to dataframe using pd.DataFrame and pd.concat. Then you can use pandas.DataFrame.pop to drop the array of lists and pandas.DataFrame.explode to convert column containing list to data in multiple columns.Reading Datadf = pd.DataFrame()for record in data:    temp_df = pd.DataFrame(record.tolist())    df = pd.concat([df, temp_df])Pre-processing and Unraveling datadf[['v_1', 'v_2', 'v_3', 'v_4']] = pd.DataFrame(df[1].tolist(), index= df.index)df['weights'] = df.pop(0).explode()df.pop(1)Output :This gives us the expected output :         v_1       v_2       v_3       v_4   weights0   1.764052  0.400157  0.978738  2.240893       0.21   1.867558 -0.977278  0.950088 -0.151357       0.22  -0.103219  0.410598  0.144044  1.454274       0.23   0.761038  0.121675  0.443863  0.333674       0.24   1.494079 -0.205158  0.313068 -0.854096       0.25   1.764052  0.400157  0.978738  2.240893       0.16   1.867558 -0.977278  0.950088 -0.151357       0.17  -0.103219  0.410598  0.144044  1.454274       0.18   0.761038  0.121675  0.443863  0.333674       0.19   1.494079 -0.205158  0.313068 -0.854096       0.110 -2.552990  0.653619  0.864436 -0.742165       0.111  2.269755 -1.454366  0.045759 -0.187184       0.112  1.532779  1.469359  0.154947  0.378163       0.113 -0.887786 -1.980796 -0.347912  0.156349       0.114  1.230291  1.202380 -0.387327 -0.302303       0.115  1.764052  0.400157  0.978738  2.240893  0.16666716  1.867558 -0.977278  0.950088 -0.151357  0.16666717 -0.103219  0.410598  0.144044  1.454274  0.16666718  0.761038  0.121675  0.443863  0.333674  0.16666719  1.494079 -0.205158  0.313068 -0.854096  0.16666720 -2.552990  0.653619  0.864436 -0.742165  0.16666721  1.764052  0.400157  0.978738  2.240893  0.05882422  1.867558 -0.977278  0.950088 -0.151357  0.05882423 -0.103219  0.410598  0.144044  1.454274  0.05882424  0.761038  0.121675  0.443863  0.333674  0.05882425  1.494079 -0.205158  0.313068 -0.854096  0.05882426 -2.552990  0.653619  0.864436 -0.742165  0.05882427  2.269755 -1.454366  0.045759 -0.187184  0.05882428  1.532779  1.469359  0.154947  0.378163  0.05882429 -0.887786 -1.980796 -0.347912  0.156349  0.05882430  1.230291  1.202380 -0.387327 -0.302303  0.05882431 -1.048553 -1.420018 -1.706270  1.950775  0.05882432 -0.509652 -0.438074 -1.252795  0.777490  0.05882433 -1.613898 -0.212740 -0.895467  0.386902  0.05882434 -0.510805 -1.180632 -0.028182  0.428332  0.05882435  0.066517  0.302472 -0.634322 -0.362741  0.05882436 -0.672460 -0.359553 -0.813146 -1.726283  0.05882437  0.177426 -0.401781 -1.630198  0.462782  0.058824AlternativelyThe same thing can be done using np.hstack as well, where data is the list of your recarray.df = pd.DataFrame(np.hstack(data).tolist())df['weights'] = df[0].explode()df[['v_1', 'v_2', 'v_3', 'v_4']] = pd.DataFrame(df[1].tolist())df.drop([0, 1], inplace=True, axis=1)OutputThis gives us the same output     weights       v_1       v_2       v_3       v_40        0.2  1.764052  0.400157  0.978738  2.2408931        0.2  1.867558 -0.977278  0.950088 -0.1513572        0.2 -0.103219  0.410598  0.144044  1.4542743        0.2  0.761038  0.121675  0.443863  0.3336744        0.2  1.494079 -0.205158  0.313068 -0.8540965        0.1  1.764052  0.400157  0.978738  2.2408936        0.1  1.867558 -0.977278  0.950088 -0.1513577        0.1 -0.103219  0.410598  0.144044  1.4542748        0.1  0.761038  0.121675  0.443863  0.3336749        0.1  1.494079 -0.205158  0.313068 -0.85409610       0.1 -2.552990  0.653619  0.864436 -0.74216511       0.1  2.269755 -1.454366  0.045759 -0.18718412       0.1  1.532779  1.469359  0.154947  0.37816313       0.1 -0.887786 -1.980796 -0.347912  0.15634914       0.1  1.230291  1.202380 -0.387327 -0.30230315  0.166667  1.764052  0.400157  0.978738  2.24089316  0.166667  1.867558 -0.977278  0.950088 -0.15135717  0.166667 -0.103219  0.410598  0.144044  1.45427418  0.166667  0.761038  0.121675  0.443863  0.33367419  0.166667  1.494079 -0.205158  0.313068 -0.85409620  0.166667 -2.552990  0.653619  0.864436 -0.74216521  0.058824  1.764052  0.400157  0.978738  2.24089322  0.058824  1.867558 -0.977278  0.950088 -0.15135723  0.058824 -0.103219  0.410598  0.144044  1.45427424  0.058824  0.761038  0.121675  0.443863  0.33367425  0.058824  1.494079 -0.205158  0.313068 -0.85409626  0.058824 -2.552990  0.653619  0.864436 -0.74216527  0.058824  2.269755 -1.454366  0.045759 -0.18718428  0.058824  1.532779  1.469359  0.154947  0.37816329  0.058824 -0.887786 -1.980796 -0.347912  0.15634930  0.058824  1.230291  1.202380 -0.387327 -0.30230331  0.058824 -1.048553 -1.420018 -1.706270  1.95077532  0.058824 -0.509652 -0.438074 -1.252795  0.77749033  0.058824 -1.613898 -0.212740 -0.895467  0.38690234  0.058824 -0.510805 -1.180632 -0.028182  0.42833235  0.058824  0.066517  0.302472 -0.634322 -0.36274136  0.058824 -0.672460 -0.359553 -0.813146 -1.72628337  0.058824  0.177426 -0.401781 -1.630198  0.462782

Weights	v_1	v_2	v_3	v_4
0.2	1.76405235	0.40015721	0.97873798	2.2408932
0.2	1.86755799	-0.97727788	0.95008842	-0.15135721
….	….	…	…	…
0.05882353	0.17742614	-0.40178094	-1.63019835	0.46278226

Advertisement

Answer

Reading Data

Pre-processing and Unraveling data

Output :

Alternatively

Output