pandas concat generates NaN values

Question

I am curious why a simple concatenation of two dataframes in pandas, of the same shape and both without NaN values, can result in a lot of NaN values when joined. How can I fix this problem and prevent NaN values from being introduced? Trying to reproduce the problem failed, e.g. a test concatenation worked just fine and no NaN values were introduced.

Accepted Answer

I think the problem is that the two DataFrames have different index values. concat with axis=1 aligns rows by index label, so wherever a label exists in only one DataFrame, the other DataFrame's column gets NaN:

    aaa = pd.DataFrame([0,1,0,1,0,0], columns=['prediction'], index=[4,5,8,7,10,12])
    print(aaa)
        prediction
    4            0
    5            1
    8            0
    7            1
    10           0
    12           0

    bbb = pd.DataFrame([0,0,1,0,1,1], columns=['groundTruth'])
    print(bbb)
       groundTruth
    0            0
    1            0
    2            1
    3            0
    4            1
    5            1

    print(pd.concat([aaa, bbb], axis=1))
        prediction  groundTruth
    0          NaN          0.0
    1          NaN          0.0
    2          NaN          1.0
    3          NaN          0.0
    4          0.0          1.0
    5          1.0          1.0
    7          1.0          NaN
    8          0.0          NaN
    10         0.0          NaN
    12         0.0          NaN

The solution is reset_index, if the index values are not needed:

    aaa.reset_index(drop=True, inplace=True)
    bbb.reset_index(drop=True, inplace=True)

    print(aaa)
       prediction
    0           0
    1           1
    2           0
    3           1
    4           0
    5           0

    print(bbb)
       groundTruth
    0            0
    1            0
    2            1
    3            0
    4            1
    5            1

    print(pd.concat([aaa, bbb], axis=1))
       prediction  groundTruth
    0           0            0
    1           1            0
    2           0            1
    3           1            0
    4           0            1
    5           0            1

EDIT: If you need to keep the index of aaa and both DataFrames have the same length, use:

    bbb.index = aaa.index
    print(pd.concat([aaa, bbb], axis=1))
        prediction  groundTruth
    4            0            0
    5            1            0
    8            0            1
    7            1            0
    10           0            1
    12           0            1
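As a rough sketch of how this could be checked up front, the snippet below compares the two indexes before concatenating and then joins by row position without modifying the original DataFrames. The helper name concat_by_position is hypothetical (it is not part of pandas), and the data is the same aaa and bbb used in the answer; it assumes the index labels carry no meaning you need to keep.

```python
import pandas as pd

aaa = pd.DataFrame([0, 1, 0, 1, 0, 0], columns=['prediction'],
                   index=[4, 5, 8, 7, 10, 12])
bbb = pd.DataFrame([0, 0, 1, 0, 1, 1], columns=['groundTruth'])

# Quick diagnosis: if the indexes differ, axis=1 concat will align on
# labels and fill the gaps with NaN.
same_index = aaa.index.equals(bbb.index)
print(same_index)

# Label-aligned concat: rows are the union of both indexes.
misaligned = pd.concat([aaa, bbb], axis=1)
print(misaligned.isna().sum().sum())


def concat_by_position(left, right):
    """Hypothetical helper: join column-wise by row position.

    Index labels of both inputs are discarded; the inputs themselves
    are left unchanged because reset_index returns new DataFrames.
    """
    if len(left) != len(right):
        raise ValueError("DataFrames must have the same number of rows")
    return pd.concat(
        [left.reset_index(drop=True), right.reset_index(drop=True)],
        axis=1,
    )


result = concat_by_position(aaa, bbb)
print(result)
```

Unlike the inplace=True variant, this leaves aaa and bbb with their original indexes, which may be preferable if they are reused later.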
