Extract new columns and fill values based on a categorical column of a data frame in Python

Question

I have a data frame where one column holds categorical strings and the next holds the values corresponding to them. I want to create new columns from the categories in the df.status column and fill the empty cells with np.nan. This requires a pivot on multiple columns. I am looking for an efficient solution that works for large data frames. …

Accepted Answer

You want:

In [255]: df.pivot(index=['attr1', 'attr2', 'attr3'], columns='status', values='value').rename_axis(None, axis=1).reset_index()
Out[255]:
  attr1 attr2 attr3 buy returned sold
0     a     b     c   5      yes    6
1     a     b     f   4      NaN  NaN
2     f     b     a   2      NaN  NaN
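For anyone who wants to try this end to end, here is a minimal self-contained sketch of the same pivot. The input frame is an assumption: the question's data is not shown here, so the rows below are reconstructed by hand from the output above. The helper name `widen_status` is hypothetical and not part of pandas. Passing a list to `index` in `DataFrame.pivot` needs pandas 1.1 or newer.

```python
import pandas as pd

# Hypothetical long-format input, reconstructed from the output shown above.
df = pd.DataFrame({
    "attr1":  ["a", "a", "a", "a", "f"],
    "attr2":  ["b", "b", "b", "b", "b"],
    "attr3":  ["c", "c", "c", "f", "a"],
    "status": ["buy", "returned", "sold", "buy", "buy"],
    "value":  [5, "yes", 6, 4, 2],
})


def widen_status(frame, keys, status_col="status", value_col="value"):
    """Turn each distinct status into its own column, keyed by `keys`.

    Key/status combinations that do not occur in `frame` come out as NaN.
    """
    wide = frame.pivot(index=keys, columns=status_col, values=value_col)
    # Drop the "status" label from the column axis, then turn the keys
    # back into ordinary columns.
    return wide.rename_axis(None, axis=1).reset_index()


result = widen_status(df, ["attr1", "attr2", "attr3"])
print(result)
```

Note that `pivot` raises a ValueError if the same key/status combination appears more than once. If your data can contain such duplicates, `pivot_table` with an explicit `aggfunc` (for example `"first"`) is one possible alternative, at the cost of you deciding how duplicates are combined.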


Answer