Chain df.str.split() in pandas dataframe

Question

Edit: 2022NOV21 How do we chain df.col.str.split() since this returns the split columns if expand = True I am trying to split a column after performing .melt(). If I use assign I end up using the original column and the melted column actually does not even exist. Answer Using expand converts it into a DataFrame, which you do not really

Accepted Answer

Using expand converts it into a DataFrame, which you do not really want here; secondly with chaining, use an anonymous function to refer to the previous dataframe:(df.melt(id_vars='id',var_name='fy',value_name='num')assign(year = lambda df: df.fy.str.split('_').str[0],       t = lambda df: df.fy.str.split('_').str[1]))   id        fy   num  year    t0   1  2022_amt  10.1  2022  amt1   2  2022_amt  20.2  2022  amt2   3  2022_amt  30.3  2022  amt3   4  2022_amt  40.4  2022  amt4   1  2022_qty  10.0  2022  qty5   2  2022_qty  20.0  2022  qty6   3  2022_qty  30.0  2022  qty7   4  2022_qty  40.0  2022  qtyFor your use case, there are simpler, more efficient ways to do this:with pd.stack:df = df.set_index('id')df.columns = df.columns.str.split('_', expand = True)df.columns.names = ['year', 't']df.stack(['year', 't']).reset_index(name='num')   id  year    t   num0   1  2022  amt  10.11   1  2022  qty  10.02   2  2022  amt  20.23   2  2022  qty  20.04   3  2022  amt  30.35   3  2022  qty  30.06   4  2022  amt  40.47   4  2022  qty  40.0with pivot_longer from pyjanitor:# pip install pyjanitorimport pandas as pdimport janitor as jndf.pivot_longer(index = 'id', names_to = ('year','t'), names_sep = '_')   id  year    t  value0   1  2022  amt   10.11   2  2022  amt   20.22   3  2022  amt   30.33   4  2022  amt   40.44   1  2022  qty   10.05   2  2022  qty   20.06   3  2022  qty   30.07   4  2022  qty   40.0

Advertisement

Answer