pandas: Explode (duplicate) by group

Question

I have a df that looks like this:

It's an example with one row, but there are thousands of rows. I want to explode each value where there are multiple values in these four "test" columns, i.e. I want each one to duplicate the row for each one of the "test" values that is the same and if there are

Accepted Answer

Here is an alternative, using a helper function:

    def split(df):
        return (df.apply(lambda c: c.str.split(' / '))  # split cells
                  .apply(lambda x: x.explode().reset_index(drop=True))  # explode
                  .fillna({c: 'X' for c in df.filter(like='test_').columns})  # fill missing test with X
                  .ffill()  # fill non-test columns
                )

    # single row
    split(df)

    # multiple rows
    df.groupby('id').apply(split).droplevel(0)

output:

         id test_A test_B test_C test_D
    0  idx1    ABC      X    ABC    ABC
    1  idx1      X      X    XYZ    JKL
    2  idx1      X      X      X    XYZ

output on @jezrael's better example:

         id test_A test_B test_C test_D
    0  idx1    ABC      X    ABC    ABC
    1  idx1      X      X    XYZ    JKL
    2  idx1      X      X      X    XYZ
    0  idx2    SSD   "ABC     aa    ABC
    1  idx2      X    JKL      X    JKL
    2  idx2      X    XYZ      X      X
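For readers who want something to run end to end, here is a minimal self-contained sketch of the same idea. It is not the code from the answer: instead of the `groupby`/`apply` chain it walks the rows in plain Python and pads the shorter lists with `itertools.zip_longest`. The input frame is made up to resemble the outputs shown above (the original input is not included here), and the names `explode_tests`, `sep` and `fill` are hypothetical.

```python
from itertools import zip_longest

import pandas as pd

# Assumed sample data, invented to resemble the outputs shown above.
df = pd.DataFrame({
    "id": ["idx1", "idx2"],
    "test_A": ["ABC", "SSD"],
    "test_B": [None, "ABC / JKL / XYZ"],
    "test_C": ["ABC / XYZ", "aa"],
    "test_D": ["ABC / JKL / XYZ", "ABC / JKL"],
})


def explode_tests(df, sep=" / ", fill="X"):
    """Split every 'test_' cell on `sep` and emit one output row per position,
    padding shorter cells with `fill` and repeating the other columns."""
    test_cols = [c for c in df.columns if c.startswith("test_")]
    other_cols = [c for c in df.columns if c not in test_cols]
    records = []
    for row in df.to_dict("records"):
        # Missing cells (None/NaN) are treated as an empty list of values.
        parts = [row[c].split(sep) if isinstance(row[c], str) else []
                 for c in test_cols]
        # zip_longest pads the shorter lists; keep at least one row per input row.
        padded = list(zip_longest(*parts, fillvalue=fill)) or [(fill,) * len(test_cols)]
        for values in padded:
            rec = {c: row[c] for c in other_cols}
            rec.update(zip(test_cols, values))
            records.append(rec)
    return pd.DataFrame(records, columns=df.columns)


result = explode_tests(df)
print(result)
```

Unlike the `groupby` version, this sketch returns a fresh 0..n-1 index rather than one that restarts for each `id`. It also loops in Python, so on very large frames the vectorised approach from the answer may be faster.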

Answer