Python pandas: group non-repeating values

Question

Hi, I have a data frame with two columns, col1 and col2. I would like to group by and sum over the non-repeating values in col1. Is there any way I can do this with pandas functions?

Accepted Answer

IIUC, you could create groups using groupby + cumcount (so that the nth occurrences of each col1 value are grouped together); then groupby those groups, join the "col1" values, and sum the "col2" values:

    out = df.groupby(df.groupby('col1').cumcount()).agg({'col1':','.join, 'col2':'sum'})

Output:

        col1  col2
    0  A,B,C     6
    1    A,C     9
    2      A     6
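For anyone who wants to run this end to end, here is a minimal sketch. The input frame from the question is not preserved here, so the `df` below is a hypothetical reconstruction chosen to be consistent with the output in the answer; the intermediate name `occurrence` is also just for illustration.

```python
import pandas as pd

# Hypothetical input: the question's original frame is not shown,
# so these values are an assumption that fits the answer's output.
df = pd.DataFrame({
    "col1": ["A", "B", "C", "A", "C", "A"],
    "col2": [1, 2, 3, 4, 5, 6],
})

# cumcount numbers each row by how many times its col1 value
# has already appeared: first A -> 0, second A -> 1, third A -> 2.
occurrence = df.groupby("col1").cumcount()

# Rows sharing an occurrence number form one group, so no col1
# value repeats inside a group. Join the labels, sum the values.
out = df.groupby(occurrence).agg({"col1": ",".join, "col2": "sum"})

print(out)
```

Splitting out `occurrence` makes it easy to inspect the group labels (here 0, 0, 0, 1, 1, 2) before aggregating.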

