I have data ordered by ID and Year, with a series of event flags indicating whether a thing did or did not happen for that ID in that year:
ID | Year | x | y | z |
---|---|---|---|---|
1 | 2015 | 0 | 1 | 0 |
1 | 2016 | 1 | 1 | 0 |
1 | 2017 | 0 | 1 | 1 |
2 | 2015 | 1 | 0 | 1 |
2 | 2016 | 1 | 1 | 0 |
2 | 2017 | 0 | 1 | 1 |
I’d like to group by ID and, within each ID (ordered by Year), apply a cumulative count to each “event” column, such that I’m left with something like the following:
ID | Year | x_total | y_total | z_total |
---|---|---|---|---|
1 | 2015 | 0 | 1 | 0 |
1 | 2016 | 1 | 2 | 0 |
1 | 2017 | 1 | 3 | 1 |
2 | 2015 | 1 | 0 | 1 |
2 | 2016 | 2 | 1 | 1 |
2 | 2017 | 2 | 2 | 2 |
I’ve looked at various options using `cumsum` and `cumcount`, but I can’t seem to figure this out.
Answer
You can use `.groupby()` + `.cumsum()` to get the cumulative count for each “event” column, append the `_total` suffix to the column names with `.add_suffix()`, and then join the result back onto the first two columns:
```python
df[['ID', 'Year']].join(df.groupby('ID')[['x', 'y', 'z']].cumsum().add_suffix('_total'))
```
Result:
```
   ID  Year  x_total  y_total  z_total
0   1  2015        0        1        0
1   1  2016        1        2        0
2   1  2017        1        3        1
3   2  2015        1        0        1
4   2  2016        2        1        1
5   2  2017        2        2        2
```
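If you want to reproduce this end to end, here is a minimal, self-contained sketch that builds the sample frame from the question and applies the one-liner above (the `df`/`result` variable names and the `print` call are just illustrative):

```python
import pandas as pd

# Sample data from the question (rows already ordered by ID, then Year)
df = pd.DataFrame({
    'ID':   [1, 1, 1, 2, 2, 2],
    'Year': [2015, 2016, 2017, 2015, 2016, 2017],
    'x':    [0, 1, 0, 1, 1, 0],
    'y':    [1, 1, 1, 0, 1, 1],
    'z':    [0, 0, 1, 1, 0, 1],
})

# Running total of each event flag within each ID, renamed with a _total
# suffix, then joined back onto the identifying columns.
result = df[['ID', 'Year']].join(
    df.groupby('ID')[['x', 'y', 'z']].cumsum().add_suffix('_total')
)
print(result)
```

Note that `.cumsum()` accumulates in row order, so this assumes the rows are already sorted by Year within each ID (as they are here); otherwise, sort first with `df.sort_values(['ID', 'Year'])`.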