Sequentially counting repeated entries

Question

I am currently working on a project where I have to measure someone's activity on a site over time, based on whether they edit the site. I have a data frame that looks similar to this: I want to add a column to the data frame that counts the number of repeated values (number of edits, which is column

Accepted Answer

`groupby` and `cumcount`

    df['activity'] = df.groupby('x').cumcount() + 1
    df

       x       y  z  activity
    0  a     red  1         1
    1  b    blue  2         1
    2  c   green  3         1
    3  b  yellow  4         2
    4  b     red  5         3
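For anyone who wants to run this end to end, here is a minimal self-contained sketch. The question's input frame is not shown, so the `DataFrame` below is an assumption rebuilt from the output printed in the answer (columns `x`, `y`, `z`); only the `groupby('x').cumcount() + 1` line comes from the answer itself.

```python
import pandas as pd

# Assumed sample data, reconstructed from the output shown in the answer.
df = pd.DataFrame({
    "x": ["a", "b", "c", "b", "b"],
    "y": ["red", "blue", "green", "yellow", "red"],
    "z": [1, 2, 3, 4, 5],
})

# cumcount() numbers the rows inside each group of 'x' starting at 0,
# following the original row order; adding 1 makes the count 1-based.
df["activity"] = df.groupby("x").cumcount() + 1

print(df)
```

Because `cumcount` follows the existing row order, the frame should already be sorted the way you want the occurrences counted (for example by timestamp) before this line runs.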
