pandas how to iteratively count instances of a category by row and reset them when the other category appears?

Question

I have a DataFrame that shows the behavior of a machine. This machine can be in two states: Production or cleaning. Hence, I have a dummy variable called &#8220;Production&#8221;, that shows 1 when the machine is producing and 0 when it is not. I would like to know the production cycles (how many hours does t…

Accepted Answer

You can first detect the turning points by looking at the points where it differs from the previous one. Then cumulative sum of this gives the needed groupings. We transform this with count to get the size of each group:>>> grouper = df.production.diff().ne(0).cumsum()>>> df["production_cycle"] = df.groupby(grouper).transform("count")>>> df    production  production_cycle0            1                 51            1                 52            1                 53            1                 54            1                 55            0                 26            0                 27            1                 18            0                 39            0                 310           0                 3the grouper is>>> grouper0     11     12     13     14     15     26     27     38     49     410    4

Advertisement

Answer