Add column with a specific sequence of numbers depending on value

Question

I have this dataframe: I want to add a new column Sequence with a sequence of numbers. The condition is when the first True appears in the Condition column, the following rows must contain the sequence 1, 2, 3, 1, 2, 3... until another True appears again, at which point the sequence is restarted again. Furthermore, ideally, until the first

Accepted Answer

Let us do cumsum to identify blocks of rows, then group the dataframe by blocks and use cumcount to create sequential counter, then with some simple maths we can get the outputb = df['Condition'].cumsum()df['Seq'] = df.groupby(b).cumcount().mod(3).add(1).mask(b < 1, 0)ExplainedIdentify blocks/groups of rows using cumsumb = df['Condition'].cumsum()print(b)0     01     02     1 # -- group 1 start --3     14     15     16     17     18     19     1 # -- group 1 ended --10    211    2Name: Condition, dtype: int32Group the dataframe by the blocks and use cumcount to create a sequential counter per blockc = df.groupby(b).cumcount()print(c)0     01     12     03     14     25     36     47     58     69     710    011    1dtype: int64Modulo(%) divide the sequential counter by 3 to create a repeating sequence that repeats every three rowsc = c.mod(3).add(1)print(c)0     11     22     13     24     35     16     27     38     19     210    111    2dtype: int64Mask the values in sequence with 0 where the group(b) is < 1c = c.mask(b < 1, 0)print(c)0     01     02     13     24     35     16     27     38     19     210    111    2Result    ID  Condition  Seq0    1      False    01    1      False    02    1       True    13    1      False    24    1      False    35    1      False    16    1      False    27    1      False    38    1      False    19    1      False    210   1       True    111   1      False    2

Advertisement

Answer

Explained

Result