Column as a sum of its cumulative value only if other column meets condition

Question

I am struggling to produce the df['res'] column below without a loop, using only pandas.

Loop implementation of df['res']:

In pandas, it could be something like:

The issue is that df['res'] is previously empty. Any hint on how to think about these decompositions?

Accepted Answer

As per your requirement, the value of temp is reset whenever a is non-zero. So I decided to first group the data set and then apply the rules. All rows up to the first non-zero value in column a form one group, and each non-zero value in a starts a new group that runs until the row before the next non-zero value. Within each group, temp is simply the cumulative sum of b, and res for a row with non-zero a is a plus the temp value of the previous row:

    import numpy as np
    import pandas as pd

    # A new group starts at every row where a is non-zero.
    df['id'] = (~ df['a'].eq(0)).cumsum()
    # Cumulative sum of b within each group reproduces temp.
    df['temp2'] = df.groupby('id')['b'].cumsum()
    # res is 0 where a is 0, otherwise a plus the previous row's temp.
    df['res2'] = np.where(df['a'].eq(0), 0, df['a'] + df['temp2'].shift(fill_value=0))
    df.drop(columns=['id'], inplace=True)

Result (temp2 and res2 match the loop-based temp and res):

        a    b  res  temp  temp2  res2
    0   0   -2    0    -2     -2     0
    1   0    0    0    -2     -2     0
    2   0    0    0    -2     -2     0
    3   5   -5    3    -5     -5     3
    4   0    0    0    -5     -5     0
    5  12    0    7     0      0     7
    6   0   -2    0    -2     -2     0
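As a sanity check, here is a minimal self-contained sketch that rebuilds the a and b columns from the table in the answer and compares a loop against the grouped cumulative sum. The loop in res_with_loop is an assumption: the asker's original loop is not preserved on this page, so it is reconstructed from the res and temp columns of the table. The function names res_with_loop and res_vectorized are made up for illustration.

```python
import numpy as np
import pandas as pd

# Sample data copied from the a and b columns of the answer's table.
df = pd.DataFrame({
    "a": [0, 0, 0, 5, 0, 12, 0],
    "b": [-2, 0, 0, -5, 0, 0, -2],
})


def res_with_loop(frame):
    """Hypothetical loop inferred from the table; not the asker's original code."""
    temp = 0
    out = []
    for a, b in zip(frame["a"], frame["b"]):
        if a != 0:
            out.append(a + temp)  # use the running total accumulated so far
            temp = b              # restart the running total at this row's b
        else:
            out.append(0)
            temp += b             # keep accumulating b while a is zero
    return out


def res_vectorized(frame):
    """Grouped-cumsum approach from the answer, without mutating the input."""
    group_id = frame["a"].ne(0).cumsum()           # new group at each non-zero a
    temp = frame["b"].groupby(group_id).cumsum()   # running total of b per group
    res = np.where(frame["a"].eq(0), 0, frame["a"] + temp.shift(fill_value=0))
    return res.tolist()


loop_result = res_with_loop(df)
vector_result = res_vectorized(df)
print(loop_result)
print(vector_result)
```

If the reconstructed loop matches what the asker intended, both lists should agree with the res column in the table. The vectorized version keeps the group key in a local variable instead of adding and dropping a helper id column, which is a stylistic choice rather than a change in the method.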
