Calculate the weighted average using groupby in Python

Question

Here is the dataframe I'm currently working on: What I'd like to calculate is the average of the variable "avg_lag" weighted by "tot_SKU" in each product_basket for both SMB and CORP groups. This means that, taking CORP as an example, I want to calculate something as: (585…

Accepted Answer

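This should do the trick, I think:

    import pandas as pd

    def calculator(df, columns):
        # columns[0] holds the weights, columns[1] the values to average
        weighted_avg = (df[columns[0]] * df[columns[1]]).sum() / df[columns[0]].sum()
        return weighted_avg

    cols = ['tot_SKU', 'avg_lag']
    # One weighted average per SF_type group
    Sums = df.groupby('SF_type').apply(lambda x: calculator(x, cols))
    # Returns a new dataframe with each group's result attached to its rows
    df.join(Sums.rename('sums'), on='SF_type')

Edit: Added the requested merge with the old dataframe.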
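For anyone who wants to check the approach on concrete numbers, here is a minimal self-contained sketch. The original dataframe is not shown above, so the rows below are made up for illustration, and the names `weighted_average` and `weighted_avg_lag` are hypothetical. Only `SF_type`, `tot_SKU` and `avg_lag` come from the question. For each group it computes sum(tot_SKU * avg_lag) / sum(tot_SKU).

```python
import pandas as pd

# Hypothetical toy data: the real dataframe from the question is not available.
df = pd.DataFrame({
    "SF_type": ["CORP", "CORP", "SMB", "SMB"],
    "tot_SKU": [10, 30, 5, 15],
    "avg_lag": [2.0, 4.0, 1.0, 3.0],
})

def weighted_average(group, weight_col, value_col):
    """Return sum(weight * value) / sum(weight) for one group."""
    weights = group[weight_col]
    return (weights * group[value_col]).sum() / weights.sum()

# Select only the needed columns before apply, so the grouping key
# is not part of the frame handed to the function.
weighted = (
    df.groupby("SF_type")[["tot_SKU", "avg_lag"]]
      .apply(lambda g: weighted_average(g, "tot_SKU", "avg_lag"))
      .rename("weighted_avg_lag")
)

# Broadcast each group's weighted average back onto the original rows.
result = df.join(weighted, on="SF_type")
print(result)
```

With this toy data, CORP gives (10*2 + 30*4) / 40 = 3.5 and SMB gives (5*1 + 15*3) / 20 = 2.5. If the average is needed per product_basket within each group, as the question suggests, passing a list of keys such as `["SF_type", "product_basket"]` to both `groupby` and the `on` argument of `join` should work the same way, assuming that column exists in the real data.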
