Different aggregate function based on value of column pandas

Question

I have the following dataframe I would like to add a new column, agg_y, which would be the the max(y) if label=="bottom" and min(y) if label=="top". I have tried this which gives the correct result I am just looking fora one-liner solution, if possible Answer Your solution in one line solution is: Solution without groupby, thank you @ouroboros1: Another idea

Accepted Answer

Your solution in one line solution is:test['agg_y'] = np.where(test.label == "bottom",                         test.groupby('label').y.transform('max'),                          test.groupby('label').y.transform('min'))Solution without groupby, thank you @ouroboros1:test['agg_y'] = np.where(test.label == 'bottom',                          test.loc[test.label.eq('bottom'), 'y'].max(),                          test.loc[test.label.ne('bottom'), 'y'].min())Another idea is mapping values, idea is similar like ouroboros1 solution:d = {'bottom':'max', 'top':'min'}test['agg_y'] = test['label'].map({val:test.loc[test.label.eq(val),'y'].agg(func)                                    for val, func in d.items()})print (test)   y   label  agg_y0  1  bottom      51  2     top      22  3  bottom      53  4     top      24  5  bottom      55  6     top      2

Advertisement

Answer