Pandas: using groupby to calculate a ratio by specific values

Question

Hi I have a dataframe that looks like this: and I want to calculate a ratio in the column 'count_number', based on the values in the column 'tone' by this formula: ['blue'+'grey']/'red' per each unite combination of 'participant_id', 'session', 'block' - here is part of my dataset as text, the left column 'RATIO' is my expected output: participant_id session block

Accepted Answer

Here&#8217;s one approach:We could create separate Series objects for numerator and denominator of the divisions; then groupby + transform sum + div will fetch the desired ratio:num = df['tone'].isin(['blue','grey']) * df['count_number']denom = df['tone'].eq('red') * df['count_number']cols = [df[c] for c in ['participant_id', 'session', 'block']]df['RATIO'] = (num.groupby(cols).transform('sum')               .div(denom.groupby(cols).transform('sum'))               .replace(float('inf'), '#DIV/0!'))Another approach could be to use groupby + apply a lambda that calculates the required ratio for each group; then map the ratios back to the original DataFrame:cols = ['participant_id', 'session', 'block']mapping = (df.groupby(cols)           .apply(lambda x: (x.loc[x['tone'].isin(['blue','grey']), 'count_number'].sum() /                              x.loc[x['tone'].eq('red'), 'count_number']))           .droplevel(-1))df['RATIO'] = df.set_index(cols).index.map(mapping)df['RATIO'] = df['RATIO'].replace(float('inf'), '#DIV/0!')Output:    group  participant_id  session block  tone  count_number     RATIO0       1              10        1   neg  blue             0       0.01       1              10        1   neg  grey             0       0.02       1              10        1   neg   red             3       0.03       1              10        1   neu  blue             1   #DIV/0!4       1              10        1   neu  grey             1   #DIV/0!5       1              10        1   neu   red             0   #DIV/0!6       1              10        2   neg  blue             3  2.3333337       1              10        2   neg  grey             4  2.3333338       1              10        2   neg   red             3  2.3333339       1              10        2   neu  blue             4  1.33333310      1              10        2   neu  grey             0  1.33333311      1              10        2   neu   red             3  1.33333312      1              11        1   neg  blue             0       0.013      1              11        1   neg  grey             0       0.014      1              11        1   neg   red             3       0.0

Advertisement

Answer