Derive consumption from existing column (Pandas)

Question

[Data and desired-output tables not shown]

Doing: first create a derived column.

Any suggestion is helpful.

Accepted Answer

You aren't using a correct aggregation function. You should be using sum on both your "used" and "total" columns:

```python
import pandas as pd

df = pd.DataFrame(
    {
        'client': ['charles', 'charles', 'charles', 'cara', 'cara', 'cara', 'cara'],
        'total': [5000, 8000, 500, 1000, 2000, 500, 100],
        'used': [0, 3000, 0, 900, 1550, 500, 100],
        'date': ['2/1/2022', '2/1/2022', '3/1/2022', '3/1/2022', '2/1/2022', '2/1/2022', '3/1/2022']
    })

# Sum both columns per (client, date) group, then divide the sums
tmp = df.groupby(['client', 'date']).agg({'used': 'sum', 'total': 'sum'})
tmp['used_frac'] = tmp['used'] / tmp['total']
tmp.reset_index(inplace=True)
tmp

# Out:
#     client      date  used  total  used_frac
# 0     cara  2/1/2022  2050   2500   0.820000
# 1     cara  3/1/2022  1000   1100   0.909091
# 2  charles  2/1/2022  3000  13000   0.230769
# 3  charles  3/1/2022     0    500   0.000000
```
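As a rough sketch, the same sum-then-divide approach can be written as one chained expression. The example also contrasts it with averaging row-level ratios. That contrast is only an assumption about what a different aggregation might look like, since the question's original code is not shown. The names `out`, `row_frac`, and `mean_of_ratios` are illustrative and do not come from the original post.

```python
import pandas as pd

df = pd.DataFrame(
    {
        'client': ['charles', 'charles', 'charles', 'cara', 'cara', 'cara', 'cara'],
        'total': [5000, 8000, 500, 1000, 2000, 500, 100],
        'used': [0, 3000, 0, 900, 1550, 500, 100],
        'date': ['2/1/2022', '2/1/2022', '3/1/2022', '3/1/2022', '2/1/2022', '2/1/2022', '3/1/2022']
    })

# Sum 'used' and 'total' per (client, date), then divide the two sums.
# as_index=False keeps 'client' and 'date' as ordinary columns.
out = (
    df.groupby(['client', 'date'], as_index=False)[['used', 'total']]
      .sum()
      .assign(used_frac=lambda d: d['used'] / d['total'])
)
print(out)

# Hypothetical contrast (an assumption, not taken from the question):
# averaging per-row ratios weights every row equally, so it generally
# differs from the ratio of the group sums.
row_frac = df['used'] / df['total']
mean_of_ratios = row_frac.groupby([df['client'], df['date']]).mean()
print(mean_of_ratios)
```

For cara on 2/1/2022, the ratio of sums is 2050 / 2500 = 0.82, while the mean of the two row ratios (0.775 and 1.0) is 0.8875. This is why summing both columns before dividing matters when the goal is overall consumption per group.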


Answer