Pandas compare and sum values between two DataFrame with different size

Question

Suppose I have two Dataframes with different sizes: to which I have: and: Now I want to add a third column to df1 say total_volume, where it is the summation of the volume that lie between individual row of xlow and xup of df1. I can do this using: we can check the value of say the second row as:

Accepted Answer

If bins are not overlapping is possible use cut with aggregate sum and then add to df1 by DataFrame.join:df2['g'] = pd.cut(df2['x'], bins=[0] + df1['xup'].tolist(), labels=df1['xup'])df2 = df1.join(df2.groupby('g')['volume'].sum(), on='xup')print (df2)    xlow xup  volume0    0.0   1       01    1.0   2      202    2.0   3      153    3.0   4       44    4.0   5       05    5.0   6       06    6.0   7       37    7.0   8       28    8.0   9      209    9.0  10      1010  10.0  11       0

Advertisement

Answer