Using Python Pandas to bin data in one df according to bins defined in a second df

Question

I am attempting to bin data in one dataframe according to bins defined in a second dataframe. I am thinking that some combination of pd.bin and pd.merge might get me there? This is basically the form each dataframe is currently in: df: And this is the table with the bins, df2: I would like to match the bin, a…

Accepted Answer

First merge df and df2 on the bin column, then select the rows where cut_min <= perc < cut_max:

    In [95]: result = pd.merge(df, df2, on='bin').query('cut_min <= perc < cut_max'); result
    Out[95]:
        bin id  perc  cut_max  cut_min  result
    0     1  a   0.1      0.2      0.0     low
    5     2  b   0.9      1.0      0.7    high
    7     2  e   0.5      0.7      0.3  medium
    9     3  c   0.3      0.4      0.0     low
    13    3  d   0.7      0.8      0.4  medium

    In [97]: result = result[['bin', 'id', 'perc', 'result']]

    In [98]: result.sort_values('id')
    Out[98]:
        bin id  perc  result
    0     1  a   0.1     low
    5     2  b   0.9    high
    9     3  c   0.3     low
    13    3  d   0.7  medium
    7     2  e   0.5  medium
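For anyone who wants to run the merge-then-filter approach end to end, here is a minimal self-contained sketch. The original tables were not preserved on this page, so the sample data is an assumption: the rows are reconstructed from the output shown in the answer, and the medium/high edges for bin 1 and the low/high edges not visible in that output are made up for illustration.

```python
import pandas as pd

# Assumed sample data: id/bin/perc values are taken from the answer's output.
df = pd.DataFrame({
    "id": ["a", "b", "c", "d", "e"],
    "bin": [1, 2, 3, 3, 2],
    "perc": [0.1, 0.9, 0.3, 0.7, 0.5],
})

# Assumed bin table: three half-open intervals [cut_min, cut_max) per bin.
# Edges that do not appear in the answer's output are hypothetical.
df2 = pd.DataFrame({
    "bin": [1, 1, 1, 2, 2, 2, 3, 3, 3],
    "cut_min": [0.0, 0.2, 0.6, 0.0, 0.3, 0.7, 0.0, 0.4, 0.8],
    "cut_max": [0.2, 0.6, 1.0, 0.3, 0.7, 1.0, 0.4, 0.8, 1.0],
    "result": ["low", "medium", "high"] * 3,
})

# Pair every data row with every interval defined for its bin.
merged = pd.merge(df, df2, on="bin")

# Keep only the pairing whose interval actually contains perc.
matched = merged.query("cut_min <= perc < cut_max")

# Drop the helper columns and order by id.
result = (
    matched[["bin", "id", "perc", "result"]]
    .sort_values("id")
    .reset_index(drop=True)
)
print(result)
```

Two things to keep in mind with this approach: the merge temporarily creates one row per (data row, interval) pair, so it can get large if each bin has many intervals, and any row whose perc falls outside every interval for its bin is silently dropped by the filter.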

Answer