How to apply pandas groupby to a dataframe to use both rows and columns when calculating a mean

Question

I have a dataframe df in the format: And I am looking to group it such that I intersect the Rating as the index, the Height (split into buckets) as the columns, and within the individual cells have the average value for the combination of Grade and Height. So, the output dataframe would look something like th…

Accepted Answer

Use pd.cut to break your Height Column into bins.Create a new column  of Speed * ValuePivot your table, mean is the default pivot function.dropna=False is used so that even null bins are shown.df.Height = pd.cut(df.Height, bins=[0, 10, 25, 50, 100])df['speed_value'] = df.Speed.mul(df.Value)out = df.pivot_table(index='Grade', columns='Height', values='speed_value', dropna=False)print(out)Output:Height  (0, 10]  (10, 25]  (25, 50]  (50, 100]GradeA           NaN      50.0       NaN        NaNB           NaN      30.0       NaN        NaNC           NaN       NaN       NaN      120.0

Advertisement

Answer