How to groupby and calculate new field with python pandas?

Question

I'd like to group by a specific column within a data frame called 'Fruit' and calculate the percentage of that particular fruit that are 'Good' See below for my initial dataframe Dataframe See below for my desired output data frame Note: Because there is 1 "Good" Apple and 1 "Bad" Apple, the percentage of Good Apples is 50%. See below

Accepted Answer

We can compare Condition with eq and take advantage of the fact that True is (1) and False is (0) when processed as numbers and take the groupby mean over Fruits:new_df = (    df['Condition'].eq('Good').groupby(df['Fruit']).mean().reset_index())new_df:    Fruit  Condition0   Apple        0.51  Banana        1.0We can further map to a format string and rename to get output into the shown desired output:new_df = (    df['Condition'].eq('Good')        .groupby(df['Fruit']).mean()        .map('{:.0%}'.format)  # Change to Percent Format        .rename('Percentage')  # Rename Column to Percentage        .reset_index()  # Restore RangeIndex and make Fruit a Column)new_df:    Fruit Percentage0   Apple        50%1  Banana       100%*Naturally further manipulations can be done as well.

Advertisement

Answer