How do I do a groupby in Python to split between orders?

Question

I have a dataframe that shows an order listing. How do I use it to find the number of orders that contain spicy food? Using this code gives me 2 Yes and 2 No, but it should actually be 2 Yes and 1 No, as order 1001 is duplicated. Thank you. I would like to get an output that shows the

Accepted Answer

There's perhaps a simpler way, but this works.

First, group by order and spicy flag to get the count of spicy and non-spicy items for each order. Then sort by Spicy and drop duplicates by order number, keeping the last row (this removes the 'No' row in the Spicy column if a 'Yes' exists for that order). Finally, group by Spicy again and count to get the number of orders in each category.

    import pandas as pd

    df_orders = pd.DataFrame({'Order' : [1000, 1001, 1001, 1002],
                              'Item Name' : ['Calamari Rings', 'Cheesecake', 'Spicy Chicken', 'Spicy Lamb'],
                              'Spicy' : ['No', 'No', 'Yes', 'Yes']})

    # One row per (Order, Spicy) pair
    df_grouped = df_orders.groupby(['Order', 'Spicy']).count().reset_index()
    # 'No' sorts before 'Yes', so keep='last' keeps 'Yes' when an order has both
    df_grouped = df_grouped.sort_values(by='Spicy').drop_duplicates(subset='Order', keep='last')
    # Count orders per Spicy value
    df_grouped = df_grouped.groupby('Spicy').count()['Order'].reset_index()

Output:

      Spicy  Order
    0    No      1
    1   Yes      2
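The answer says there is perhaps a simpler way. One possible alternative is sketched below, and it is not part of the accepted answer. It marks an order as spicy if any of its items has Spicy equal to 'Yes', then counts orders per label. The variable names `order_is_spicy` and `spicy_counts` are illustrative, and the sketch assumes the Spicy column only ever holds 'Yes' or 'No'.

```python
import pandas as pd

df_orders = pd.DataFrame({
    "Order": [1000, 1001, 1001, 1002],
    "Item Name": ["Calamari Rings", "Cheesecake", "Spicy Chicken", "Spicy Lamb"],
    "Spicy": ["No", "No", "Yes", "Yes"],
})

# Boolean per item, then reduce per order: True if at least one item is spicy.
order_is_spicy = df_orders["Spicy"].eq("Yes").groupby(df_orders["Order"]).any()

# Map back to the Yes/No labels and count how many orders carry each label.
spicy_counts = (
    order_is_spicy.map({True: "Yes", False: "No"})
    .value_counts()
    .sort_index()
)
print(spicy_counts)
```

Because each order is reduced to a single True/False value before counting, order 1001 is counted once, as 'Yes', so the result has one 'No' order and two 'Yes' orders.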

Answer