Pandas find most bought item given ClientID ItemID ItemQuantity

Question

Among the columns of my DataFrame are ClientID, CartID, FoodID, and Quantity. I would like to find the food that each client has bought the most. My first attempt gave a completely wrong output.

EDIT: I also tried a second approach, but it returns a pair (ClientID, Quantity of the most bought food), whereas I need (ClientID, FoodID).

Accepted Answer

```python
df.groupby(["ClientID", "FoodID"])['Quantity'].sum().reset_index().sort_values(
    ["ClientID", 'Quantity'], ascending=False).drop_duplicates(
    ["ClientID"]).sort_values('ClientID')
```

First, get a DataFrame that contains the total Quantity for each (ClientID, FoodID) combination. Then sort it on ClientID and Quantity in descending order, so that the highest Quantity for each client appears first. Finally, drop the duplicates per ClientID: this discards every row for a client except the first one, which is the row with the maximum quantity.

Test case:

```python
import numpy as np
import pandas as pd

np.random.seed(0)
df = pd.DataFrame({
    'ClientID' : np.random.randint(1,10, 1000),
    'FoodID' : np.random.randint(1,10, 1000),
    'Quantity' : np.random.randint(1,10, 1000),
})
df.groupby(["ClientID", "FoodID"])['Quantity'].sum().reset_index().sort_values(
    ["ClientID", 'Quantity'], ascending=False).drop_duplicates(
    ["ClientID"]).sort_values('ClientID')
```

Output:

```
    ClientID  FoodID  Quantity
3          1       4        97
16         2       8        82
26         3       9       100
35         4       9        98
44         5       9        85
47         6       3       107
54         7       1        94
69         8       7       107
73         9       2       109
```
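The three steps described above may be easier to follow on a dataset small enough to check by hand. The sketch below splits the chain into named intermediate results. The toy data and the variable names (`totals`, `ranked`, `top`, `top_idxmax`) are illustrative assumptions, not part of the original answer. The last step shows a possible alternative based on `idxmax`, which is a different technique from the sort-and-drop approach in the answer.

```python
import pandas as pd

# Hypothetical toy data, small enough to verify by hand.
df = pd.DataFrame({
    "ClientID": [1, 1, 1, 2, 2, 2],
    "FoodID":   [10, 10, 20, 30, 40, 40],
    "Quantity": [2, 3, 4, 6, 1, 2],
})

# Step 1: total quantity per (ClientID, FoodID) pair.
totals = df.groupby(["ClientID", "FoodID"])["Quantity"].sum().reset_index()

# Step 2: sort descending so each client's largest total comes first.
ranked = totals.sort_values(["ClientID", "Quantity"], ascending=False)

# Step 3: keep only the first (largest) row per client, then restore client order.
top = ranked.drop_duplicates(["ClientID"]).sort_values("ClientID")

# Alternative (not from the answer): look up the row label of the
# largest total within each client and select those rows directly.
top_idxmax = totals.loc[totals.groupby("ClientID")["Quantity"].idxmax()]

# Client 1 bought food 10 most (total 5); client 2 bought food 30 most (total 6).
print(top[["ClientID", "FoodID"]])
```

Both variants report a single food per client, so if two foods tie for the highest total, only one of them appears in the result.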

Answer