
How to get unique counts based on a different column, with pandas groupby

I have the following dataframe:

import numpy as np
import pandas as pd

df = pd.DataFrame({
    'user': ['user122', 'user122', 'user124', 'user125', 'user125', 'user126', 'user126'],
    'effortduration': ['2 weeks', np.nan, '2 weeks', '3 weeks', np.nan, '2 weeks', '2 weeks'],
    'callbacks': [0, 0, 0, 0, 0, 1, 1],
    'applications': [0, 0, 1, 0, 0, 1, 1]})
df

      user effortduration  callbacks  applications
0  user122        2 weeks          0             0
1  user122            NaN          0             0
2  user124        2 weeks          0             1
3  user125        3 weeks          0             0
4  user125            NaN          0             0
5  user126        2 weeks          1             1
6  user126        2 weeks          1             1

I would like to group by effortduration and get the count of each column in terms of unique users (for example, how many distinct users had a callback). This is what I have tried so far:

function = {"user": pd.Series.nunique,
            "callbacks": lambda x: x.nunique(),
            "applications": lambda x: x.isin(['1']).nunique(),}

df.groupby('effortduration').agg(function)

                  user  callbacks  applications
effortduration                               
2 weeks            3          2             2
3 weeks            1          1             1

However, that is again not what I am looking for because the values of callbacks and applications are not based on the user column. My result should be something like this:

                   user  callbacks  applications
effortduration                               
2 weeks            3          1             2
3 weeks            1          0             0

Is there any way to do such a thing? If yes, is it also possible to generalize it? My original dataframe has many more columns, and it would be painful to write all the functions by hand.


Answer

  • This works with the sample data; I'm not sure about real data.
  • Replace 0 with NaN, and then drop rows where 'effortduration', 'callbacks', and 'applications' are all NaN.
  • Drop all duplicates.
    • Based on the desired result, it only matters whether a user called/applied at least once.
  • Group by and count.
import numpy as np
import pandas as pd

# sample data
df = pd.DataFrame({
    'user': ['user122', 'user122', 'user124', 'user125', 'user125', 'user126', 'user126'],
    'effortduration': ['2 weeks', np.nan, '2 weeks', '3 weeks', np.nan, '2 weeks', '2 weeks'],
    'callbacks': [0, 0, 0, 0, 0, 1, 1],
    'applications': [0, 0, 1, 0, 0, 1, 1]})

# replace 0 with NaN, then drop rows where all three value columns are NaN
df = df.replace(0, np.nan).dropna(how='all', subset=['effortduration', 'callbacks', 'applications'])

# drop duplicate rows; only whether a user called/applied at least once matters
df = df.drop_duplicates()

# groupby and count
dfg = df.groupby(['effortduration']).count()

# dfg
                user  callbacks  applications
effortduration                               
2 weeks            3          1             2
3 weeks            1          0             0
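
Because .count() simply counts the non-NaN entries in every remaining column, no per-column functions are needed, and the same pipeline should generalize to a frame with many more columns. A minimal sketch of that generalization; the interviews column is invented here purely for illustration:

import numpy as np
import pandas as pd

# hypothetical wider frame; 'interviews' is an invented column for illustration
df = pd.DataFrame({
    'user': ['user122', 'user122', 'user124', 'user125', 'user125', 'user126', 'user126'],
    'effortduration': ['2 weeks', np.nan, '2 weeks', '3 weeks', np.nan, '2 weeks', '2 weeks'],
    'callbacks': [0, 0, 0, 0, 0, 1, 1],
    'applications': [0, 0, 1, 0, 0, 1, 1],
    'interviews': [0, 0, 0, 0, 0, 1, 1]})

# every column except 'user' takes part in the NaN check, so extra
# columns are picked up automatically instead of being listed by hand
value_cols = df.columns.difference(['user'])

dfg = (df.replace(0, np.nan)
         .dropna(how='all', subset=value_cols)
         .drop_duplicates()
         .groupby('effortduration')
         .count())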

nunique

  • As already noted, this option returns the number of unique values in each column, so it doesn't produce the desired output.
import numpy as np
import pandas as pd

# recreate the sample data (df was modified by the steps above)
df = pd.DataFrame({
    'user': ['user122', 'user122', 'user124', 'user125', 'user125', 'user126', 'user126'],
    'effortduration': ['2 weeks', np.nan, '2 weeks', '3 weeks', np.nan, '2 weeks', '2 weeks'],
    'callbacks': [0, 0, 0, 0, 0, 1, 1],
    'applications': [0, 0, 1, 0, 0, 1, 1]})

# using nunique
dfg = df.groupby('effortduration').nunique()

# dfg
                user  callbacks  applications
effortduration                               
2 weeks            3          2             2
3 weeks            1          1             1
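
To see why nunique reports 2 for callbacks in the '2 weeks' group, inspect the raw values: the group contains both 0 and 1, and nunique counts distinct values regardless of how many distinct users produced them.

# the '2 weeks' group holds the two distinct values 0 and 1
df.loc[df['effortduration'] == '2 weeks', 'callbacks'].unique()
# array([0, 1])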