Get the max value from each group with pandas.DataFrame.groupby

Question

I need to aggregate two columns of my dataframe, count the values of the second columns and then take only the row with the highest value in the &#8220;count&#8221; column, let me show: so far so good, but now I need to get only the row of each &#8216;col1&#8217; group that has the maximum &#8216;count&#8217;…

Accepted Answer

From your original DataFrame you can .value_counts, which returns a descending count within group, and then given this sorting drop_duplicates will keep the most frequent within group.df1 = (df.groupby('col1')['col2'].value_counts()         .rename('counts').reset_index()         .drop_duplicates('col1'))  col1 col2  counts0    A   AY       32    B   BX       34    C   CX       5

Advertisement

Answer