Group by and find top n value_counts pandas

Question

I have a dataframe of taxi data with two columns that looks like this: Basically, each row represents a taxi pickup in that neighborhood in that borough. Now, I want to find the top 5 neighborhoods in each borough with the most number of pickups. I tried this: Which gives me something like this: How do I filt…

Accepted Answer

I think you can use nlargest &#8211; you can change 1 to 5:s = df['Neighborhood'].groupby(df['Borough']).value_counts()print sBorough                      Bronx          Melrose            7Manhattan      Midtown           12               Lincoln Square     2Staten Island  Grant City        11dtype: int64print s.groupby(level=[0,1]).nlargest(1)Bronx          Bronx          Melrose        7Manhattan      Manhattan      Midtown       12Staten Island  Staten Island  Grant City    11dtype: int64additional columns were getting created, specified level info

Advertisement

Answer