Pandas get topmost n records within each group

Question

Suppose I have pandas DataFrame like this: which looks like: I want to get a new DataFrame with top 2 records for each id, like this: I can do it with numbering records within group after groupby: which looks like: then for the desired output: Output: But is there more effective/elegant approach to do this? And also is there more

Accepted Answer

Did you trydf.groupby('id').head(2)Output generated:       id  valueid             1  0   1      1   1   1      2 2  3   2      1   4   2      23  7   3      14  8   4      1(Keep in mind that you might need to order/sort before, depending on your data)EDIT: As mentioned by the questioner, usedf.groupby('id').head(2).reset_index(drop=True)to remove the MultiIndex and flatten the results:    id  value0   1      11   1      22   2      13   2      24   3      15   4      1

Advertisement

Answer