GroupBy Column1, then get all elements with the first/last element on Column2 (Python)

Question

I want to group by user_id, then get the first element of survey_id, and get all elements related to this selection In the same way I want to group by user_id, then get the last element of survey_id, and get all elements related to this selection Is there a quick groupby command to get this? I can do this by

Accepted Answer

Solution with no merging:df_head = df[df.survey_id.eq(df.groupby('user_id').transform('min').survey_id)]result:    user_id  survey_id answer0         1          1     no1         1          1    yes2         1          1     no3         1          1     no7         2          4    yes8         2          4     no9         2          4    yes14        3          7     no21        4         10     no22        4         10    yesdf_tail = df[df.survey_id.eq(df.groupby('user_id').transform('max').survey_id)]result:    user_id  survey_id answer6         1          3     no12        2          6    yes13        2          6     no17        3          9    yes18        3          9    yes19        3          9     no20        3          9     no25        4         12     no26        4         12    yesIdea is to calculate min / max of survey_id per user_id and compare it to survey_id at row level of df. Please note that original index of dataframe is preserved. If You need new index just add at the end:df_head = df_head.reset_index(drop = True)df_tail = df_tail.reset_index(drop = True)

Advertisement

Answer