after a groupby create a new column with a list of unique values for another column of the groupes values

Question

So i have a dataframe with two columns: artistID and genre: And what I want to do is to group by the column artistID (so the resulting datafdrame has as many rows as artistID there are in this dataframe), and the second column of the new dataframe I want it to be like a list or an array or whatever

Accepted Answer

Use Groupby.agg:In [2237]: df.groupby('artistID')['genre'].agg(set).reset_index()OR:In [2240]: df.groupby('artistID')['genre'].apply(lambda x: list(set(x)))Out[2237]:    artistID                   genre0        52      [rock, pop, metal]1        63  [pop, hiphop, electro]2        64   [salsa, jazz, latino]3        73         [salsa, latino]4        94       [reggaeton, trap]5       456                [hiphop]6       862                 [metal]7      6177             [rock, pop]

Advertisement

Answer