Combining unique elements of a DataFrame in a list

Question

I'll try to ask my question as clearly as possible. I have the following DataFrame. Now I want to keep values unique to each player only once, ideally in a list, but that's not a big deal. For example, players A and B both play soccer, so I don't want soccer in the output. tenn…

Accepted Answer

It seems you need to remove duplicates in the 'game' column with DataFrame.drop_duplicates, keeping the last occurrence of each game, and then, if you need lists, aggregate the games per player with list:

    df = (df.drop_duplicates('game', keep='last')
            .groupby('player')['game']
            .agg(list)
            .reset_index())
    print(df)

      player                               game
    0      A            [Basketball, Ping pong]
    1      B                   [Soccer, Tennis]
    2      C  [Baseball, Volleyball, Dodgeball]
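For readers who want to run the answer end to end, here is a minimal self-contained sketch. The question's original table is not preserved, so the input below is an assumption: a hypothetical DataFrame built so that the accepted approach reproduces the output shown in the answer. The helper name games_per_player is also made up for illustration.

```python
import pandas as pd

# Hypothetical input (the question's real table is not available here).
# Only "Soccer" is shared, by players A and B.
df = pd.DataFrame({
    "player": ["A", "A", "A", "B", "B", "C", "C", "C"],
    "game": ["Soccer", "Basketball", "Ping pong", "Soccer", "Tennis",
             "Baseball", "Volleyball", "Dodgeball"],
})


def games_per_player(frame, keep="last"):
    """Drop repeated games, then collect each player's remaining games in a list.

    `keep` is passed straight to DataFrame.drop_duplicates:
    "last" keeps the last row of each repeated game, False drops every copy.
    """
    return (frame.drop_duplicates("game", keep=keep)
                 .groupby("player")["game"]
                 .agg(list)
                 .reset_index())


# Same logic as the accepted answer: a shared game stays with the last player.
result = games_per_player(df)
print(result)

# Variant, if a shared game should not appear for anyone at all.
exclusive = games_per_player(df, keep=False)
print(exclusive)
```

With keep='last', Soccer is removed from A but kept for B. If the goal is to drop a shared game for every player, keep=False may be closer to what the question asks; which one is right depends on the intended output.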
