Python/Pandas: If Column has multiple values, convert to single row with multiples values in list

Question

In my DataFrame, I have many instances of same AutoNumber having different KeyValue_String. I would like to convert these instances to a single row where the KeyValue_String is a list comprised of the multiple unique values. The desired output would look like this, except I want to keep all of the other colum…

Accepted Answer

If I understand correctly, you could opt for using groupby, transform, and unique. df['KeyValue_String'] = df.groupby('AutoNumber').KeyValue_String.transform('unique')Then you can drop duplicates assuming as mentioned in the comments that rows with the same AutoNumber contain duplicate information besides the KeyValue_String. df = df.drop_duplicates(subset='AutoNumber')I would advise if you want arrays you keep everything in the column as an array, and don&#8217;t expend effort putting mixed types in the column which will just be harder to work with anyways.  Demo>>> df    AutoNumber KeyValue_String0        50899              DD1        50905          Cheque2        50906              DD3        50907          Cheque4        50908              DD5        50909              DD6        50910          Cheque7        50911          Cheque8        50912          Cheque9        50913          Cheque10       50914          Cheque11       50914              DD12       50915          Cheque13       50916          Cheque14       50916          Cheque>>> df['KeyValue_String'] = df.groupby('AutoNumber').KeyValue_String.transform('unique')>>> df.drop_duplicates(subset='AutoNumber')    AutoNumber KeyValue_String0        50899            [DD]1        50905        [Cheque]2        50906            [DD]3        50907        [Cheque]4        50908            [DD]5        50909            [DD]6        50910        [Cheque]7        50911        [Cheque]8        50912        [Cheque]9        50913        [Cheque]10       50914    [Cheque, DD]12       50915        [Cheque]13       50916        [Cheque]

Advertisement

Answer