Transform Pandas column to get a key value pair in a column post group by

Question

My DataFrame: Output required: Approach tried so far: Answer Use GroupBy.apply with lambda function: Duplicated keys not exist in python dictionary. You can aggregate values, e.g. by sum:

Accepted Answer

Use GroupBy.apply with lambda function:df['ID'] = df['ID'].str.strip("'")df1 = (df.groupby(['Col X', 'Col Y'])[['ID','Value']]        .apply(lambda x: dict(x.to_numpy()))        .reset_index(name='Out'))print (df1)  Col X Col Y                       Out0     A     a  {'r': 3, 'b': 2, 'c': 1}1     B     b          {'d': 7, 's': 6}Duplicated keys not exist in python dictionary. You can aggregate values, e.g. by sum:df['ID'] = df['ID'].str.strip("'")df = df.groupby(['Col X', 'Col Y','ID'], as_index=False)['Value'].sum()print (df)  Col X Col Y ID  Value0     A     a  b      21     A     a  c      12     A     a  r      33     B     b  d     124     B     b  s      6df1 = (df.groupby(['Col X', 'Col Y'])[['ID','Value']]        .apply(lambda x: dict(x.to_numpy()))        .reset_index(name='Out'))print (df1)  Col X Col Y                       Out0     A     a  {'b': 2, 'c': 1, 'r': 3}1     B     b         {'d': 12, 's': 6}

Advertisement

Answer