Pandas – take multiple columns and transform them into a single column of dictionary objects?

Question

I am trying to transform a DataFrame by combining extra columns into a dictionary. my DataFrame will always have four columns, at least: record, yhat, residual, and hat, with additional columns in different cases. My current df head looks like this: If we look at the top column, we see that there are 2 additional columns, RinvRes and AOMstat I

Accepted Answer

in one step with .join, .agg(dict) and .dropfirst create your list of aggregate columnsagg_cols = ['RinvRes', 'AOMstat']df1 = df.join(df[agg_cols].agg(dict,axis=1)                          .to_frame('additional')).drop(agg_cols,1)print(df1)   record    yhat  residual      hat                                  additional0       1  6.7272   -0.5713  0.04985   {'RinvRes': 0.009825, 'AOMstat': 0.02041}1       2  6.5568    0.1946  0.09771  {'RinvRes': -0.01493, 'AOMstat': -0.03078}2       3  6.5457    0.1619  0.09765      {'RinvRes': 0.2728, 'AOMstat': 0.5626}

Advertisement

Answer