I have a table with many columns, and I need to create a new column based on the row type, but the logic differs for each type of row.
My data looks like this:
type | field1 | field2 | field3 | field4 |
---|---|---|---|---|
1 | a | b | c | 17 |
2 | e | f | g | 20 |
3 | i | j | k | 100 |
The logic for rows of type 1 is concatenating field1, field2, field3.
The logic for rows of type 2 is concatenating field2, field3, field4.
The logic for rows of type 3 is squaring field4.
The super important part: I would like to avoid coding each type manually, as there are hundreds of different types, each with its own distinct logic that changes constantly. We enforce a strict SDLC, so deploying code updates would be a nightmare. Ideally I would put this logic into a SQL table somewhere and then somehow use that data in my pandas logic, but I don't know how to do that sort of thing.
Example:
```python
data = pd.read_sql(query)   # the data above
rules = pd.read_sql(query)  # the rules table
rules.head()
```
Type | Rule |
---|---|
1 | field1+field2+field3 |
2 | field2+field3+field4 |
3 | field4**2 |
```python
# pseudocode
for i in rules:
    data['output'] = data[filtered to i.type].apply(i.typeLogic)
data.head()
```
output |
---|
abc |
fg20 |
10000 |
Answer
Merge the rules into the data and use the `eval` method to evaluate each rule according to its type.
# data df = pd.DataFrame({'type': [1, 2, 3], 'field1': ['a', 'e', 'i'], 'field2': ['b', 'f', 'j'], 'field3': ['c', 'g', 'k'], 'field4': [17, 20, 100]}) # rules df rules = pd.DataFrame({'type': [1, 2, 3], 'rule': ['field1+field2+field3', 'field2+field3+field4', 'field4**2']}) # merge the dfs to be able to do a rules lookup later df = df.merge(rules, on='type') # create a list in a loop lst = [] for _, d in df.groupby("type"): # get the field columns f_cols = [c for c in d.columns if 'field' in c] # get the rule r = d.rule.iat[0] # rules with + concatenates strings and ints, so convert such rows to string dtype if '+' in r: d[f_cols] = d[f_cols].astype(str) # evaluate the rule d['new'] = d[f_cols].eval(f"{r}", engine='python') # append to lst lst.append(d) # concatenate all dfs in lst into a single df res = pd.concat(lst) res
Let me know if you have any questions.