I want to compare the values in pandas

Question

I have two dataframes, each with a user column and a groups column. I want to do two things:

1. If a user in the second dataframe is not in the first one, print a message like: www is not in the list
2. If the user is in the list but the groups are not equal, print that out as well.

Accepted Answer

If both DataFrames have the same index values and the same length, and you need to compare values row by row:

    import pandas as pd

    print (df1.index.equals(df2.index))
    True

    #compare rows for not equal
    mask = df1['user'].ne(df2['user'])
    #filter rows by mask and column user in df2
    a = df2.loc[mask, 'user'].tolist()
    print (a)
    ['www']

    #join both DataFrames together
    df1 = pd.concat([df1, df2], axis=1, keys=('a','b'))
    df1.columns = df1.columns.map('_'.join)
    #filter only same user rows (copy to avoid SettingWithCopyWarning below)
    df1 = df1[~mask].copy()

    #split columns by ',' and convert to sets
    df1['a'] = df1['a_groups'].apply(lambda x: set(x.split(',')))
    df1['b'] = df1['b_groups'].apply(lambda x: set(x.split(',')))

    #get difference of sets, join to strings with separator ', '
    df1['a_diff'] = [', '.join(x.difference(y)) for x, y in zip(df1['b'], df1['a'])]
    df1['b_diff'] = [', '.join(x.difference(y)) for x, y in zip(df1['a'], df1['b'])]

    print (df1)
      a_user                a_groups b_user           b_groups
    0    xxx                   admin    xxx  admin,super admin
    2    zzz  guest,admin,superadmin    zzz   guest,superadmin

                                a                     b       a_diff b_diff
    0                     {admin}  {admin, super admin}  super admin
    2  {admin, superadmin, guest}   {superadmin, guest}               admin

    #filter by casting the diff columns to boolean, empty strings are converted to False
    b = df1.loc[df1['a_diff'].astype(bool), ['a_user','a_diff']]
    print (b)
      a_user       a_diff
    0    xxx  super admin

    c = df1.loc[df1['b_diff'].astype(bool), ['a_user','b_diff']]
    print (c)
      a_user b_diff
    2    zzz  admin

Here a_diff holds the groups that appear only in the second DataFrame, and b_diff holds the groups that appear only in the first.
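If the two DataFrames are not guaranteed to share the same index, one possible variant is to match rows on the user column with a left merge instead of comparing by position. The following is only a sketch under stated assumptions: the sample data is hypothetical (rows 0 and 2 mirror the output above, while the 'yyy' row and the groups for 'www' are invented), `compare_users` is an illustrative helper name, and users are assumed to be unique within each DataFrame.

```python
import pandas as pd

# Hypothetical sample data. Rows 0 and 2 mirror the output printed above;
# the 'yyy' row and the groups for 'www' are invented for illustration.
df1 = pd.DataFrame({
    "user": ["xxx", "yyy", "zzz"],
    "groups": ["admin", "guest", "guest,admin,superadmin"],
})
df2 = pd.DataFrame({
    "user": ["xxx", "www", "zzz"],
    "groups": ["admin,super admin", "guest", "guest,superadmin"],
})


def compare_users(first, second):
    """Hypothetical helper: return (missing users, group mismatches)."""
    # Left-join the second frame onto the first; the `_merge` column records
    # whether each user from `second` was also found in `first`.
    merged = second.merge(
        first, on="user", how="left",
        suffixes=("_second", "_first"), indicator=True,
    )
    missing = merged.loc[merged["_merge"] == "left_only", "user"].tolist()

    mismatches = []
    matched = merged[merged["_merge"] == "both"]
    for user, g_first, g_second in zip(
        matched["user"], matched["groups_first"], matched["groups_second"]
    ):
        # Compare the comma-separated groups as sets, so order is ignored.
        set_first = set(g_first.split(","))
        set_second = set(g_second.split(","))
        if set_first != set_second:
            mismatches.append(
                (user, sorted(set_first - set_second), sorted(set_second - set_first))
            )
    return missing, mismatches


missing, mismatches = compare_users(df1, df2)
for user in missing:
    print(f"{user} is not in the list")
for user, only_first, only_second in mismatches:
    print(f"{user}: only in first {only_first}, only in second {only_second}")
```

Unlike the positional comparison, this reports a user as missing only when the name does not appear anywhere in the first DataFrame, which is closer to how the question is worded.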


Answer