Pandas multiple comparison on a single row

Question

My source data looks like this: I need to compare the second column with the third, the fourth with the fifth, and the sixth with the seventh. Column names can change, so I have to work with column positions; only the first column always has the name id. So if at least one of the comparisons ('1_src1' vs …

Accepted Answer

You can use:

    import numpy as np

    # compare columns by pair after the 1st one
    comp = df.iloc[:, 1::2].ne(df.iloc[:, 2::2].to_numpy())

    # select rules
                         # True in last     # True in first 2 comp
    df['res'] = np.select([comp.iloc[:, 2], comp.iloc[:, :2].any(axis=1)],
                          [2, 1], # matching values
                          0) # default

output:

       id 1_src1 1_src2 2_src1 2_src2 3_src1 3_src2  res
    0   1      a      a      a      a      a      a    0
    1   2      b      b      b      b      b      b    0
    2   3      c      c      f      c      c      1    2
    3   4      d      d      b      d      d      d    1
    4   5      e      e      e      e      e      m    2
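For anyone who wants to try this end to end, here is a minimal, self-contained sketch of the same idea. The sample frame is rebuilt from the output shown in the answer (without the res column), and the helper name `flag_pair_mismatches` is hypothetical, not part of pandas or NumPy. It also assumes the rule is "2 if the last pair differs, 1 if any earlier pair differs, 0 otherwise", which matches the answer for three pairs.

```python
import numpy as np
import pandas as pd

# Sample frame rebuilt from the answer's output table (assumption: this
# mirrors the asker's data; the 'res' column is left out on purpose).
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "1_src1": list("abcde"),
    "1_src2": list("abcde"),
    "2_src1": ["a", "b", "f", "b", "e"],
    "2_src2": list("abcde"),
    "3_src1": list("abcde"),
    "3_src2": ["a", "b", "1", "d", "m"],
})


def flag_pair_mismatches(frame):
    """Hypothetical helper: 2 if the last pair differs, 1 if an earlier one does, else 0."""
    left = frame.iloc[:, 1::2]              # 2nd, 4th, 6th, ... columns
    right = frame.iloc[:, 2::2].to_numpy()  # 3rd, 5th, 7th, ... as a bare array
    # Passing a NumPy array makes the comparison positional, so the
    # differing column names on each side do not matter.
    comp = left.ne(right)                   # True where a pair differs
    return np.select(
        [comp.iloc[:, -1], comp.iloc[:, :-1].any(axis=1)],
        [2, 1],      # values for the matching condition, checked in order
        default=0,   # no pair differs
    )


df["res"] = flag_pair_mismatches(df)
print(df["res"].tolist())
```

Because the slices are purely positional, renaming the columns does not change the result. The sketch does assume an even number of columns after id; with an odd number, the two slices have different widths and the comparison raises an error.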

Answer