pandas: Create new column by comparing DataFrame rows with columns of another DataFrame

Question

Assume I have df1: And a df2: I'm looking for a way to create a new column in df2 that gets number of rows based on a condition where all columns in df1 has values greater than their counterparts in df2 for each row. For example: To elaborate, at row 0 of df2, df1.alligator_apple has 4 rows which values are

Accepted Answer

I believe this does what you want:df2['greater'] = df2.apply(    lambda row:     (df1['alligator_apple'] > row['alligator_apple']) &     (df1['barbadine'] > row['barbadine']) &     (df1['capulin_cherry'] > row['capulin_cherry']),     axis=1,).sum(axis=1)print(df2)output:   alligator_apple  barbadine  capulin_cherry  greater0                6          3               1        41                7         19               9        12               15         25              15        03                5         12              27        3Edit: if you want to generalize and apply this logic for a given column set, we can use functools.reduce together with operator.and_:import functoolsimport operatorcolumns = ['alligator_apple', 'barbadine', 'capulin_cherry']df2['greater'] = df2.apply(    lambda row: functools.reduce(        operator.and_,         (df1[column] > row[column] for column in columns),    ),     axis=1,).sum(axis=1)

Advertisement

Answer