Use fields of one dataframe as conditions to fill a field of another dataframe

Question

I have 2 dataframes, the first is a small dataframe (df1) with information to use to fill a field (named Flag) of the second dataframe (df2). I need to write a function that uses each row of df1 as parameters to fill each row of df2 with a certain value (Y or N). df1 = type q25 q75 A 13

Accepted Answer

From what I understand you would like to have a flag column which tell you whether the particular row is an outlier or not. Here is a vectorized and concise way to achieve that:# Merge the dataframes on type columns = df2.merge(df1, left_on='TYPE', right_on='type', how='left')# calculate IQR and condition to check for outliers['IQR'] = s['q75'] - s['q25']is_outlier = ~s['delta'].between(s['q25'] - 1.5 * s['IQR'], s['q25'] + 1.5 * s['IQR'])# Use np.where to select Y/N based on the outlier conditions['Flag'] = np.where(s['Flag'].ne('Y') & is_outlier, 'Y', s['Flag'])# drop the columns from df1s = s.drop(columns=df1.columns)Resultprint(s)   field1  field2  ... TYPE  delta Flag  IQR0  field1  field2  ...    A    379    Y   851  field1  field2  ...    C     90    N   692  field1  field2  ...    A     50    N   853  field1  field2  ...    B   2000    Y  119

type	q25	q75
A	13	98
B	381	500
C	34	103

field1	field2	…	TYPE	delta	Flag
field1	field2	…	A	379	Y
field1	field2	…	C	90	N
field1	field2	…	A	50	N
field1	field2	…	B	2000	Y

Advertisement

Answer