Loop through unique values in a column to check another column and create another column – pandas

Question

I would like to create a function that goes through each unique value in one column, checks whether another column contains a given value, and then creates a column showing the result. For example: for each unique ID in df, check whether the Status is A, then create a result column: df: ID Status 1 A 1 B 2

Accepted Answer

I am not sure I understand the rules correctly. Should I always take the first occurrence of each ID? If so, the second row in your expected output is wrong.

You can use numpy.where:

    import numpy as np
    import pandas as pd

    df = pd.DataFrame({'ID': {0: 1, 1: 1, 2: 2, 3: 2}, 'Status': {0: 'A', 1: 'B', 2: 'B', 3: 'C'}})
    # Keep only the first row for each ID
    new_df = df.drop_duplicates(subset=["ID"]).copy()
    new_df["Result A?"] = np.where(new_df.Status == "A", "YES", "NO")

to get this:

       ID Status Result A?
    0   1      A       YES
    2   2      B        NO

Edit: Your desired output is ambiguous. There are two things you can do:

    # One Yes/No value per ID: does the group's Status contain "A"?
    df.groupby("ID")["Status"].apply({"A"}.issubset).replace({True: 'Yes', False: 'No'}).rename("Result A?")

gives you:

    ID
    1    Yes
    2     No
    Name: Result A?, dtype: object

Or:

    # Map the per-ID result back onto every row of the original frame
    df["Result A?"] = np.where(df.groupby("ID")["Status"].apply({"A"}.issubset).loc[df.ID], "YES", "NO")

which gives you:

       ID Status Result A?
    0   1      A       YES
    1   1      B       YES
    2   2      B        NO
    3   2      C        NO
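Since the question asks for a function, here is a minimal sketch of the per-row variant wrapped in one. It is not the code from the answer: it swaps the apply/.loc lookup for groupby(...).transform("any"), which broadcasts the per-ID result back to every row directly. The function name flag_ids_with_status and its parameters are hypothetical, chosen for illustration, and it assumes the intended rule is "YES if any row with that ID has the target status".

```python
import numpy as np
import pandas as pd


def flag_ids_with_status(df, id_col="ID", status_col="Status", target="A"):
    """Hypothetical helper: YES/NO per row, by whether the row's ID group contains `target`."""
    # True for rows whose status equals the target value
    is_target = df[status_col].eq(target)
    # "Any True within the ID group", repeated for every row of that group
    group_has_target = is_target.groupby(df[id_col]).transform("any")
    return np.where(group_has_target, "YES", "NO")


df = pd.DataFrame({"ID": [1, 1, 2, 2], "Status": ["A", "B", "B", "C"]})
df["Result A?"] = flag_ids_with_status(df)
print(df)
```

Because transform returns a result aligned with the original index, this should also work when the IDs are not sorted or the index is not the default range.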

Answer