Pandas apply condition on a column that contains list

Question

I want to create a new column based on a condition that has to be applied to a list. Here's a reproducible example: As one can see, each object in the BRAND column is a list that can contain one or more elements (the list can also be empty, as in the row where ID = 1). Now, given the

Accepted Answer

Option 1:

Seems to be a bit faster on a larger set than Option 2 below:

    df['FLAG'] = df.BRAND.explode().isin(target_list).groupby(level=0, sort=False)\
        .any().map({True:'Y',False:'N'})

    print(df)

       ID            BRAND FLAG
    0   1               []    N
    1   2            [LVH]    Y
    2   3  [FER, MER, POR]    N
    3   4  [WDC, AUD, LVH]    Y
    4   5            [AST]    N

Explanation:

- Use Series.explode to "[t]ransform each element of a list-like to a row".
- Check for matches with Series.isin, and get True or False.
- We now have a series with duplicate rows, so use Series.groupby to isolate the groups, apply any, and get a pd.Series back with booleans in the correct shape.
- Finally, use Series.map to turn False and True into "N" and "Y" respectively.

Option 2:

Basically same performance as the answer by @AnoushiravanR

    df['FLAG'] = df.BRAND.apply(lambda x: 'Y' if len(set(x) & set(target_list))
                                 else 'N')

    print(df)

       ID            BRAND FLAG
    0   1               []    N
    1   2            [LVH]    Y
    2   3  [FER, MER, POR]    N
    3   4  [WDC, AUD, LVH]    Y
    4   5            [AST]    N

Explanation: set(list_a) & set(list_b) is shorthand for set_a.intersection(set_b), which we pass to len(). If len(...) == 0, this will result in False.
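For anyone who wants to run both options side by side, here is a minimal, self-contained sketch. The DataFrame is reconstructed from the printed output above. The contents of target_list are not shown in this excerpt, so `["LVH"]` is only an assumption that happens to reproduce the FLAG column. The helper names flag_by_explode and flag_by_sets are hypothetical and are only there to keep the two approaches apart.

```python
import pandas as pd

# Sample data reconstructed from the printed output in the answer.
df = pd.DataFrame({
    "ID": [1, 2, 3, 4, 5],
    "BRAND": [[], ["LVH"], ["FER", "MER", "POR"], ["WDC", "AUD", "LVH"], ["AST"]],
})

# ASSUMPTION: the real target_list is not shown; ["LVH"] matches the output above.
target_list = ["LVH"]


def flag_by_explode(brands, targets):
    """Option 1: explode lists to rows, test membership, collapse per original row."""
    return (
        brands.explode()                 # one row per list element (empty list -> NaN)
        .isin(targets)                   # True where the element is a target
        .groupby(level=0, sort=False)    # regroup by the original row index
        .any()                           # True if any element of the row matched
        .map({True: "Y", False: "N"})
    )


def flag_by_sets(brands, targets):
    """Option 2: per-row set intersection; an empty intersection is falsy."""
    target_set = set(targets)            # build the set once, not once per row
    return brands.apply(lambda x: "Y" if set(x) & target_set else "N")


flags_explode = flag_by_explode(df["BRAND"], target_list)
flags_sets = flag_by_sets(df["BRAND"], target_list)

df["FLAG"] = flags_explode
print(df)
```

Both helpers return a Series aligned on the original index, so either result can be assigned straight to df['FLAG']. Building the target set once outside the lambda is a small change from the one-liner in Option 2 and does not alter the result.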


Answer