How can I use python conditionals to map columns in a dataframe with duplicates in them?

Question

I am trying to create a mapping where there are duplicates in certain columns in a dataframe. Here are two examples of dataframes I am working with: Here is what I need; 3 conditional python logic that does: when we see the first issue_status of 100 and trading_state of None, map F in the reason column. when …

Accepted Answer

You can filter by 400 and None values for df1, create helper Series with range and mapping last and second last values, for first 100 and None values use Series.duplicated, last join both Series by Series.combine_first:#if None is string#m1 = df['trading_state'].eq('None')m1 = df['trading_state'].isna()m2 = df['issue_status'].eq(400)m3 = df['issue_status'].eq(100)df1 = df[m1 & m2].copy()s1 = pd.Series(range(len(df1), 0, -1), index=df1.index).map({1:'L', 2:'SL'})s2 = df.loc[m1 & m3, 'issue_status'].copy().duplicated().map({False:'F'})df['reason'] = s1.combine_first(s2)print (df)   issue_status trading_state reason0           100          'A0'    NaN1           100          None      F2           400          None    NaN3           100          None    NaN4           400          None     SL5           100          'B2'    NaN6           400          None      L7           100          None    NaN8           400          'A6'    NaNFor second:df['reason'] = s1.combine_first(s2)print (df)   issue_status trading_state reason0           400          None     SL1           100          'A0'    NaN2           400          None      L3           400          'A0'    NaN4           100          None      F5           100          None    NaNIf necessary empty strings in reason column use:df['reason'] = df['reason'].fillna('')

Advertisement

Answer