Apply multiple criteria to select current and prior row – Pandas

Question

I have a dataframe like the one shown below. I would like to select rows based on the following criteria:

criteria 1 - pick all rows where source-system = I

criteria 2 - pick the prior row (n-1) only when the source-system of the (n-1)th row is O and diff is zero. Criteria 2 should be applied only when the nth row has source-system =

Accepted Answer

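I prefer not to use a one-line solution, because it becomes hard to read once the code gets more complicated, so it is better to build each condition as a separate boolean mask:

    # criteria 1: the row itself has source 'I'
    m1 = df['visit_source_value'] == 'I'
    # criteria 2, part 1: the row's diff is zero (or less)
    m2 = df['r_diff'] <= 0
    # criteria 2, part 2: the next row of the same person has source 'I'
    m3 = df.groupby('person_id')['visit_source_value'].shift(-1) == 'I'

    df = df[m1 | (m2 & m3)]
    print(df)

Output:

        person_id visit_source_value  r_diff
    5           2                  I    20.0
    7           2                  O     0.0
    8           2                  I    21.0
    11          2                  O     0.0
    12          2                  I    12.0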
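The question's original dataframe is not reproduced here, so the following is only a minimal sketch on made-up data (the `sample` frame and its values are assumptions for illustration; the column names follow the answer above). It shows why the `groupby('person_id')` before `shift(-1)` matters: without it, the last row of one person would be compared with the first row of the next person.

```python
import pandas as pd

# Hypothetical sample data, not the dataframe from the question.
sample = pd.DataFrame({
    'person_id':          [1,   1,   1,   2,   2,   2],
    'visit_source_value': ['O', 'I', 'O', 'I', 'O', 'I'],
    'r_diff':             [0.0, 5.0, 0.0, 7.0, 3.0, 2.0],
})

# criteria 1: the row itself has source 'I'
m1 = sample['visit_source_value'] == 'I'
# criteria 2, part 1: the row's diff is zero (or less)
m2 = sample['r_diff'] <= 0
# criteria 2, part 2: the next row *within the same person* has source 'I'
m3 = sample.groupby('person_id')['visit_source_value'].shift(-1) == 'I'

selected = sample[m1 | (m2 & m3)]
print(selected)

# For comparison only: shifting without groupby looks across person boundaries.
m3_ungrouped = sample['visit_source_value'].shift(-1) == 'I'
selected_ungrouped = sample[m1 | (m2 & m3_ungrouped)]
print(selected_ungrouped)
```

In this sketch row 2 (person 1, source O, diff 0) is the last row of its person, so the grouped version leaves it out, while the ungrouped version picks it up because the following row belongs to person 2 and has source I. Row 4 is dropped in both cases because its diff is not zero.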

Answer