Efficient chaining of boolean indexers in pandas DataFrames

Question

I am trying to efficiently chain a variable number of boolean pandas Series, to be used as a filter on a DataFrame through boolean indexing. Normally, when dealing with multiple boolean conditions, one chains them like this, but this becomes a problem with a variable number of conditions. I have tried out some possible solutions, but I am convinced

Accepted Answer

Use np.logical_and:

    import pandas as pd
    import numpy as np

    df = pd.DataFrame({'A': [0, 1, 2], 'B': [0, 1, 2], 'C': [0, 1, 2]})

    m1 = df.A > 0
    m2 = df.B <= 1
    m3 = df.C == 1
    m = np.logical_and.reduce([m1, m2, m3])
    # OR m = np.all([m1, m2, m3], axis=0)

    out = df[np.logical_and.reduce([m1, m2, m3])]

Output:

    >>> pd.concat([m1, m2, m3], axis=1)
           A      B      C
    0  False   True  False
    1   True   True   True
    2   True  False  False

    >>> m
    array([False,  True, False])

    >>> out
       A  B  C
    1  1  1  1
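Since the question is about a variable number of conditions, the same reduction can be wrapped in a small helper. This is only a sketch: `combine_masks` is a hypothetical name, not a pandas or NumPy function, and it assumes every mask is a boolean Series with the same length and row order as the DataFrame. The empty-list branch is a design choice made here (no conditions means keep every row), not something stated in the answer.

```python
import numpy as np
import pandas as pd


def combine_masks(masks, index):
    """AND together any number of boolean Series (hypothetical helper)."""
    masks = list(masks)
    if not masks:
        # Assumption: with no conditions, every row is kept.
        return pd.Series(True, index=index)
    # Reduce across the list of masks (axis 0), element by element.
    combined = np.logical_and.reduce([m.to_numpy() for m in masks])
    # Re-attach the index so the result can be used like any other mask.
    return pd.Series(combined, index=index)


df = pd.DataFrame({'A': [0, 1, 2], 'B': [0, 1, 2], 'C': [0, 1, 2]})

# The list can be built dynamically and have any length.
conditions = [df.A > 0, df.B <= 1, df.C == 1]
mask = combine_masks(conditions, df.index)
filtered = df[mask]
```

Because the masks are converted to NumPy arrays before reducing, they are combined by position rather than by index label, so masks built from a differently ordered frame would need to be reindexed first.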

Answer