How do I filter multi-level columns using notnull() in pandas?

Question

I generate a multi-index dataframe that has some NAN values using this: Which will create something like this: I'd like to get rows of a specific subset of top-level columns (eg df[['baz','qux']]) that have no nulls. For example in df[['baz','qux']] I'd like to get rows 0 and 1 since they both have all nulls in 3. Hoping things would just

Accepted Answer

df[cols].notna() is not a 1D boolean mask. You have to reduce the dimension using all or any on axis.>>> df[df[cols].notna().all(1)]        bar                 baz                 foo                 qux        one       two       one       two       one       two       one       two0  1.799680 -0.901705 -1.575930  0.185863 -0.793007  1.485423       NaN       NaN2  1.379878 -0.748599  0.661697 -1.015311 -0.858144       NaN -1.623013  0.340043

Advertisement

Answer