Drop Non-equivalent Multiindex Rows in Pandas Dataframe

Question

Goal: Drop a row if the min and max sub-columns differ from each other in any of the top-level columns (ao, hia, cyp1a2s, cyp3a4s in this case). A row is kept only when min equals max in every one of them.

Note: The actual dataframe has 50+ columns.

Accepted Answer

Use DataFrame.xs to select a DataFrame for each label in the second level of the column MultiIndex, and replace NaNs with a placeholder string, because NaN never compares equal to NaN:

    df1 = df.xs('min', axis=1, level=1).fillna('nan')
    df2 = df.xs('max', axis=1, level=1).fillna('nan')

Or convert the data to strings:

    df1 = df.xs('min', axis=1, level=1).astype('str')
    df2 = df.xs('max', axis=1, level=1).astype('str')

Compare the two DataFrames with DataFrame.eq, test whether all values in each row are True with DataFrame.all, and finally filter by boolean indexing:

    df = df[df1.eq(df2).all(axis=1)]
    print(df)

        ao       hia      cyp1a2s     cyp3a4s
       min  max  min  max     min max     min  max
    1  1.0  1.0  0.0  0.0     NaN NaN     0.0  0.0
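The question's original example data is not shown here, so the following is only a minimal, self-contained sketch of the approach in this answer. The column names and row 1 match the printed output above; rows 0 and 2 are made-up sample data, and the names `mins`, `maxs`, `keep`, and `result` are illustrative.

```python
import numpy as np
import pandas as pd

# Two-level column index: top-level names taken from the output above,
# each with a 'min' and a 'max' sub-column.
columns = pd.MultiIndex.from_product(
    [["ao", "hia", "cyp1a2s", "cyp3a4s"], ["min", "max"]]
)

# Rows 0 and 2 are assumed sample data; row 1 mirrors the answer's output.
df = pd.DataFrame(
    [
        [0.0, 1.0, 0.0, 0.0, np.nan, np.nan, 0.0, 0.0],  # ao: min != max
        [1.0, 1.0, 0.0, 0.0, np.nan, np.nan, 0.0, 0.0],  # min == max everywhere
        [1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0],        # hia: min != max
    ],
    columns=columns,
)

# Cross-sections on the second column level; both results share the
# top-level names as columns, so they can be compared element-wise.
# NaN is replaced by a placeholder string so that NaN/NaN pairs count as equal.
mins = df.xs("min", axis=1, level=1).fillna("nan")
maxs = df.xs("max", axis=1, level=1).fillna("nan")

# True for rows where min equals max in every top-level column.
keep = mins.eq(maxs).all(axis=1)

result = df[keep]
print(result)
```

Because the comparison runs over whole cross-sections rather than named columns, the same code should apply unchanged to a frame with 50+ top-level columns, provided each one has both a min and a max sub-column.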

