Skip to content
Advertisement

get all pairs of columns where only one value in third column

I am try to get all pairs of columns where a third column has only one value, such that (given pair a,b and third column c):

JavaScript

only returns 1,2 and 2,1 (the results from the last two rows). The first two rows are excluded since they describe the same pair but with different values in the third column.

To be extra clear: df.groupby(['a', 'b'])['c'].nunique() above would yield:

JavaScript

and df.groupby(['a', 'b'])['c'].nunique() == 1 yields:

JavaScript

I want the a,b values for each pair that is true in that condition. I was thinking:

JavaScript

but this is clearly not right as a condition on the non-grouped df.

JavaScript

It also doesn’t work like this:

JavaScript

Advertisement

Answer

Once you get the GroupBy result, you can use a boolean condition to filter the index to get a list of tuples:

JavaScript

This works because the GroupBy result uses the grouper columns (“a” and “b” in this case) as its index (since there are two columns, you get a MultiIndex with two levels in the result)

User contributions licensed under: CC BY-SA
6 People found this is helpful
Advertisement