Pandas, get all possible value combinations of length k grouped by feature

Question

I have a Pandas dataframe something like: Feature A Feature B Feature C A1 B1 C1 A2 B2 C2 Given k as input, i want all values combination grouped by feature of length k, for example for k = 2 I want: How can I achieve that? Answer This is probably not that efficient but it works for small scale.

Accepted Answer

This is probably not that efficient but it works for small scale.First, determine the unique combinations of k columns.from itertools import combinationsk = 2cols = list(combinations(df.columns, k))Then use MultiIndex.from_product to get cartesian product of k columns.result = []for c in cols:    result += pd.MultiIndex.from_product([df[x] for x in c]).values.tolist()

Advertisement

Answer