In Pandas, how to group by column name and condition met, while joining the cells that met the condition in a single cell

Question

I am having a hard time knowing how to even formulate this question, but this is what I am trying to accomplish: I have a pandas datatable with thousands of rows that look like this: id text value1 value2 1 These are the True False 2 Values of &#8220;value1&#8221; True False 3 While these others False True 4 …

Accepted Answer

You could set_index with &#8220;id&#8221; and &#8220;text&#8221;; then stack df. Then (i) filter the Series by itself; (ii) groupby &#8220;value&#8221; and join &#8220;text&#8221;:s = df.set_index(['id','text']).stack()out = s[s].reset_index(level=1).groupby(level=1)['text'].apply(' '.join).reset_index()Output:    index                                           text0  value1               These are the Values of "value1"1  value2  While these others are the Values of "value2"

values	merge_text
value1	These are the Values of “value1”
value2	While these others are the Values of “value2”

id	text	value1	value2
1	These are the	True	False
2	Values of “value1”	True	False
3	While these others	False	True
4	are the Values of “value2”	False	True

Advertisement

Answer