Unstack and return value counts for each variable?

Question

I have a data frame that records the programming-language choices of 19717 respondents to multiple-choice questions. The first column is the respondent's gender, and the remaining columns are the choices they picked. The data frame is shown below, with each response recorded under the same name as its column. If no response is selected,

Accepted Answer

Another idea would be to join each row's non-null values along axis 1 with apply, call str.get_dummies on the joined strings, then group by Gender and sum:

    (df.loc[:, 'Python':]
       .apply(lambda x: '|'.join(x.dropna()), axis=1)
       .str.get_dummies('|')
       .groupby(df['Gender']).sum())

[out]

                       Bash  C++  JavaScript  Python  R
    Gender
    Female                0    1           1       0  1
    Male                  0    0           1       1  0
    Prefer not to say     1    0           0       1  0
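To try the chain end to end, here is a minimal self-contained sketch. The sample frame is hypothetical: the question's actual data is not reproduced here, so the three rows and the column order (Python first, which is what the `'Python':` slice relies on) are assumptions chosen to be consistent with the output above. The `notna()` variant at the end is not part of the answer; it is a simpler cross-check that works only under the assumption that every non-null cell equals its column name.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: each selected response repeats its column name,
# and an unselected choice is NaN (an assumption about the original frame).
df = pd.DataFrame({
    "Gender": ["Female", "Male", "Prefer not to say"],
    "Python": [np.nan, "Python", "Python"],
    "R": ["R", np.nan, np.nan],
    "C++": ["C++", np.nan, np.nan],
    "JavaScript": ["JavaScript", "JavaScript", np.nan],
    "Bash": [np.nan, np.nan, "Bash"],
})

counts = (
    df.loc[:, "Python":]                                  # language columns only
      .apply(lambda x: "|".join(x.dropna()), axis=1)      # e.g. "R|C++|JavaScript"
      .str.get_dummies("|")                               # one 0/1 column per language
      .groupby(df["Gender"]).sum()                        # selections per gender
)
print(counts)

# Cross-check (a swapped-in alternative, not the answer's method):
# count non-null cells per column within each gender.
alt = df.loc[:, "Python":].notna().groupby(df["Gender"]).sum()
print(alt[counts.columns])
```

Note that `str.get_dummies` returns its columns in sorted order, so the result is ordered Bash, C++, JavaScript, Python, R regardless of the column order in the input frame.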

Answer