Make dummy variable for categorical data, based on ID column with duplicate values in python

Question

I have the following pandas dataframe: I want to make dummy variables for the values in the column &#8216;value&#8217;, for each value in the column &#8216;ID&#8217;. So I want it this: How can I do this in python? Answer Use crosstab with limit counts to 1 by DataFrame.clip:

Accepted Answer

Use crosstab with limit counts to 1 by DataFrame.clip:df1  = (pd.crosstab(df['ID'], df['value'])          .clip(upper=1)          .reset_index()          .rename_axis(None, axis=1))print (df1)   ID  A  B  C0   1  1  1  11   2  0  1  02   4  1  0  13  10  0  0  1

Advertisement

Answer