Make a new column for each category in a particular column and repeat this for all columns in a Pandas dataframe

Question

I have a dataset like below-: I want new columns for each category in all columns for each state. An example of a row is below-: EDIT Data dump of 1st 5 rows as asked-: Answer Use pd.get_dummies + Groupby.sum(), as follows: Result: If you want to exclude the entries with value NA, you can use: Result:

Accepted Answer

Use pd.get_dummies + Groupby.sum(), as follows:(pd.get_dummies(df.set_index('state'))   .groupby('state').sum()   .reset_index())Result:           state  population_0-50  population_100-150  population_150-200  population_50-100  population_NA  locale_NA  locale_rural  locale_suburb  locale_town  locale_urban0     California                1                   0                   1                  2              0          0             1              1            1             11        Florida                2                   1                   1                  0              1          1             1              2            0             12      Minnesota                0                   1                   0                  1              1          0             0              0            2             13  New Hampshire                0                   0                   0                  0              1          0             1              0            0             0If you want to exclude the entries with value NA, you can use:(pd.get_dummies(df[df != 'NA'].set_index('state'))   .groupby('state').sum()   .reset_index())Result:           state  population_0-50  population_100-150  population_150-200  population_50-100  locale_rural  locale_suburb  locale_town  locale_urban0     California                1                   0                   1                  2             1              1            1             11        Florida                2                   1                   1                  0             1              2            0             12      Minnesota                0                   1                   0                  1             0              0            2             13  New Hampshire                0                   0                   0                  0             1              0            0             0

Advertisement

Answer